Prefaces


Preface to the Second English Edition 

Science has not stood still in the years since the first English edition of this book
was published. For example, Fermat's last theorem has been proved, the Poincaré
conjecture is now a theorem, and the Higgs boson has been discovered. Other events
in science, while not directly related to the contents of a textbook in classical
mathematical analysis, have indirectly led the author to learn something new, to think
over something familiar, or to extend his knowledge and understanding. All of this
additional knowledge and understanding ends up being useful even when one speaks
about something apparently completely unrelated.¹

In addition to the original Russian edition, the book has been published in En- 
glish, German, and Chinese. Various attentive multilingual readers have detected 
many errors in the text. Luckily, these are local errors, mostly misprints. They have 
assuredly all been corrected in this new edition. 

But the main difference between the second and first English editions is the addi- 
tion of a series of appendices to each volume. There are six of them in the first and 
five of them in the second. So as not to disturb the original text, they are placed at the 
end of each volume. The subjects of the appendices are diverse. They are meant to be 
useful to students (in mathematics and physics) as well as to teachers, who may be 
motivated by different goals. Some of the appendices are surveys, both prospective 
and retrospective. The final survey contains the most important conceptual achieve- 
ments of the whole course, which establish connections between analysis and other 
parts of mathematics as a whole. 


¹ There is a story about Erdős, who, like Hadamard, lived a very long mathematical and human
life. When he was quite old, a journalist who was interviewing him asked him about his age. Erdős
replied, after deliberating a bit, "I remember that when I was very young, scientists established that
the Earth was two billion years old. Now scientists assert that the Earth is four and a half billion
years old. So I am approximately two and a half billion years old."
I was happy to learn that this book has proven to be useful, to some extent, not
only to mathematicians, but also to physicists, and even to engineers from technical 
schools that promote a deeper study of mathematics. 

It is a real pleasure to see a new generation that thinks bigger, understands more 
deeply, and is able to do more than the generation on whose shoulders it grew. 

Moscow, Russia V. Zorich 

2015 


Preface to the First English Edition 

An entire generation of mathematicians has grown up during the time between the 
appearance of the first edition of this textbook and the publication of the fourth 
edition, a translation of which is before you. The book is familiar to many people, 
who either attended the lectures on which it is based or studied out of it, and who 
now teach others in universities all over the world. I am glad that it has become 
accessible to English-speaking readers. 

This textbook consists of two parts. It is aimed primarily at university students 
and teachers specializing in mathematics and natural sciences, and at all those who 
wish to see both the rigorous mathematical theory and examples of its effective use 
in the solution of real problems of natural science. 

The textbook presents classical analysis as it is today, as an integral part of
mathematics in its interrelations with other modern mathematical courses such as algebra,
differential geometry, differential equations, and complex and functional analysis.

The two chapters with which this second book begins summarize and explain in
a general form essentially all of the most important results of the first volume concerning
continuous and differentiable functions, as well as differential calculus. The presence
of these two chapters makes the second book formally independent of the first
one. This assumes, however, that the reader is sufficiently well prepared to get by
without the introductory considerations of the first part, which preceded the resulting
formalism discussed here. This second book, containing both the differential calculus
in its generalized form and the integral calculus of functions of several variables,
developed up to the general Newton-Leibniz-Stokes formula, thus acquires a
certain unity and becomes more self-contained.

More complete information on the textbook and some recommendations for its 
use in teaching can be found in the translations of the prefaces to the first and second 
Russian editions. 


Moscow, Russia V. Zorich

2003
Preface to the Sixth Russian Edition

On my own behalf and on behalf of future readers, I thank all those, living in
different countries, who took the opportunity to inform the publisher or me personally
about errors (typos, mistakes, omissions) found in the Russian, English, German, and
Chinese editions of this textbook.

As it turned out, the book has also been very useful to physicists; I am very
happy about that. In any case, I really do try to accompany the formal theory with
meaningful examples of its application both in mathematics and outside of it.

The sixth edition contains a series of appendices that may be useful to students
and lecturers. First, some of the material consists of actual lectures (for example,
the transcription of two introductory survey lectures for students of the first and third
semesters); second, some of it is mathematical information (sometimes of current
interest, such as the relation between multidimensional geometry and the theory
of probability) lying close to the main subject of the textbook.

Moscow, Russia V. Zorich 

2011 


Prefaces to the Fifth, Fourth, Third and Second Russian Editions 

In the fifth edition all misprints of the fourth edition have been corrected. 

Moscow, Russia V. Zorich 

2006 

In the fourth edition all misprints that the author is aware of have been corrected. 

Moscow, Russia V. Zorich 

2002 

The third edition differs from the second only in local corrections (although in 
one case it also involves the correction of a proof) and in the addition of some 
problems that seem to me to be useful. 

Moscow, Russia V. Zorich 

2001 

In addition to the correction of all the misprints in the first edition of which the 
author is aware, the differences between the second edition and the first edition of 
this book are mainly the following. Certain sections on individual topics - for ex- 
ample, Fourier series and the Fourier transform - have been recast (for the better, 
I hope). We have included several new examples of applications and new substantive 
problems relating to various parts of the theory and sometimes significantly extend- 
ing it. Test questions are given, as well as questions and problems from the midterm 
examinations. The list of further readings has been expanded. 
Further information on the material and some characteristics of this second part
of the course are given below in the preface to the first edition. 

Moscow, Russia V. Zorich 

1998 


Preface to the First Russian Edition 

The preface to the first part contained a rather detailed characterization of the course 
as a whole, and hence I confine myself here to some remarks on the content of the 
second part only. 

The basic material of the present volume consists on the one hand of multiple in- 
tegrals and line and surface integrals, leading to the generalized Stokes’ formula and 
some examples of its application, and on the other hand the machinery of series and 
integrals depending on a parameter, including Fourier series, the Fourier transform, 
and the presentation of asymptotic expansions. 

Thus, this Part 2 basically conforms to the curriculum of the second year of study 
in the mathematics departments of universities. 

So as not to impose rigid restrictions on the order of presentation of these two 
major topics during the two semesters, I have discussed them practically indepen- 
dently of each other. 

Chapters 9 and 10, with which this book begins, reproduce in compressed and
generalized form essentially all of the most important results that were obtained
in the first part concerning continuous and differentiable functions. These chapters
are starred and written as an appendix to Part 1. This appendix contains, however,
many concepts that play a role in any exposition of analysis to mathematicians. 
The presence of these two chapters makes the second book formally independent 
of the first, provided the reader is sufficiently well prepared to get by without the 
numerous examples and introductory considerations that, in the first part, preceded 
the formalism discussed here. 

The main new material in the book, which is devoted to the integral calculus of 
several variables, begins in Chap. 11. One who has completed the first part may
begin the second part of the course at this point without any loss of continuity in the 
ideas. 

The language of differential forms is explained and used in the discussion of the 
theory of line and surface integrals. All the basic geometric concepts and analytic 
constructions that later form a scale of abstract definitions leading to the generalized 
Stokes’ formula are first introduced by using elementary material. 

Chapter 15 is devoted to a similar summary exposition of the integration of differential
forms on manifolds. I regard this chapter as a very desirable and systematizing
supplement to what was expounded and explained using specific objects in
the mandatory Chaps. 11-14.

The section on series and integrals depending on a parameter gives, along with 
the traditional material, some elementary information on asymptotic series and 
asymptotics of integrals (Chap. 19), since, due to its effectiveness, the latter is an
unquestionably useful piece of analytic machinery. 

For convenience in orientation, ancillary material or sections that may be omitted
on a first reading are starred.

The numbering of the chapters and figures in this book continues the numbering 
of the first part. 

Biographical information is given here only for those scholars not mentioned in 
the first part. 

As before, for the convenience of the reader, and to shorten the text, the end of a 
proof is denoted by □. Where convenient, definitions are introduced by the special 
symbols := or =: (equality by definition), in which the colon stands on the side of 
the object being defined. 

Continuing the tradition of Part 1, a great deal of attention has been paid to both 
the lucidity and logical clarity of the mathematical constructions themselves and the 
demonstration of substantive applications in natural science for the theory devel- 
oped. 

Moscow, Russia V. Zorich 

1982 
Chapter 9

*Continuous Mappings (General Theory)


In this chapter we shall generalize the properties of continuous mappings established
earlier for numerical-valued functions and mappings of the type $f : \mathbb{R}^m \to \mathbb{R}^n$ and
discuss them from a unified point of view. In the process we shall introduce a number
of simple, yet important concepts that are used everywhere in mathematics.


9.1 Metric Spaces 

9.1.1 Definition and Examples 

Definition 1 A set $X$ is said to be endowed with a metric or a metric space structure
or to be a metric space if a function

$$d : X \times X \to \mathbb{R} \tag{9.1}$$

is exhibited satisfying the following conditions:

a) $d(x_1, x_2) = 0 \Leftrightarrow x_1 = x_2$,

b) $d(x_1, x_2) = d(x_2, x_1)$ (symmetry),

c) $d(x_1, x_3) \le d(x_1, x_2) + d(x_2, x_3)$ (the triangle inequality),

where $x_1, x_2, x_3$ are arbitrary elements of $X$.

In that case, the function (9.1) is called a metric or distance on $X$.

Thus a metric space is a pair $(X, d)$ consisting of a set $X$ and a metric defined on
it.

In accordance with geometric terminology the elements of $X$ are called points.
We remark that if we set $x_3 = x_1$ in the triangle inequality and take account of
conditions a) and b) in the definition of a metric, we find that

$$0 \le d(x_1, x_2),$$

that is, a distance satisfying axioms a), b), and c) is nonnegative.
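
Written out, the substitution in this remark can be sketched in one line; each step uses only the axioms named beside it:

```latex
0 \overset{\text{a)}}{=} d(x_1, x_1)
  \overset{\text{c)}}{\le} d(x_1, x_2) + d(x_2, x_1)
  \overset{\text{b)}}{=} 2\, d(x_1, x_2)
\quad\Longrightarrow\quad d(x_1, x_2) \ge 0 .
```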
Let us now consider some examples.

Example 1 The set $\mathbb{R}$ of real numbers becomes a metric space if we set $d(x_1, x_2) =
|x_2 - x_1|$ for any two numbers $x_1$ and $x_2$, as we have always done.

Example 2 Other metrics can also be introduced on $\mathbb{R}$. A trivial metric, for example,
is the discrete metric in which the distance between any two distinct points is 1.

The following metric on $\mathbb{R}$ is much more substantive. Let $x \mapsto f(x)$ be a nonnegative
function defined for $x \ge 0$ and vanishing for $x = 0$. If this function is strictly
convex upward, then, setting

$$d(x_1, x_2) = f(|x_1 - x_2|) \tag{9.2}$$

for points $x_1, x_2 \in \mathbb{R}$, we obtain a metric on $\mathbb{R}$.

Axioms a) and b) obviously hold here, and the triangle inequality follows from
the easily verified fact that $f$ is strictly monotonic and satisfies the following
inequalities for $0 < a < b$:

$$f(a + b) - f(b) < f(a) - f(0) = f(a).$$

In particular, one could set $d(x_1, x_2) = \sqrt{|x_1 - x_2|}$ or
$d(x_1, x_2) = \frac{|x_1 - x_2|}{1 + |x_1 - x_2|}$. In
the latter case the distance between any two points of the line is less than 1.
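
As a numerical sanity check of Example 2 (not a proof), the following minimal sketch tests the triangle inequality for metrics of the form (9.2) on a handful of sample points. The helper names `make_metric` and `triangle_holds`, the sample points, and the counterexample function $t \mapsto t^2$ are illustrative assumptions, not part of the text.

```python
import itertools
import math


def make_metric(f):
    """Return the function d(x, y) = f(|x - y|) of formula (9.2)."""
    return lambda x, y: f(abs(x - y))


def triangle_holds(d, points, tol=1e-12):
    """Check d(x, z) <= d(x, y) + d(y, z) on every triple of sample points."""
    return all(
        d(x, z) <= d(x, y) + d(y, z) + tol
        for x, y, z in itertools.product(points, repeat=3)
    )


# Arbitrary sample points on the real line (an assumption for illustration).
sample = [-3.0, -1.5, -0.25, 0.0, 0.5, 2.0, 7.0]

d_sqrt = make_metric(math.sqrt)                   # f(t) = sqrt(t), convex upward
d_bounded = make_metric(lambda t: t / (1.0 + t))  # f(t) = t / (1 + t), convex upward
d_square = make_metric(lambda t: t * t)           # f(t) = t^2, NOT convex upward

print(triangle_holds(d_sqrt, sample))     # True
print(triangle_holds(d_bounded, sample))  # True
print(triangle_holds(d_square, sample))   # False: e.g. x = -3, y = 0, z = 7
```

The last line shows why the convexity assumption matters: for $f(t) = t^2$ the triple $-3, 0, 7$ gives $100 > 9 + 49$.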

Example 3 Besides the traditional distance

$$d(x_1, x_2) = \sqrt{\sum_{i=1}^{n} |x_1^i - x_2^i|^2} \tag{9.3}$$

between points $x_1 = (x_1^1, \ldots, x_1^n)$ and $x_2 = (x_2^1, \ldots, x_2^n)$ in $\mathbb{R}^n$, one can also
introduce the distance

$$d_p(x_1, x_2) = \left( \sum_{i=1}^{n} |x_1^i - x_2^i|^p \right)^{1/p}, \tag{9.4}$$

where $p \ge 1$. The validity of the triangle inequality for the function (9.4) follows
from Minkowski's inequality (see Sect. 5.4.2).


Example 4 When we encounter a word with incorrect letters while reading a text, 
we can reconstruct the word without too much trouble by correcting the errors, 
provided the number of errors is not too large. However, correcting the error and 
obtaining the word is an operation that is sometimes ambiguous. For that reason, 
other conditions being equal, one must give preference to the interpretation of the 
incorrect text that requires the fewest corrections. Accordingly, in coding theory 
the metric (9.4) with $p = 1$ is used on the set of all finite sequences of length $n$
consisting of zeros and ones.

Geometrically the set of such sequences can be interpreted as the set of vertices
of the unit cube $I = \{x \in \mathbb{R}^n \mid 0 \le x^i \le 1,\ i = 1, \ldots, n\}$ in $\mathbb{R}^n$. The distance
between two vertices is the number of interchanges of zeros and ones needed to
obtain the coordinates of one vertex from the other. Each such interchange represents
a passage along one edge of the cube. Thus this distance is the shortest path
along the edges of the cube from one of the vertices under consideration to the
other.
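
The decoding rule of Example 4 can be sketched in a few lines. This is only an illustration under stated assumptions: the function names and the three-word `codebook` are hypothetical, and ties are returned explicitly to reflect the ambiguity mentioned in the example.

```python
def hamming_distance(u, v):
    """Metric (9.4) with p = 1 on 0-1 sequences: the number of differing positions."""
    if len(u) != len(v):
        raise ValueError("sequences must have the same length n")
    return sum(abs(a - b) for a, b in zip(u, v))


def nearest_codewords(received, codebook):
    """Return all codewords at minimal distance from the received word (ties kept)."""
    best = min(hamming_distance(received, w) for w in codebook)
    return [w for w in codebook if hamming_distance(received, w) == best]


# A hypothetical code of length n = 5, chosen only for illustration.
codebook = [(0, 0, 0, 0, 0), (1, 1, 1, 1, 1), (1, 1, 0, 0, 0)]

print(nearest_codewords((1, 0, 1, 1, 0), codebook))  # [(1, 1, 1, 1, 1)]
print(nearest_codewords((1, 0, 0, 0, 0), codebook))  # two codewords: ambiguous
```

In the second call two codewords lie at distance 1 from the received word, so the "fewest corrections" rule alone does not single out an interpretation.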

Example 5 In comparing the results of two series of $n$ measurements of the same
quantity the metric most commonly used is (9.4) with $p = 2$. The distance between
points in this metric is usually called their mean-square deviation.

Example 6 As one can easily see, if we pass to the limit in (9.4) as $p \to +\infty$, we
obtain the following metric in $\mathbb{R}^n$:

$$d(x_1, x_2) = \max_{1 \le i \le n} |x_1^i - x_2^i|. \tag{9.5}$$
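
One way to see the limit in Example 6 is the following two-sided estimate, given here as a short sketch; the abbreviation $a_i := |x_1^i - x_2^i|$ is introduced only for this derivation:

```latex
\max_{1 \le i \le n} a_i
  \;\le\; \Bigl( \sum_{i=1}^{n} a_i^p \Bigr)^{1/p}
  \;\le\; \Bigl( n \max_{1 \le i \le n} a_i^p \Bigr)^{1/p}
  \;=\; n^{1/p} \max_{1 \le i \le n} a_i ,
\qquad n^{1/p} \to 1 \ \text{as } p \to +\infty .
```

Both bounds tend to $\max_i a_i$, so the quantity (9.4) squeezed between them does as well.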

Example 7 The set $C[a, b]$ of functions that are continuous on a closed interval
becomes a metric space if we define the distance between two functions $f$ and $g$ to
be

$$d(f, g) = \max_{a \le x \le b} |f(x) - g(x)|. \tag{9.6}$$

Axioms a) and b) for a metric obviously hold, and the triangle inequality follows
from the relations

$$|f(x) - h(x)| \le |f(x) - g(x)| + |g(x) - h(x)| \le d(f, g) + d(g, h),$$

that is,

$$d(f, h) = \max_{a \le x \le b} |f(x) - h(x)| \le d(f, g) + d(g, h).$$

The metric (9.6), the so-called uniform or Chebyshev metric in $C[a, b]$, is
used when we wish to replace one function by another (for example, a polynomial)
from which it is possible to compute the values of the first with a required degree of
precision at any point $x \in [a, b]$. The quantity $d(f, g)$ is precisely a characterization
of the precision of such an approximate computation.

The metric (9.6) closely resembles the metric (9.5) in $\mathbb{R}^n$.
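
To illustrate how the uniform metric (9.6) measures the precision of an approximation, here is a minimal sketch that estimates $d(\sin, T)$ on $[0, 1]$, where $T$ is the degree-5 Taylor polynomial of the sine. The choice of functions, the interval, and the grid size are assumptions made for illustration; a maximum over a finite grid only bounds the true maximum from below.

```python
import math


def uniform_distance(f, g, a, b, samples=10_001):
    """Approximate max |f(x) - g(x)| over [a, b] on an evenly spaced grid."""
    step = (b - a) / (samples - 1)
    return max(abs(f(a + i * step) - g(a + i * step)) for i in range(samples))


def taylor_sin(x):
    """Degree-5 Taylor polynomial of sin at 0."""
    return x - x**3 / 6 + x**5 / 120


err = uniform_distance(math.sin, taylor_sin, 0.0, 1.0)
# The alternating-series estimate gives |sin x - T(x)| <= x**7 / 5040 on [0, 1].
print(err <= 1 / 5040)
```

The printed comparison confirms that replacing $\sin$ by this polynomial on $[0, 1]$ costs at most $1/5040$ at every point, which is the kind of guarantee the metric (9.6) expresses.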

Example 8 Like the metric (9.4), for $p \ge 1$ we can introduce in $C[a, b]$ the metric

$$d_p(f, g) = \left( \int_a^b |f - g|^p(x)\,dx \right)^{1/p}. \tag{9.7}$$

It follows from Minkowski's inequality for integrals, which can be obtained from
Minkowski's inequality for the Riemann sums by passing to the limit, that this is
indeed a metric for $p \ge 1$.

The following special cases of the metric (9.7) are especially important: $p = 1$,
which is the integral metric; $p = 2$, the metric of mean-square deviation; and
$p = +\infty$, the uniform metric.

The space $C[a, b]$ endowed with the metric (9.7) is often denoted $C_p[a, b]$. One
can verify that $C_\infty[a, b]$ is the space $C[a, b]$ endowed with the metric (9.6).

Example 9 The metric (9.7) could also have been used on the set $\mathcal{R}[a, b]$ of
Riemann-integrable functions on the closed interval $[a, b]$. However, since the integral
of the absolute value of the difference of two functions may vanish even when
the two functions are not identically equal, axiom a) will not hold in this case.
Nevertheless, we know that the integral of a nonnegative function $\varphi \in \mathcal{R}[a, b]$ equals
zero if and only if $\varphi(x) = 0$ at almost all points of the closed interval $[a, b]$.

Therefore, if we partition $\mathcal{R}[a, b]$ into equivalence classes of functions, regarding
two functions in $\mathcal{R}[a, b]$ as equivalent if they differ on at most a set of measure zero,
then the relation (9.7) really does define a metric on the set $\widetilde{\mathcal{R}}[a, b]$ of such
equivalence classes. The set $\widetilde{\mathcal{R}}[a, b]$ endowed with this metric will be denoted $\widetilde{\mathcal{R}}_p[a, b]$
and sometimes simply by $\mathcal{R}_p[a, b]$.

Example 10 In the set $C^{(k)}[a, b]$ of functions defined on $[a, b]$ and having continuous
derivatives up to order $k$ inclusive one can define the following metric:

$$d(f, g) = \max\{M_0, \ldots, M_k\}, \tag{9.8}$$

where

$$M_i = \max_{a \le x \le b} |f^{(i)}(x) - g^{(i)}(x)|, \quad i = 0, 1, \ldots, k.$$

Using the fact that (9.6) is a metric, one can easily verify that (9.8) is also a
metric.

Assume for example that $f$ is the coordinate of a moving point considered
as a function of time. If a restriction is placed on the allowable region where
the point can be during the time interval $[a, b]$ and the particle is not allowed
to exceed a certain speed, and, in addition, we wish to have some assurance
that the accelerations cannot exceed a certain level, it is natural to consider the
set $\{\max_{a \le x \le b} |f(x)|, \max_{a \le x \le b} |f'(x)|, \max_{a \le x \le b} |f''(x)|\}$ for a function
$f \in C^{(2)}[a, b]$ and, using these characteristics, to regard two motions $f$ and $g$ as close
together if the quantity (9.8) for them is small.

These examples show that a given set can be metrized in various ways. The 
choice of the metric to be introduced is usually controlled by the statement of the 
problem. At present we shall be interested in the most general properties of metric 
spaces, the properties that are inherent in all of them. 
9.1.2 Open and Closed Subsets of a Metric Space


Let $(X, d)$ be a metric space. In the general case, as was done for the case $X = \mathbb{R}^n$
in Sect. 7.1, one can also introduce the concept of a ball with center at a given point,
open set, closed set, neighborhood of a point, limit point of a set, and so forth.

Let us now recall these concepts, which are basic for what is to follow. 

Definition 2 For $\delta > 0$ and $a \in X$ the set

$$B(a, \delta) = \{x \in X \mid d(a, x) < \delta\}$$

is called the ball with center $a \in X$ of radius $\delta$ or the $\delta$-neighborhood of the point $a$.

This name is a convenient one in a general metric space, but it must not be identified
with the traditional geometric image we are familiar with in $\mathbb{R}^3$.

Example 11 The unit ball in $C[a, b]$ with center at the function that is identically 0
on $[a, b]$ consists of the functions that are continuous on the closed interval $[a, b]$
and whose absolute values are less than 1 on that interval.

Example 12 Let $X$ be the unit square in $\mathbb{R}^2$ for which the distance between two
points is defined to be the distance between those same points in $\mathbb{R}^2$. Then $X$ is a
metric space, while the square $X$ considered as a metric space in its own right can
be regarded as the ball of any radius $\rho > \sqrt{2}/2$ about its center.

It is clear that in this way one could construct balls of very peculiar shape. Hence 
the term ball should not be understood too literally. 

Definition 3 A set $G \subset X$ is open in the metric space $(X, d)$ if for each point $x \in G$
there exists a ball $B(x, \delta)$ such that $B(x, \delta) \subset G$.

It obviously follows from this definition that $X$ itself is an open set in $(X, d)$.
The empty set $\varnothing$ is also open. By the same reasoning as in the case of $\mathbb{R}^n$ one can
prove that a ball $B(a, r)$ and its exterior $\{x \in X : d(a, x) > r\}$ are open sets. (See
Examples 3 and 4 of Sect. 7.1.)

Definition 4 A set $F \subset X$ is closed in $(X, d)$ if its complement $X \setminus F$ is open in
$(X, d)$.

In particular, we conclude from this definition that the closed ball

$$\widetilde{B}(a, r) := \{x \in X \mid d(a, x) \le r\}$$

is a closed set in a metric space $(X, d)$.

The following proposition holds for open and closed sets in a metric space (X, d). 
Proposition 1 a) The union $\bigcup_{\alpha \in A} G_\alpha$ of the sets in any system $\{G_\alpha, \alpha \in A\}$ of sets
$G_\alpha$ that are open in $X$ is an open set in $X$.

b) The intersection $\bigcap_{i=1}^{n} G_i$ of any finite number of sets that are open in $X$ is an
open set in $X$.

a') The intersection $\bigcap_{\alpha \in A} F_\alpha$ of the sets in any system $\{F_\alpha, \alpha \in A\}$ of sets $F_\alpha$
that are closed in $X$ is a closed set in $X$.

b') The union $\bigcup_{i=1}^{n} F_i$ of any finite number of sets that are closed in $X$ is a closed
set in $X$.

The proof of Proposition 1 is a verbatim repetition of the proof of the corresponding
proposition for open and closed sets in $\mathbb{R}^n$, and we omit it. (See Proposition 1 in
Sect. 7.1.)

Definition 5 An open set in X containing the point x e X is called a neighborhood 
of the point x in X . 

Definition 6 Relative to a set $E \subset X$, a point $x \in X$ is called

an interior point of $E$ if some neighborhood of it is contained in $E$,

an exterior point of $E$ if some neighborhood of it is contained in the complement
of $E$ in $X$,

a boundary point of $E$ if it is neither interior nor exterior to $E$ (that is, every
neighborhood of the point contains both a point belonging to $E$ and a point not
belonging to $E$).

Example 13 All points of a ball $B(a, r)$ are interior to it, and the set
$C_X \widetilde{B}(a, r) = X \setminus \widetilde{B}(a, r)$ (the complement of the closed ball) consists of
points exterior to the ball $B(a, r)$.

In the case of $\mathbb{R}^n$ with the standard metric $d$ the sphere $S(a, r) := \{x \in \mathbb{R}^n \mid
d(a, x) = r > 0\}$ is the set of boundary points of the ball $B(a, r)$.¹

Definition 7 A point $a \in X$ is a limit point of the set $E \subset X$ if the set $E \cap O(a)$ is
infinite for every neighborhood $O(a)$ of the point.

Definition 8 The union of the set $E$ and the set of all its limit points is called the
closure of the set $E$ in $X$.

As before, the closure of a set $E \subset X$ will be denoted $\overline{E}$.

Proposition 2 A set $F \subset X$ is closed in $X$ if and only if it contains all its limit
points.

Thus

$$(F \text{ is closed in } X) \Longleftrightarrow (F = \overline{F} \text{ in } X).$$

We omit the proof, since it repeats the proof of the analogous proposition for the
case $X = \mathbb{R}^n$ discussed in Sect. 7.1.

¹ In connection with Example 13 see also Problem 2 at the end of this section.


9.1.3 Subspaces of a Metric Space 


If $(X, d)$ is a metric space and $E$ is a subset of $X$, then, setting the distance between
two points $x_1$ and $x_2$ of $E$ equal to $d(x_1, x_2)$, that is, the distance between them
in $X$, we obtain the metric space $(E, d)$, which is customarily called a subspace of
the original space $(X, d)$.

Thus we adopt the following definition. 

Definition 9 A metric space $(X_1, d_1)$ is a subspace of the metric space $(X, d)$ if
$X_1 \subset X$ and the equality $d_1(a, b) = d(a, b)$ holds for any pair of points $a, b$ in $X_1$.

Since the ball $B_1(a, r) = \{x \in X_1 \mid d_1(a, x) < r\}$ in a subspace $(X_1, d_1)$ of the
metric space $(X, d)$ is obviously the intersection

$$B_1(a, r) = X_1 \cap B(a, r)$$

of the set $X_1 \subset X$ with the ball $B(a, r)$ in $X$, it follows that every open set in $X_1$
has the form

$$G_1 = X_1 \cap G,$$

where $G$ is an open set in $X$, and every closed set $F_1$ in $X_1$ has the form

$$F_1 = X_1 \cap F,$$

where $F$ is a closed set in $X$.

It follows from what has just been said that the properties of a set in a metric 
space of being open or closed are relative properties and depend on the ambient 
space. 

Example 14 The open interval $|x| < 1$, $y = 0$ of the $x$-axis in the plane $\mathbb{R}^2$ with the
standard metric in $\mathbb{R}^2$ is a metric space $(X_1, d_1)$, which, like any metric space, is
closed as a subset of itself, since it contains all its limit points in $X_1$. At the same
time, it is obviously not closed in $\mathbb{R}^2 = X$.

This same example shows that openness is also a relative concept. 

Example 15 The set $C[a, b]$ of continuous functions on the closed interval $[a, b]$
with the metric (9.7) is a subspace of the metric space $\mathcal{R}_p[a, b]$. However, if we
consider the metric (9.6) on $C[a, b]$ rather than (9.7), this is no longer true.
9.1.4 The Direct Product of Metric Spaces

If $(X_1, d_1)$ and $(X_2, d_2)$ are two metric spaces, one can introduce a metric $d$ on the
direct product $X_1 \times X_2$. The commonest methods of introducing a metric in $X_1 \times X_2$
are the following. If $(x_1, x_2) \in X_1 \times X_2$ and $(x_1', x_2') \in X_1 \times X_2$, one may set

$$d((x_1, x_2), (x_1', x_2')) = \sqrt{d_1^2(x_1, x_1') + d_2^2(x_2, x_2')},$$

or

$$d((x_1, x_2), (x_1', x_2')) = d_1(x_1, x_1') + d_2(x_2, x_2'),$$

or

$$d((x_1, x_2), (x_1', x_2')) = \max\{d_1(x_1, x_1'), d_2(x_2, x_2')\}.$$

It is easy to see that we obtain a metric on $X_1 \times X_2$ in all of these cases.
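
The constructions of this subsection can be sketched as follows. The sketch takes the three combinations to be the root of the sum of squares, the sum, and the maximum of the coordinate distances; the function and variable names are illustrative assumptions, and points of the product are represented as pairs.

```python
import math


def product_metrics(d1, d2):
    """Build three metrics on X1 x X2 from metrics d1 on X1 and d2 on X2."""
    def euclidean(p, q):
        # Root of the sum of squares of the coordinate distances.
        return math.hypot(d1(p[0], q[0]), d2(p[1], q[1]))

    def total(p, q):
        # Sum of the coordinate distances.
        return d1(p[0], q[0]) + d2(p[1], q[1])

    def largest(p, q):
        # Maximum of the coordinate distances.
        return max(d1(p[0], q[0]), d2(p[1], q[1]))

    return euclidean, total, largest


def standard(x, y):
    """Standard metric on the real line."""
    return abs(x - y)


d_euclid, d_sum, d_max = product_metrics(standard, standard)
p, q = (0.0, 0.0), (3.0, 4.0)
print(d_euclid(p, q), d_sum(p, q), d_max(p, q))  # 5.0 7.0 4.0
```

With two copies of the real line the first combination reproduces the usual Euclidean distance in the plane, while the other two give different distances between the same pair of points.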

Definition 10 If $(X_1, d_1)$ and $(X_2, d_2)$ are two metric spaces, the space
$(X_1 \times X_2, d)$, where $d$ is a metric on $X_1 \times X_2$ introduced by any of the methods just
indicated, will be called the direct product of the original metric spaces.

Example 16 The space $\mathbb{R}^2$ can be regarded as the direct product of two copies of
the metric space $\mathbb{R}$ with its standard metric, and $\mathbb{R}^3$ is the direct product $\mathbb{R}^2 \times \mathbb{R}^1$ of
the spaces $\mathbb{R}^2$ and $\mathbb{R}^1 = \mathbb{R}$.


9.1.5 Problems and Exercises 

1. a) Extending Example 2, show that if $f : \mathbb{R}_+ \to \mathbb{R}_+$ is a continuous function
that is strictly convex upward and satisfies $f(0) = 0$, while $(X, d)$ is a metric space,
then one can introduce a new metric $d_f$ on $X$ by setting $d_f(x_1, x_2) = f(d(x_1, x_2))$.

b) Show that on any metric space $(X, d)$ one can introduce a metric
$d'(x_1, x_2) = \frac{d(x_1, x_2)}{1 + d(x_1, x_2)}$ in which the distance between the points will be less than 1.

2. Let $(X, d)$ be a metric space with the trivial (discrete) metric shown in
Example 2, and let $a \in X$. For this case, what are the sets $B(a, 1/2)$, $B(a, 1)$,
$\overline{B}(a, 1)$, $\widetilde{B}(a, 1)$, $B(a, 3/2)$, and what are the sets $\{x \in X \mid d(a, x) = 1/2\}$,
$\{x \in X \mid d(a, x) = 1\}$, $\overline{B}(a, 1) \setminus B(a, 1)$, $\widetilde{B}(a, 1) \setminus B(a, 1)$?

3. a) Is it true that the union of any family of closed sets is a closed set?

b) Is every boundary point of a set a limit point of that set? 

c) Is it true that in any neighborhood of a boundary point of a set there are points 
in both the interior and exterior of that set? 

d) Show that the set of boundary points of any set is a closed set. 

4. a) Prove that if $(Y, d_Y)$ is a subspace of the metric space $(X, d_X)$, then for any
open (resp. closed) set $G_Y$ (resp. $F_Y$) in $Y$ there is an open (resp. closed) set $G_X$
(resp. $F_X$) in $X$ such that $G_Y = Y \cap G_X$ (resp. $F_Y = Y \cap F_X$).

b) Verify that if the open sets $G'_Y$ and $G''_Y$ in $Y$ do not intersect, then the
corresponding sets $G'_X$ and $G''_X$ in $X$ can be chosen so that they also have no points in
common.

5. Having a metric $d$ on a set $X$, one may attempt to define the distance $d(A, B)$
between sets $A \subset X$ and $B \subset X$ as follows:

$$d(A, B) = \inf_{a \in A,\, b \in B} d(a, b).$$

a) Give an example of a metric space and two nonintersecting subsets of it $A$
and $B$ for which $d(A, B) = 0$.

b) Show, following Hausdorff, that on the set of closed sets of a metric space
$(X, d)$ one can introduce the Hausdorff metric $D$ by assuming that for $A \subset X$ and
$B \subset X$

$$D(A, B) := \max\left\{ \sup_{a \in A} d(a, B),\ \sup_{b \in B} d(A, b) \right\}.$$
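
For finite nonempty sets the two set distances of Problem 5 can be computed directly, which may help in building intuition before attempting the proofs. The sketch below is an illustration only, not a solution: the function names and the sample sets are assumptions, and for finite sets the infima and suprema reduce to minima and maxima.

```python
def set_distance(A, B, d):
    """d(A, B) = inf over a in A, b in B of d(a, b), for finite nonempty sets."""
    return min(d(a, b) for a in A for b in B)


def hausdorff_distance(A, B, d):
    """D(A, B) = max{ sup_a d(a, B), sup_b d(A, b) }, for finite nonempty sets."""
    sup_a = max(set_distance([a], B, d) for a in A)
    sup_b = max(set_distance(A, [b], d) for b in B)
    return max(sup_a, sup_b)


def standard(x, y):
    """Standard metric on the real line."""
    return abs(x - y)


A, B = [0.0, 1.0], [0.0, 3.0]
print(set_distance(A, B, standard))        # 0.0, although A != B
print(hausdorff_distance(A, B, standard))  # 2.0
```

The first quantity vanishes for these two different sets, so it cannot serve as a metric on sets, whereas the second separates them.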


9.2 Topological Spaces 

For questions connected with the concept of the limit of a function or a mapping, 
what is essential in many cases is not the presence of any particular metric on the 
space, but rather the possibility of saying what a neighborhood of a point is. To 
convince oneself of that it suffices to recall that the very definition of a limit or the 
definition of continuity can be stated in terms of neighborhoods. Topological spaces 
are the mathematical objects on which the operation of passage to the limit and the 
concept of continuity can be studied in maximum generality. 


9.2.1 Basic Definitions 

Definition 1 A set $X$ is said to be endowed with the structure of a topological space
or a topology or is said to be a topological space if a system $\tau$ of subsets of $X$ is
exhibited (called open sets in $X$) possessing the following properties:

a) $\varnothing \in \tau$; $X \in \tau$.

b) $(\forall \alpha \in A\ (\tau_\alpha \in \tau)) \Rightarrow \bigcup_{\alpha \in A} \tau_\alpha \in \tau$.

c) $(\tau_i \in \tau;\ i = 1, \ldots, n) \Rightarrow \bigcap_{i=1}^{n} \tau_i \in \tau$.

Thus, a topological space is a pair $(X, \tau)$ consisting of a set $X$ and a system $\tau$
of distinguished subsets of the set having the properties that $\tau$ contains the empty
set and the whole set $X$, the union of any number of sets of $\tau$ is a set of $\tau$, and the
intersection of any finite number of sets of $\tau$ is a set of $\tau$.

As one can see, in the axiom system a), b), c) for a topological space we have
postulated precisely the properties of open sets that we already proved in the case of
a metric space. Thus any metric space with the definition of open sets given above
is a topological space.

Thus defining a topology on $X$ means exhibiting a system $\tau$ of subsets of $X$
satisfying the axioms a), b), and c) for a topological space.

Defining a metric in X, as we have seen, automatically defines the topology on X 
induced by that metric. It should be remarked, however, that different metrics on X 
may generate the same topology on that set. 

Example 1 Let $X = \mathbb{R}^n$ ($n > 1$). Consider the metric $d_1(x_1, x_2)$ defined by relation
(9.5) in Sect. 9.1, and the metric $d_2(x_1, x_2)$ defined by formula (9.3) in Sect. 9.1.
The inequalities

$$d_1(x_1, x_2) \le d_2(x_1, x_2) \le \sqrt{n}\, d_1(x_1, x_2)$$

obviously imply that every ball $B(a, r)$ with center at an arbitrary point $a \in X$,
interpreted in the sense of one of these two metrics, contains a ball with the same
center, interpreted in the sense of the other metric. Hence by definition of an open
subset of a metric space, it follows that the two metrics induce the same topology
on $X$.
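
The ball inclusions behind this example can be written out explicitly. In the sketch below, $B_1$ and $B_2$ denote balls with respect to the metrics $d_1$ and $d_2$ respectively; this subscript notation is introduced here only for the derivation:

```latex
d_1 \le d_2 \;\Longrightarrow\; B_2(a, r) \subset B_1(a, r),
\qquad
d_2 \le \sqrt{n}\, d_1 \;\Longrightarrow\; B_1\!\left(a, \tfrac{r}{\sqrt{n}}\right) \subset B_2(a, r).
```

Indeed, $d_2(a, x) < r$ forces $d_1(a, x) < r$, and $d_1(a, x) < r/\sqrt{n}$ forces $d_2(a, x) < r$, so a set containing a ball about each of its points in one metric contains such a ball in the other.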

Nearly all the topological spaces that we shall make active use of in this course
are metric spaces. One should not think, however, that every topological space can
be metrized, that is, endowed with a metric whose open sets will be the same as
the open sets in the system $\tau$ that defines the topology on $X$. The conditions under
which this can be done form the content of the so-called metrization theorems.

Definition 2 If $(X, \tau)$ is a topological space, the sets of the system $\tau$ are called the
open sets, and their complements in $X$ are called the closed sets of the topological
space $(X, \tau)$.

A topology $\tau$ on a set $X$ is seldom defined by enumerating all the sets in the
system $\tau$. More often the system $\tau$ is defined by exhibiting only a certain set of
subsets of $X$ from which one can obtain any set in the system $\tau$ through union and
intersection. The following definition is therefore very important.

Definition 3 A base of the topological space $(X, \tau)$ (an open base or base for the
topology) is a family $\mathfrak{B}$ of open subsets of $X$ such that every open set $G \in \tau$ is the
union of some collection of elements of the family $\mathfrak{B}$.

Example 2 If $(X, d)$ is a metric space and $(X, \tau)$ the topological space corresponding
to it, the set $\mathfrak{B} = \{B(a, r)\}$ of all balls, where $a \in X$ and $r > 0$, is obviously a
base of the topology $\tau$. Moreover, if we take the system $\mathfrak{B}$ of all balls with positive
rational radii $r$, this system is also a base for the topology.
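
For a finite set the passage from a base to the topology can be carried out mechanically. The sketch below is an illustration under that finiteness assumption. The helper names `topology_from_base` and `is_topology` are hypothetical, and the three-point set is an arbitrary choice. It forms all unions of members of a given family and then checks the axioms, since not every family of subsets is the base of a topology.

```python
from itertools import combinations

def topology_from_base(base):
    """Return all unions of subfamilies of `base` (the empty union gives the empty set)."""
    base = [frozenset(b) for b in base]
    opens = {frozenset()}
    for k in range(1, len(base) + 1):
        for family in combinations(base, k):
            opens.add(frozenset().union(*family))
    return opens

def is_topology(X, opens):
    """Check the axioms for a finite family of subsets of X."""
    X = frozenset(X)
    if frozenset() not in opens or X not in opens:
        return False
    # For a finite family, pairwise closure implies closure under all unions.
    return all((u | v) in opens and (u & v) in opens for u in opens for v in opens)

X = {1, 2, 3}
tau = topology_from_base([{1}, {2}, {1, 2, 3}])
# A different, larger base that generates the same system of open sets.
tau_again = topology_from_base([{1}, {2}, {1, 2}, {1, 2, 3}])
```

Here `tau` and `tau_again` coincide, which is a small instance of the fact that one topology may have several bases.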

Thus a topology can be defined by describing only a base of that topology. As 
one can see from Example 2, a topological space may have many different bases for 
the topology. 

Definition 4 The minimal cardinality among all bases of a topological space is
called its weight. 

As a rule, we shall be dealing with topological spaces whose topologies admit a 
countable base (see, however, Problems 4 and 6). 

Example 3 If we take the system $\mathfrak{B}$ of balls in $\mathbb{R}^k$ of all possible rational radii
$r = \frac{m}{n} > 0$ with centers at all possible rational points
$(\frac{m_1}{n_1}, \ldots, \frac{m_k}{n_k}) \in \mathbb{R}^k$, we obviously
obtain a countable base for the standard topology of $\mathbb{R}^k$. It is not difficult to verify
that it is impossible to define the standard topology in $\mathbb{R}^k$ by exhibiting a finite
system of open sets. Thus the standard topological space $\mathbb{R}^k$ has countable weight.

Definition 5 A neighborhood of a point of a topological space $(X, \tau)$ is an open set
containing the point.

It is clear that if a topology $\tau$ is defined on $X$, then for each point the system of
its neighborhoods is defined. 

It is also clear that the system of all neighborhoods of all possible points of a
topological space can serve as a base for the topology of that space. Thus a topology
can be introduced on $X$ by describing the neighborhoods of the points of $X$. This is
the way of defining the topology in $X$ that was originally used in the definition of a
topological space.² Notice, for example, that we have introduced the topology in a
metric space itself essentially by saying what a $\delta$-neighborhood of a point is. Let us
give one more example. 

Example 4 Consider the set $C(\mathbb{R}, \mathbb{R})$ of real-valued continuous functions defined
on the entire real line. Using this set as foundation, we shall construct a new set:
the set of germs of continuous functions. We shall regard two functions $f, g \in C(\mathbb{R}, \mathbb{R})$
as equivalent at the point $a \in \mathbb{R}$ if there is a neighborhood $U(a)$ of that
point such that $\forall x \in U(a)\ (f(x) = g(x))$. The relation just introduced really is an
equivalence relation (it is reflexive, symmetric, and transitive). An equivalence class
of continuous functions at the point $a \in \mathbb{R}$ is called a germ of continuous functions at
that point. If $f$ is one of the functions generating the germ at the point $a$, we shall
denote the germ itself by the symbol $f_a$. Now let us define a neighborhood of a
germ. Let $U(a)$ be a neighborhood of the point $a$ and $f$ a function defined on $U(a)$
generating the germ $f_a$ at $a$. This same function generates its germ $f_x$ at any point
$x \in U(a)$. The set $\{f_x\}$ of all germs corresponding to the points $x \in U(a)$ will be
called a neighborhood of the germ $f_a$. Taking the set of such neighborhoods of all
germs as the base of a topology, we turn the set of germs of continuous functions
into a topological space. It is worthwhile to note that in the resulting topological
space two different points (germs) $f_a$ and $g_a$ may not have disjoint neighborhoods
(see Fig. 9.1).

Fig. 9.1

² The concepts of a metric space and a topological space were explicitly stated early in the
twentieth century. In 1906 the French mathematician M. Fréchet (1878-1973) introduced the concept
of a metric space, and in 1914 the German mathematician F. Hausdorff (1868-1942) defined a
topological space.

Definition 6 A topological space is Hausdorff if the Hausdorff axiom holds in it:
any two distinct points of the space have nonintersecting neighborhoods.

Example 5 Any metric space is obviously Hausdorff, since for any two points
$a, b \in X$ such that $d(a, b) > 0$ their spherical neighborhoods $B(a, \frac{1}{2} d(a, b))$,
$B(b, \frac{1}{2} d(a, b))$ have no points in common.

At the same time, as Example 4 shows, there do exist non-Hausdorff topological 
spaces. Perhaps the simplest example here is the topological space $(X, \tau)$ with the
trivial topology $\tau = \{\varnothing, X\}$. If $X$ contains at least two distinct points, then $(X, \tau)$ is
obviously not Hausdorff. Moreover, the complement $X \setminus x$ of a point in this space is
not an open set. 
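
On a finite set the Hausdorff axiom can be tested by brute force. The sketch below is illustrative only. The function name `is_hausdorff` and the two-point set are hypothetical choices. It compares the trivial topology with the discrete topology on the same set.

```python
def is_hausdorff(X, opens):
    """True if any two distinct points have disjoint open neighborhoods."""
    points = list(X)
    for i, p in enumerate(points):
        for q in points[i + 1:]:
            separated = any(
                p in U and q in V and not (U & V)
                for U in opens for V in opens
            )
            if not separated:
                return False
    return True

X = frozenset({"a", "b"})
trivial = {frozenset(), X}
discrete = {frozenset(), frozenset({"a"}), frozenset({"b"}), X}
```

With these definitions `is_hausdorff(X, trivial)` is false and `is_hausdorff(X, discrete)` is true, and the complement of the point `"a"` is not a member of `trivial`, in agreement with the remark above.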

We shall be working exclusively with Hausdorff spaces. 

Definition 7 A set $E \subset X$ is (everywhere) dense in a topological space $(X, \tau)$ if
for any point $x \in X$ and any neighborhood $U(x)$ of it the intersection $E \cap U(x)$ is
nonempty.

Example 6 If we consider the standard topology in $\mathbb{R}$, the set $\mathbb{Q}$ of rational numbers
is everywhere dense in $\mathbb{R}$. Similarly the set $\mathbb{Q}^n$ of rational points in $\mathbb{R}^n$ is dense
in $\mathbb{R}^n$.


One can show that in every topological space there is an everywhere dense set 
whose cardinality does not exceed the weight of the topological space. 

Definition 8 A metric space having a countable dense set is called a separable 
space. 

Example 7 The metric space $(\mathbb{R}^n, d)$ in any of the standard metrics is a separable
space, since $\mathbb{Q}^n$ is dense in it.

Example 8 The metric space $(C([0, 1], \mathbb{R}), d)$ with the metric defined by (9.6)
is also separable. For, as follows from the uniform continuity of the functions
$f \in C([0, 1], \mathbb{R})$, the graph of any such function can be approximated as closely
as desired by a broken line consisting of a finite number of segments whose nodes 
have rational coordinates. The set of such broken lines is countable. 


We shall be dealing mainly with separable spaces.

We now remark that, since the definition of a neighborhood of a point in a topo- 
logical space is verbally the same as the definition of a neighborhood of a point in 
a metric space, the concepts of interior point, exterior point, boundary point, and
limit point of a set, and the concept of the closure of a set, all of which use only the 
concept of a neighborhood, can be carried over without any changes to the case of 
an arbitrary topological space. 

Moreover, as can be seen from the proof of Proposition 2 in Sect. 7.1, it is also 
true that a set in a Hausdorff space is closed if and only if it contains all its limit 
points. 


9.2.2 Subspaces of a Topological Space 

Let $(X, \tau_X)$ be a topological space and $Y$ a subset of $X$. The topology $\tau_X$ makes
it possible to define the following topology $\tau_Y$ in $Y$, called the induced or relative
topology on $Y \subset X$.

We define an open set in $Y$ to be any set $G_Y$ of the form $G_Y = Y \cap G_X$, where
$G_X$ is an open set in $X$.

It is not difficult to verify that the system $\tau_Y$ of subsets of $Y$ that arises in this
way satisfies the axioms for open sets in a topological space.

As one can see, the definition of open sets $G_Y$ in $Y$ agrees with the one we
obtained in Sect. 9.1.3 for the case when $Y$ is a subspace of a metric space $X$.

Definition 9 A subset $Y \subset X$ of a topological space $(X, \tau)$ with the topology $\tau_Y$
induced on $Y$ is called a subspace of the topological space $X$.

It is clear that a set that is open in $(Y, \tau_Y)$ is not necessarily open in $(X, \tau_X)$.
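
A small finite example makes the last remark concrete. The sketch below is an illustration. The function name `induced_topology` and the particular three-point space are hypothetical choices. It computes the induced topology as the set of intersections $Y \cap G_X$.

```python
def induced_topology(opens_X, Y):
    """Open sets of Y: all intersections of Y with open sets of X."""
    Y = frozenset(Y)
    return {Y & G for G in opens_X}

X = frozenset({1, 2, 3})
tau_X = {frozenset(), frozenset({1}), frozenset({1, 2}), X}
Y = frozenset({2, 3})
tau_Y = induced_topology(tau_X, Y)
```

In this example the one-point set $\{2\}$ is open in $Y$, being $Y \cap \{1, 2\}$, but it is not open in $X$.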


9.2.3 The Direct Product of Topological Spaces 

If $(X_1, \tau_1)$ and $(X_2, \tau_2)$ are two topological spaces with systems of open sets
$\tau_1 = \{G_1\}$ and $\tau_2 = \{G_2\}$, we can introduce a topology on $X_1 \times X_2$ by taking as the base
the sets of the form $G_1 \times G_2$.

Definition 10 The topological space $(X_1 \times X_2, \tau_1 \times \tau_2)$ whose topology has the
base consisting of sets of the form $G_1 \times G_2$, where $G_i$ is an open set in the topological
space $(X_i, \tau_i)$, $i = 1, 2$, is called the direct product of the topological spaces
$(X_1, \tau_1)$ and $(X_2, \tau_2)$.


Example 9 If $\mathbb{R} = \mathbb{R}^1$ and $\mathbb{R}^2$ are considered with their standard topologies, then,
as one can see, $\mathbb{R}^2$ is the direct product $\mathbb{R}^1 \times \mathbb{R}^1$. For every open set in $\mathbb{R}^2$ can be
obtained, for example, as the union of “square” neighborhoods of all its points. And
squares (with sides parallel to the axes) are the products of open intervals, which are
open sets in $\mathbb{R}$.

It should be noted that the sets $G_1 \times G_2$, where $G_1 \in \tau_1$ and $G_2 \in \tau_2$, constitute
only a base for the topology, not all the open sets in the direct product of topological 
spaces. 
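
That the products $G_1 \times G_2$ form only a base can be seen already for two-point spaces. The following sketch is illustrative. The function name `rectangles` and the choice of discrete two-point factors are assumptions made for the example. It builds all such products and exhibits a union of two of them that is not itself a product.

```python
from itertools import product

def rectangles(tau1, tau2):
    """Base of the product topology: all sets G1 x G2 with G1, G2 open."""
    return {frozenset(product(G1, G2)) for G1 in tau1 for G2 in tau2}

X1 = frozenset({0, 1})
X2 = frozenset({0, 1})
# Discrete topologies on the two-point sets (an illustrative choice).
tau1 = {frozenset(), frozenset({0}), frozenset({1}), X1}
tau2 = {frozenset(), frozenset({0}), frozenset({1}), X2}

base = rectangles(tau1, tau2)
# The diagonal is a union of two base sets, hence open in the product,
# yet it is not of the form G1 x G2.
diagonal = frozenset({(0, 0)}) | frozenset({(1, 1)})
```

Any product containing both $(0, 0)$ and $(1, 1)$ must contain all four points, so `diagonal` cannot belong to `base`.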


9.2.4 Problems and Exercises 

1. Verify that if $(X, d)$ is a metric space, then $(X, \frac{d}{1+d})$ is also a metric space, and
the metrics $d$ and $\frac{d}{1+d}$ induce the same topology on $X$. (See also Problem 1 of the
preceding section.)

2. a) In the set N of natural numbers we define a neighborhood of the number 
$n \in \mathbb{N}$ to be an arithmetic progression with difference $d$ relatively prime to $n$. Is the
resulting topological space Hausdorff? 

b) What is the topology of N, regarded as a subset of the set R of real numbers 
with the standard topology? 

c) Describe all open subsets of R. 

3. If two topologies $\tau_1$ and $\tau_2$ are defined on the same set, we say that $\tau_2$ is stronger
than $\tau_1$ if $\tau_1 \subset \tau_2$, that is, $\tau_2$ contains all the sets in $\tau_1$ and some additional open sets
not in $\tau_1$.

a) Are the two topologies on N considered in the preceding problem compara- 
ble? 

b) If we introduce a metric on the set C[0, 1] of continuous real-valued functions 
defined on the closed interval [0, 1] first by relation (9.6) of Sect. 9.1, and then by 
relation (9.7) of the same section, two topologies generally arise on $C[0, 1]$. Are
they comparable? 

4. a) Prove in detail that the space of germs of continuous functions defined in
Example 4 is not Hausdorff. 

b) Explain why this topological space is not metrizable. 

c) What is the weight of this space? 

5. a) State the axioms for a topological space in the language of closed sets. 

b) Verify that the closure of the closure of a set equals the closure of the set. 

c) Verify that the boundary of any set is a closed set. 

d) Show that if $F$ is closed and $G$ is open in $(X, \tau)$, then the set $G \setminus F$ is open in
$(X, \tau)$.

e) If $(Y, \tau_Y)$ is a subspace of the topological space $(X, \tau_X)$, and the set $E$ is such
that $E \subset Y \subset X$ and $E \in \tau_X$, then $E \in \tau_Y$.

6. A topological space $(X, \tau)$ in which every point is a closed set is called a topological
space in the strong sense or a $\tau_1$-space. Verify the following statements.

a) Every Hausdorff space is a $\tau_1$-space (partly for this reason, Hausdorff spaces
are sometimes called $\tau_2$-spaces).


b) Not every $\tau_1$-space is a $\tau_2$-space. (See Example 4.)

c) The two-point space $X = \{a, b\}$ with the open sets $\{\varnothing, X\}$ is not a $\tau_1$-space.

d) In a $\tau_1$-space a set $F$ is closed if and only if it contains all its limit points.

7. a) Prove that in any topological space there is an everywhere dense set whose 
cardinality does not exceed the weight of the space. 

b) Verify that the following metric spaces are separable: $C[a, b]$, $C^{(k)}[a, b]$,
$\mathcal{R}_1[a, b]$, $\mathcal{R}_p[a, b]$ (for the formulas giving the respective metrics see Sect. 9.1).

c) Verify that if max is replaced by sup in relation (9.6) of Sect. 9.1 and regarded
as a metric on the set of all bounded real-valued functions defined on a closed inter- 
val [a, b], we obtain a nonseparable metric space. 


9.3 Compact Sets 

9.3.1 Definition and General Properties of Compact Sets 

Definition 1 A set $K$ in a topological space $(X, \tau)$ is compact (or bicompact³) if
from every covering of $K$ by sets that are open in $X$ one can select a finite number
of sets that cover $K$.

Example 1 An interval $[a, b]$ of the set $\mathbb{R}$ of real numbers in the standard topology
is a compact set, as follows immediately from the lemma of Sect. 2.1.3 asserting 
that one can select a finite covering from any covering of a closed interval by open 
intervals. 

In general an $m$-dimensional closed interval $I^m = \{x \in \mathbb{R}^m \mid a^i \le x^i \le b^i,\ i = 1, \ldots, m\}$
in $\mathbb{R}^m$ is a compact set, as was established in Sect. 7.1.3.

It was also proved in Sect. 7.1.3 that a subset of $\mathbb{R}^m$ is compact if and only if it
is closed and bounded. 

In contrast to the relative properties of being open and closed, the property of 
compactness is absolute, in the sense that it is independent of the ambient space. 
More precisely, the following proposition holds. 

Proposition 1 A subset $K$ of a topological space $(X, \tau)$ is a compact subset of $X$ if
and only if $K$ is compact as a subset of itself with the topology induced from $(X, \tau)$.

Proof This proposition follows from the definition of compactness and the fact that
every set $G_K$ that is open in $K$ can be obtained as the intersection of $K$ with some
set $G_X$ that is open in $X$. □


³ The concept of compactness introduced by Definition 1 is sometimes called bicompactness in
topology.

Thus, if $(X, \tau_X)$ and $(Y, \tau_Y)$ are two topological spaces that induce the same
topology on $K \subset X \cap Y$, then $K$ is simultaneously compact or not compact in both
$X$ and $Y$.

Example 2 Let $d$ be the standard metric on $\mathbb{R}$ and $I = \{x \in \mathbb{R} \mid 0 < x < 1\}$ the unit
interval in $\mathbb{R}$. The metric space $(I, d)$ is closed (in itself) and bounded, but is not a
compact set, since, for example, it is not a compact subset of $\mathbb{R}$.

We now establish the most important properties of compact sets. 

Lemma 1 (Compact sets are closed) If $K$ is a compact set in a Hausdorff space
$(X, \tau)$, then $K$ is a closed subset of $X$.

Proof By the criterion for a set to be closed, it suffices to verify that every limit
point of $K$, $x_0 \in X$, belongs to $K$.

Suppose $x_0 \notin K$. For each point $x \in K$ we construct an open neighborhood $G(x)$
such that $x_0$ has a neighborhood disjoint from $G(x)$ (this is possible because the
space is Hausdorff). The set $G(x)$, $x \in K$, of all
such neighborhoods forms an open covering of $K$, from which one can select a
finite covering $G(x_1), \ldots, G(x_n)$. Now if $O_i(x_0)$ is a neighborhood of $x_0$ such that
$G(x_i) \cap O_i(x_0) = \varnothing$, the set $O(x_0) = \bigcap_{i=1}^{n} O_i(x_0)$ is also a neighborhood of $x_0$, and
$G(x_i) \cap O(x_0) = \varnothing$ for all $i = 1, \ldots, n$. But this means that $K \cap O(x_0) = \varnothing$, and
then $x_0$ cannot be a limit point for $K$. □

Lemma 2 (Nested compact sets) If $K_1 \supset K_2 \supset \cdots \supset K_n \supset \cdots$ is a nested sequence
of nonempty compact sets, then the intersection $\bigcap_{i=1}^{\infty} K_i$ is nonempty.

Proof By Lemma 1 the sets $G_i = K_1 \setminus K_i$, $i = 1, \ldots, n, \ldots$, are open in $K_1$. If the
intersection $\bigcap_{i=1}^{\infty} K_i$ is empty, then the sequence $G_1 \subset G_2 \subset \cdots \subset G_n \subset \cdots$ forms
a covering of $K_1$. Extracting a finite covering from it, we find that some element
$G_m$ of the sequence forms a covering of $K_1$. But by hypothesis $K_m = K_1 \setminus G_m \ne \varnothing$.
This contradiction completes the proof of Lemma 2. □

Lemma 3 (Closed subsets of compact sets) A closed subset F of a compact set K 
is itself compact. 

Proof Let $\{G_\alpha, \alpha \in A\}$ be an open covering of $F$. Adjoining to this collection the
open set $G = K \setminus F$, we obtain an open covering of the entire compact set $K$. From
this covering we can extract a finite covering of $K$. Since $G \cap F = \varnothing$, it follows that
the set $\{G_\alpha, \alpha \in A\}$ contains a finite covering of $F$. □


9.3.2 Metric Compact Sets 

We shall establish below some properties of metric compact sets, that is, metric 
spaces that are compact sets with respect to the topology induced by the metric. 


Definition 2 The set $E \subset X$ is called an $\varepsilon$-grid in the metric space $(X, d)$ if for
every point $x \in X$ there is a point $e \in E$ such that $d(e, x) < \varepsilon$.

Lemma 4 (Finite $\varepsilon$-grids) If a metric space $(K, d)$ is compact, then for every $\varepsilon > 0$
there exists a finite $\varepsilon$-grid in $K$.

Proof For each point $x \in K$ we choose an open ball $B(x, \varepsilon)$. From the open covering
of $K$ by these balls we select a finite covering $B(x_1, \varepsilon), \ldots, B(x_n, \varepsilon)$. The points
$x_1, \ldots, x_n$ obviously form the required $\varepsilon$-grid. □
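
The definition of an $\varepsilon$-grid can be illustrated on a finite sample of points of the unit square with the Euclidean metric. The sketch below is not the covering argument of the proof. It is a simple greedy selection, shown only to make the defining property tangible, and the names `greedy_grid` and `is_eps_grid`, the sample size and the value of $\varepsilon$ are all hypothetical choices.

```python
import math
import random

def greedy_grid(points, eps):
    """Pick centers one by one until every point is within eps of some center."""
    centers = []
    for p in points:
        # p becomes a new center only if no existing center is closer than eps.
        if all(math.dist(p, c) >= eps for c in centers):
            centers.append(p)
    return centers

def is_eps_grid(centers, points, eps):
    """Check the defining property: each point has a center at distance < eps."""
    return all(any(math.dist(p, c) < eps for c in centers) for p in points)

rng = random.Random(1)
sample = [(rng.random(), rng.random()) for _ in range(2000)]
grid = greedy_grid(sample, 0.1)
```

The selected centers are pairwise at least $\varepsilon$ apart, so in a bounded region their number stays bounded no matter how large the sample is.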

In analysis, besides arguments that involve the extraction of a finite covering, one 
often encounters arguments in which a convergent subsequence is extracted from an 
arbitrary sequence. As it happens, the following proposition holds. 

Proposition 2 (Criterion for compactness in a metric space) A metric space $(K, d)$
is compact if and only if from each sequence of its points one can extract a subsequence
that converges to a point of $K$.

The convergence of the sequence $\{x_n\}$ to some point $a \in K$, as before, means
that for every neighborhood $U(a)$ of the point $a \in K$ there exists an index $N \in \mathbb{N}$
such that $x_n \in U(a)$ for $n > N$.

We shall discuss the concept of limit in more detail below in Sect. 9.6. 

We preface the proof of Proposition 2 with two lemmas. 

Lemma 5 If a metric space $(K, d)$ is such that from each sequence of its points one
can select a subsequence that converges in $K$, then for every $\varepsilon > 0$ there exists a
finite $\varepsilon$-grid.

Proof If there were no finite $\varepsilon_0$-grid for some $\varepsilon_0 > 0$, one could construct a sequence
$\{x_n\}$ of points in $K$ such that $d(x_n, x_i) \ge \varepsilon_0$ for all $n \in \mathbb{N}$ and all $i \in \{1, \ldots, n-1\}$.
Obviously it is impossible to extract a convergent subsequence of this sequence. □

Lemma 6 If the metric space $(K, d)$ is such that from each sequence of its points
one can select a subsequence that converges in $K$, then every nested sequence of
nonempty closed subsets of the space has a nonempty intersection.

Proof If $F_1 \supset \cdots \supset F_n \supset \cdots$ is the sequence of closed sets, then choosing one point
of each, we obtain a sequence $x_1, \ldots, x_n, \ldots$, from which we extract a convergent
subsequence $\{x_{n_i}\}$. The limit $a \in K$ of this sequence, by construction, necessarily
belongs to each of the closed sets $F_i$, $i \in \mathbb{N}$. □

We can now prove Proposition 2. 

Proof We first verify that if $(K, d)$ is compact and $\{x_n\}$ a sequence of points in
it, one can extract a subsequence that converges to some point of $K$. If the sequence
$\{x_n\}$ has only a finite number of different values, the assertion is obvious.
Therefore we may assume that the sequence $\{x_n\}$ has infinitely many different values.
For $\varepsilon_1 = 1/1$, we construct a finite 1-grid and take a closed ball $B(a_1, 1)$
that contains an infinite number of terms of the sequence. By Lemma 3 the ball
$B(a_1, 1)$ is itself a compact set, in which there exists a finite $\varepsilon_2 = 1/2$-grid and a
ball $B(a_2, 1/2)$ containing infinitely many elements of the sequence. In this way a
nested sequence of compact sets $B(a_1, 1) \supset B(a_2, 1/2) \supset \cdots \supset B(a_n, 1/n) \supset \cdots$
arises, and by Lemma 2 has a common point $a \in K$. Choosing a point $x_{n_1}$ of the
sequence $\{x_n\}$ in the ball $B(a_1, 1)$, then a point $x_{n_2}$ in $B(a_2, 1/2)$ with $n_2 > n_1$, and
so on, we obtain a subsequence $\{x_{n_i}\}$ that converges to $a$ by construction.

We now prove the converse, that is, we verify that if from every sequence $\{x_n\}$ of
points of the metric space $(K, d)$ one can select a subsequence that converges in $K$,
then $(K, d)$ is compact.

In fact, if there is some open covering $\{G_\alpha, \alpha \in A\}$ of the space $(K, d)$ from
which one cannot select a finite covering, then using Lemma 5 to construct a finite
1-grid in $K$, we find a closed ball $B(a_1, 1)$ that also cannot be covered by a finite
collection of sets of the system $\{G_\alpha, \alpha \in A\}$.

The ball $B(a_1, 1)$ can now be regarded as the initial set, and, constructing a finite
1/2-grid in it, we find in it a ball $B(a_2, 1/2)$ that does not admit covering by a finite
number of sets in the system $\{G_\alpha, \alpha \in A\}$.

The resulting nested sequence of closed sets
$B(a_1, 1) \supset B(a_2, 1/2) \supset \cdots \supset B(a_n, 1/n) \supset \cdots$
has a common point $a \in K$ by Lemma 6, and the construction
shows that there is only one such point. This point is covered by some set $G_{\alpha_0}$
of the system; and since $G_{\alpha_0}$ is open, all the sets $B(a_n, 1/n)$ must be contained
in $G_{\alpha_0}$ for sufficiently large values of $n$. This contradiction completes the proof of
the proposition. □


9.3.3 Problems and Exercises 

1. A subset of a metric space is totally bounded if for every $\varepsilon > 0$ it has a finite
$\varepsilon$-grid.

a) Verify that total boundedness of a set is unaffected, whether one forms the 
grid from points of the set itself or from points of the ambient space. 

b) Show that a subset of a complete metric space is compact if and only if it is 
totally bounded and closed. 

c) Show by example that a closed bounded subset of a metric space is not always 
totally bounded, and hence not always compact. 

2. A subset of a topological space is relatively (or conditionally) compact if its 
closure is compact. 

Give examples of relatively compact subsets of $\mathbb{R}^n$.

3. A topological space is locally compact if each point of the space has a relatively 
compact neighborhood. 

Give examples of locally compact topological spaces that are not compact. 


4. Show that for every locally compact, but not compact topological space $(X, \tau_X)$
there is a compact topological space $(Y, \tau_Y)$ such that $X \subset Y$, $Y \setminus X$ consists of a
single point, and the space $(X, \tau_X)$ is a subspace of the space $(Y, \tau_Y)$.


9.4 Connected Topological Spaces 

Definition 1 A topological space $(X, \tau)$ is connected if it contains no open-closed
sets⁴ except $X$ itself and the empty set.

This definition will become more transparent to intuition if we recast it in the 
following form. 

A topological space is connected if and only if it cannot be represented as the 
union of two disjoint nonempty closed sets (or two disjoint nonempty open sets). 

Definition 2 A set $E$ in a topological space $(X, \tau)$ is connected if it is connected as
a topological subspace of $(X, \tau)$ (with the induced topology).

It follows from this definition and Definition 1 that the property of a set of being
connected is independent of the ambient space. More precisely, if $(X, \tau_X)$ and
$(Y, \tau_Y)$ are topological spaces containing $E$ and inducing the same topology on $E$,
then $E$ is connected or not connected simultaneously in both $X$ and $Y$.

Example 1 Let $E = \{x \in \mathbb{R} \mid x \ne 0\}$. The set $E_- = \{x \in E \mid x < 0\}$ is nonempty,
not equal to $E$, and at the same time open-closed in $E$ (as is $E_+ = \{x \in \mathbb{R} \mid x > 0\}$),
if $E$ is regarded as a topological space with the topology induced by the standard
topology of $\mathbb{R}$. Thus, as our intuition suggests, $E$ is not connected.

Proposition (Connected subsets of $\mathbb{R}$) A nonempty set $E \subset \mathbb{R}$ is connected if and
only if for any $x$ and $z$ belonging to $E$, the inequalities $x < y < z$ imply that $y \in E$.

Thus, the only connected subsets of the line are intervals (finite or infinite): open, 
half-open, and closed. 

Proof Necessity. Let $E$ be a connected subset of $\mathbb{R}$, and let the triple of points $a, b, c$
be such that $a \in E$, $b \in E$, but $c \notin E$, even though $a < c < b$. Setting
$A = \{x \in E \mid x < c\}$, $B = \{x \in E \mid x > c\}$, we see that $a \in A$, $b \in B$, that is, $A \ne \varnothing$, $B \ne \varnothing$,
and $A \cap B = \varnothing$. Moreover $E = A \cup B$, and both sets $A$ and $B$ are open in $E$. This
contradicts the connectedness of $E$.

Sufficiency. Let $E$ be a subspace of $\mathbb{R}$ having the property that together with any pair
of points $a$ and $b$ belonging to it, every point between them in the closed interval
$[a, b]$ also belongs to $E$. We shall show that $E$ is connected.


⁴ That is, sets that are simultaneously open and closed.

Suppose that $A$ is an open-closed subset of $E$ with $A \ne \varnothing$ and $B = E \setminus A \ne \varnothing$.
Let $a \in A$ and $b \in B$. For definiteness we shall assume that $a < b$. (We certainly
have $a \ne b$, since $A \cap B = \varnothing$.) Consider the point $c_1 = \sup\{A \cap [a, b]\}$. Since
$A \ni a \le c_1 \le b \in B$, we have $c_1 \in E$. Since $A$ is closed in $E$, we conclude that $c_1 \in A$.

Considering now the point $c_2 = \inf\{B \cap [c_1, b]\}$ we conclude similarly, since $B$
is closed, that $c_2 \in B$. Thus $a \le c_1 < c_2 \le b$, since $c_1 \in A$, $c_2 \in B$, and $A \cap B = \varnothing$.
But it now follows from the definition of $c_1$ and $c_2$ and the relation $E = A \cup B$ that
no point of the open interval $]c_1, c_2[$ can belong to $E$. This contradicts the original
property of $E$. Thus the set $E$ cannot have a subset $A$ with these properties, and that
proves that $E$ is connected. □


9.4.1 Problems and Exercises 

1. a) Verify that if $A$ is an open-closed subset of $(X, \tau)$, then $B = X \setminus A$ is also such
a set.

b) Show that in terms of the ambient space the property of connectedness of a set
can be expressed as follows: A subset $E$ of a topological space $(X, \tau)$ is connected
if and only if there is no pair of open (or closed) subsets $G'_X$, $G''_X$ that are disjoint
and such that $E \cap G'_X \ne \varnothing$, $E \cap G''_X \ne \varnothing$, and $E \subset G'_X \cup G''_X$.

2. Show the following: 

a) The union of connected subspaces having a common point is connected. 

b) The intersection of connected subspaces is not always connected. 

c) The closure of a connected subspace is connected. 

3. One can regard the group $GL(n)$ of nonsingular $n \times n$ matrices with real entries as
an open subset in the product space $\mathbb{R}^{n^2}$, if each element of the matrix is associated
with a copy of the set $\mathbb{R}$ of real numbers. Is the space $GL(n)$ connected?

4. A topological space is locally connected if each of its points has a connected 
neighborhood. 

a) Show that a locally connected space may fail to be connected. 

b) The set $E$ in $\mathbb{R}^2$ consists of the graph of the function $x \mapsto \sin\frac{1}{x}$ (for $x \ne 0$)
plus the closed interval $\{(x, y) \in \mathbb{R}^2 \mid x = 0 \wedge |y| \le 1\}$ on the $y$-axis. The set $E$ is
endowed with the topology induced from $\mathbb{R}^2$. Show that the resulting topological
space is connected but not locally connected. 

5. In Sect. 7.2.2 we defined a connected subset of $\mathbb{R}^n$ as a set $E \subset \mathbb{R}^n$ any two
of whose points can be joined by a path whose support lies in $E$. In contrast to
the definition of topological connectedness introduced in the present section, the
concept we considered in Chap. 7 is usually called path connectedness or arcwise
connectedness. Verify the following:

a) A path-connected subset of $\mathbb{R}^n$ is connected.



b) Not every connected subset of $\mathbb{R}^n$ with $n > 1$ is path connected. (See Problem 4.)

c) Every connected open subset of $\mathbb{R}^n$ is path connected.


9.5 Complete Metric Spaces 

In this section we shall be discussing only metric spaces, more precisely, a class of 
such spaces that plays an important role in various areas of analysis. 


9.5.1 Basic Definitions and Examples 

By analogy with the concepts that we already know from our study of the space $\mathbb{R}^n$,
we introduce the concepts of fundamental (Cauchy) sequences and convergent
sequences of points of an arbitrary metric space.

Definition 1 A sequence $\{x_n;\ n \in \mathbb{N}\}$ of points of a metric space $(X, d)$ is a fundamental
or Cauchy sequence if for every $\varepsilon > 0$ there exists $N \in \mathbb{N}$ such that
$d(x_m, x_n) < \varepsilon$ for any indices $m, n \in \mathbb{N}$ larger than $N$.

Definition 2 A sequence $\{x_n;\ n \in \mathbb{N}\}$ of points of a metric space $(X, d)$ converges
to the point $a \in X$ and $a$ is its limit if $\lim_{n \to \infty} d(a, x_n) = 0$.

A sequence that has a limit will be called convergent, as before.

We now give the basic definition. 

Definition 3 A metric space $(X, d)$ is complete if every Cauchy sequence of its
points is convergent. 

Example 1 The set R of real numbers with the standard metric is a complete met- 
ric space, as follows from the Cauchy criterion for convergence of a numerical se- 
quence. 

We remark that, since every convergent sequence of points in a metric space is 
obviously a Cauchy sequence, the definition of a complete metric space essentially 
amounts to simply postulating the Cauchy convergence criterion for it. 

Example 2 If the number 0, for example, is removed from the set $\mathbb{R}$, the remaining
set $\mathbb{R} \setminus 0$ will not be a complete space in the standard metric. Indeed, the sequence
$x_n = 1/n$, $n \in \mathbb{N}$, is a Cauchy sequence of points of this set, but has no limit in $\mathbb{R} \setminus 0$.
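
The Cauchy property of the sequence $x_n = 1/n$ can be verified with exact rational arithmetic. The sketch below is an illustration. The names `x` and `cauchy_index` and the value of the tolerance are hypothetical. For a given $\varepsilon$ it produces an index $N$ with $1/N \le \varepsilon$, which suffices because $|1/m - 1/n| < 1/N$ whenever $m, n > N$.

```python
from fractions import Fraction

def x(n):
    """The n-th term of the sequence x_n = 1/n, as an exact rational."""
    return Fraction(1, n)

def cauchy_index(eps):
    """An index N with |x_m - x_n| < eps for all m, n > N (any N >= 1/eps works)."""
    N = 1
    while Fraction(1, N) > eps:
        N += 1
    return N

eps = Fraction(1, 100)
N = cauchy_index(eps)
```

Every term `x(n)` is a nonzero number, while the only possible limit in $\mathbb{R}$ is 0, which is exactly the point that was removed.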

Example 3 The space $\mathbb{R}^n$ with any of its standard metrics is complete, as was
explained in Sect. 7.2.1.

Example 4 Consider the set $C[a, b]$ of real-valued continuous functions on a closed
interval $[a, b] \subset \mathbb{R}$, with the metric

$$d(f, g) = \max_{a \le x \le b} |f(x) - g(x)| \qquad (9.9)$$

(see Sect. 9.1, Example 7). We shall show that the metric space $C[a, b]$ is complete.

Proof Let $\{f_n(x);\ n \in \mathbb{N}\}$ be a Cauchy sequence of functions in $C[a, b]$, that is,

$$\forall \varepsilon > 0\ \exists N \in \mathbb{N}\ \forall m \in \mathbb{N}\ \forall n \in \mathbb{N}\ \bigl((m > N \wedge n > N) \Longrightarrow \forall x \in [a, b]\ (|f_m(x) - f_n(x)| < \varepsilon)\bigr). \qquad (9.10)$$

For each fixed value of $x \in [a, b]$, as one can see from (9.10), the numerical
sequence $\{f_n(x);\ n \in \mathbb{N}\}$ is a Cauchy sequence and hence has a limit $f(x)$ by the
Cauchy convergence criterion.

Thus

$$f(x) := \lim_{n \to \infty} f_n(x), \quad x \in [a, b]. \qquad (9.11)$$

We shall verify that the function $f(x)$ is continuous on $[a, b]$, that is, $f \in C[a, b]$.
It follows from (9.10) and (9.11) that the inequality

$$|f(x) - f_n(x)| \le \varepsilon \quad \forall x \in [a, b] \qquad (9.12)$$

holds for $n > N$.

We fix the point $x \in [a, b]$ and verify that the function $f$ is continuous at this
point. Suppose the increment $h$ is such that $(x + h) \in [a, b]$. The identity

$$f(x + h) - f(x) = f(x + h) - f_n(x + h) + f_n(x + h) - f_n(x) + f_n(x) - f(x)$$

implies the inequality

$$|f(x + h) - f(x)| \le |f(x + h) - f_n(x + h)| + |f_n(x + h) - f_n(x)| + |f_n(x) - f(x)|. \qquad (9.13)$$

By virtue of (9.12) the first and last terms on the right-hand side of this last
inequality do not exceed $\varepsilon$ if $n > N$. Fixing $n > N$, we obtain a function
$f_n \in C[a, b]$, and then choosing $\delta = \delta(\varepsilon)$ such that $|f_n(x + h) - f_n(x)| < \varepsilon$ for $|h| < \delta$,
we find that $|f(x + h) - f(x)| < 3\varepsilon$ if $|h| < \delta$. But this means that the function $f$
is continuous at the point $x$. Since $x$ was an arbitrary point of the closed interval
$[a, b]$, we have shown that $f \in C[a, b]$. Finally, (9.12) means that $d(f, f_n) \le \varepsilon$
for $n > N$, so the sequence converges to $f$ in the metric (9.9). □

Thus the space C[a, b] with the metric (9.9) is a complete metric space. This is 
a very important fact, one that is widely used in analysis. 


Fig. 9.2



Example 5 If instead of the metric (9.9) we consider the integral metric 


$$d(f, g) = \int_a^b |f - g|(x)\,dx \qquad (9.14)$$


on the same set C[a, b], the resulting metric space is no longer complete. 


Proof For the sake of notational simplicity, we shall assume $[a, b] = [-1, 1]$ and
consider, for example, the sequence $\{f_n \in C[-1, 1];\ n \in \mathbb{N}\}$ of functions defined as
follows:

$$f_n(x) = \begin{cases} -1, & \text{if } -1 \le x \le -1/n, \\ nx, & \text{if } -1/n < x < 1/n, \\ 1, & \text{if } 1/n \le x \le 1. \end{cases}$$

(See Fig. 9.2.)

It follows immediately from properties of the integral that this sequence is a
Cauchy sequence in the sense of the metric (9.14) in $C[-1, 1]$. At the same time, it
has no limit in $C[-1, 1]$. For if a continuous function $f \in C[-1, 1]$ were the limit
of this sequence in the sense of metric (9.14), then $f$ would have to be constant on
the interval $-1 \le x < 0$ and equal to $-1$ while at the same time it would have to be
constant and equal to 1 on the interval $0 < x \le 1$, which is incompatible with the
continuity of $f$ at the point $x = 0$. □
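
The difference between the two metrics on this sequence can be seen numerically. The sketch below is illustrative only. The function names, the midpoint rule and the grid sizes are assumptions made for the example. It approximates the integral distance (9.14) and the uniform distance (9.9) between $f_m$ and $f_n$. A direct computation gives $\int_{-1}^{1} |f_m - f_n|\,dx = |1/n - 1/m|$, while $\max |f_{2n} - f_n| = 1/2$ for every $n$.

```python
def f(n, x):
    """The function f_n of Example 5 on [-1, 1]."""
    if x <= -1 / n:
        return -1.0
    if x >= 1 / n:
        return 1.0
    return n * x

def d_integral(m, n, steps=200_000):
    """Midpoint-rule approximation of the integral of |f_m - f_n| over [-1, 1]."""
    h = 2.0 / steps
    return h * sum(
        abs(f(m, -1 + (k + 0.5) * h) - f(n, -1 + (k + 0.5) * h))
        for k in range(steps)
    )

def d_uniform(m, n, steps=20_000):
    """Approximation of max |f_m - f_n| over a uniform grid on [-1, 1]."""
    h = 2.0 / steps
    return max(abs(f(m, -1 + k * h) - f(n, -1 + k * h)) for k in range(steps + 1))
```

Thus the integral distances between $f_n$ and $f_{2n}$ tend to zero, while the uniform distances do not, so the sequence is a Cauchy sequence for (9.14) but not for (9.9).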


Example 6 It is slightly more difficult to show that even the set $\mathcal{R}[a, b]$ of real-valued
Riemann-integrable functions defined on the closed interval $[a, b]$ is not
complete in the sense of the metric (9.14).⁵ We shall show this, using the Lebesgue
criterion for Riemann integrability of a function.


Proof We take $[a, b]$ to be the closed interval $[0, 1]$, and we shall construct a Cantor
set on it that is not a set of measure zero. Let $\Delta \in\ ]0, 1/3[$. We remove from the
interval $[0, 1]$ the middle piece of it of length $\Delta$. More precisely, we remove the
$\Delta/2$-neighborhood of the midpoint of the closed interval $[0, 1]$. On each of the two
remaining intervals, we remove the middle piece of length $\Delta \cdot 1/3$. On each of the
four remaining closed intervals we remove the middle piece of length $\Delta \cdot 1/3^2$, and
so forth. The length of the intervals removed in this process is
$\Delta + \Delta \cdot 2/3 + \Delta \cdot 4/3^2 + \cdots + \Delta \cdot (2/3)^n + \cdots = 3\Delta$.
Since $0 < \Delta < 1/3$ we have $1 - 3\Delta > 0$, and, as
one can verify, it follows from this that the (Cantor) set $K$ remaining on the closed
interval $[0, 1]$ does not have measure zero in the sense of Lebesgue.

⁵ In regard to the metric (9.14) on $\mathcal{R}[a, b]$ see the remark to Example 9 in Sect. 9.1.

Now consider the following sequence: $\{f_n \in \mathcal{R}[0,1];\ n \in \mathbb{N}\}$. Let $f_n$ be a function
equal to 1 everywhere on [0, 1] except at the points of the intervals removed at
the first n steps, where it is set equal to zero. It is easy to verify that this sequence is
a Cauchy sequence in the sense of the metric (9.14). If some function $f \in \mathcal{R}[0,1]$
were the limit of this sequence, then $f$ would have to be equal to the characteristic
function of the set K at almost every point of the interval [0, 1]. Then $f$ would have
discontinuities at all points of the set K. But, since K does not have measure 0, one
could conclude from the Lebesgue criterion that $f \notin \mathcal{R}[0,1]$. Hence $\mathcal{R}[a,b]$ with
the metric (9.14) is not a complete metric space. □
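
The two quantitative claims of this proof (the total removed length and the Cauchy property) reduce to sums of a geometric progression, which can be checked in exact arithmetic. The sketch below is an illustration under our own assumptions: the value $\Delta = 1/4$ and the function names are hypothetical choices. It uses the fact that, for m > n, the function $f_n - f_m$ equals 1 exactly on the intervals removed at steps n+1, ..., m, so that the distance (9.14) between $f_n$ and $f_m$ is the total length of those intervals.

```python
from fractions import Fraction


def removed_length(delta, n):
    """Total length removed in the first n steps: sum of delta*(2/3)**j, j < n."""
    return sum(delta * Fraction(2, 3) ** j for j in range(n))


def distance(delta, n, m):
    """d(f_n, f_m) in the metric (9.14): length removed between steps n and m."""
    return abs(removed_length(delta, m) - removed_length(delta, n))


delta = Fraction(1, 4)  # any value in ]0, 1/3[ would do
for n in (5, 10, 20):
    print(n, float(removed_length(delta, n)), float(distance(delta, n, 2 * n)))

# Measure of the remaining Cantor set K, which is positive.
print("measure of K:", 1 - 3 * delta)
```

The removed length approaches $3\Delta$ from below, and the distances between $f_n$ and $f_{2n}$ tend to zero.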


9.5.2 The Completion of a Metric Space 

Example 7 Let us return again to the real line and consider the set Q of rational 
numbers with the metric induced by the standard metric on R. 

It is clear that a sequence of rational numbers converging to $\sqrt{2}$ in $\mathbb{R}$ is a Cauchy
sequence, but does not have a limit in $\mathbb{Q}$, that is, $\mathbb{Q}$ is not a complete space with
this metric. However, $\mathbb{Q}$ happens to be a subspace of the complete metric space $\mathbb{R}$,
which it is natural to regard as the completion of $\mathbb{Q}$. Note that the set $\mathbb{Q} \subset \mathbb{R}$ could
also be regarded as a subset of the complete metric space $\mathbb{R}^2$, but it does not seem
reasonable to call $\mathbb{R}^2$ the completion of $\mathbb{Q}$.

Definition 4 The smallest complete metric space containing a given metric space 
(X, d) is the completion of (X, d). 

This intuitively acceptable definition requires at least two clarifications: what is 
meant by the “smallest” space, and does it exist? 

We shall soon be able to answer both of these questions; in the meantime we 
adopt the following more formal definition. 

Definition 5 If a metric space (X, d) is a subspace of a complete metric space (Y, d)
and the set $X \subset Y$ is everywhere dense in Y, the space (Y, d) is called a completion
of the metric space (X, d).

Definition 6 We say that the metric space $(X_1, d_1)$ is isometric to the metric space
$(X_2, d_2)$ if there exists a bijective mapping $f : X_1 \to X_2$ such that
$d_2(f(a), f(b)) = d_1(a, b)$ for any points a and b in $X_1$. (The mapping $f : X_1 \to X_2$ is called an
isometry in that case.)

It is clear that this relation is reflexive, symmetric, and transitive, that is, it is
an equivalence relation between metric spaces. In studying the properties of metric 
spaces we study not the individual space, but the properties of all spaces isometric 
to it. For that reason one may regard isometric spaces as identical. 

Example 8 Two congruent figures in the plane are isometric as metric spaces, so 
that in studying the metric properties of figures we abstract completely, for example, 
from the location of a figure in the plane, identifying all congruent figures. 

By adopting the convention of identifying isometric spaces, one can show that if 
the completion of a metric space exists at all, it is unique. 

As a preliminary, we verify the following statement. 

Lemma The following inequality holds for any quadruple of points a, b, u, v of the
metric space (X, d):

$$|d(a,b) - d(u,v)| \le d(a,u) + d(b,v). \qquad (9.15)$$

Proof By the triangle inequality

$$d(a,b) \le d(a,u) + d(u,v) + d(b,v).$$

By the symmetry of the points, this relation implies (9.15). □
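
The appeal to symmetry can be written out in full. The following short derivation is a sketch that uses only the triangle inequality and the symmetry of d; the second line is obtained from the first by interchanging the pairs (a, b) and (u, v):

```latex
\begin{align*}
d(a,b) &\le d(a,u) + d(u,v) + d(v,b)
  \;\Longrightarrow\; d(a,b) - d(u,v) \le d(a,u) + d(b,v),\\
d(u,v) &\le d(u,a) + d(a,b) + d(b,v)
  \;\Longrightarrow\; d(u,v) - d(a,b) \le d(a,u) + d(b,v).
\end{align*}
```

Taken together, the two bounds give the estimate (9.15) for the absolute value.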

We now prove uniqueness of the completion. 

Proposition 1 If the metric spaces $(Y_1, d_1)$ and $(Y_2, d_2)$ are completions of the
same space (X, d), then they are isometric.

Proof We construct an isometry $f : Y_1 \to Y_2$ as follows. For $x \in X$ we set
$f(x) = x$. Then $d_2(f(x_1), f(x_2)) = d(f(x_1), f(x_2)) = d(x_1, x_2) = d_1(x_1, x_2)$ for
$x_1, x_2 \in X$. If $y_1 \in Y_1 \setminus X$, then $y_1$ is a limit point for X, since X is everywhere dense
in $Y_1$. Let $\{x_n;\ n \in \mathbb{N}\}$ be a sequence of points of X converging to $y_1$ in the sense of
the metric $d_1$. This sequence is a Cauchy sequence in the sense of the metric $d_1$. But
since the metrics $d_1$ and $d_2$ are both equal to d on X, this sequence is also a Cauchy
sequence in $(Y_2, d_2)$. The latter space is complete, and hence this sequence has a
limit $y_2 \in Y_2$. It can be verified in the standard manner that this limit is unique. We
now set $f(y_1) = y_2$. Since any point $y_2 \in Y_2 \setminus X$, just like any point $y_1 \in Y_1 \setminus X$, is the
limit of a Cauchy sequence of points in X, the mapping $f : Y_1 \to Y_2$ so constructed
is surjective.

We now verify that

$$d_2(f(y_1'), f(y_1'')) = d_1(y_1', y_1'') \qquad (9.16)$$

for any pair of points $y_1'$, $y_1''$ of $Y_1$.

If $y_1'$ and $y_1''$ belong to X, this equality is obvious. In the general case we take
two sequences $\{x_n';\ n \in \mathbb{N}\}$ and $\{x_n'';\ n \in \mathbb{N}\}$ converging to $y_1'$ and $y_1''$ respectively. It
follows from inequality (9.15) that

$$d_1(y_1', y_1'') = \lim_{n\to\infty} d_1(x_n', x_n''),$$

or, what is the same,

$$d_1(y_1', y_1'') = \lim_{n\to\infty} d(x_n', x_n''). \qquad (9.17)$$

By construction these same sequences converge to $y_2' = f(y_1')$ and $y_2'' = f(y_1'')$
respectively in the space $(Y_2, d_2)$. Hence

$$d_2(y_2', y_2'') = \lim_{n\to\infty} d(x_n', x_n''). \qquad (9.18)$$

Comparing relations (9.17) and (9.18), we obtain Eq. (9.16). This equality then
simultaneously establishes that the mapping $f : Y_1 \to Y_2$ is injective and hence
completes the proof that $f$ is an isometry. □

In Definition 5 of the completion (Y, d) of a metric space (X, d) we required
that (X, d) be a subspace of (Y, d) that is everywhere dense in (Y, d). Under the
identification of isometric spaces one could now broaden the idea of a completion
and adopt the following definition.

Definition 5' A complete metric space $(Y, d_Y)$ is a completion of the metric space
$(X, d_X)$ if there is a dense subspace of $(Y, d_Y)$ isometric to $(X, d_X)$.

We now prove the existence of a completion.

Proposition 2 Every metric space has a completion. 

Proof If the initial space itself is complete, then it is its own completion. 

We have already essentially demonstrated the idea for constructing the completion
of an incomplete metric space $(X, d_X)$ when we proved Proposition 1.

Consider the set of Cauchy sequences in the space $(X, d_X)$. Two such sequences
$\{x_n';\ n \in \mathbb{N}\}$ and $\{x_n'';\ n \in \mathbb{N}\}$ are called equivalent or confinal if $d_X(x_n', x_n'') \to 0$ as
$n \to \infty$. It is easy to see that confinality really is an equivalence relation. We shall
denote the set of equivalence classes of Cauchy sequences by S. We introduce a
metric in S by the following rule. If $s'$ and $s''$ are elements of S, and $\{x_n';\ n \in \mathbb{N}\}$ and
$\{x_n'';\ n \in \mathbb{N}\}$ are sequences from the classes $s'$ and $s''$ respectively, we set

$$d(s', s'') = \lim_{n\to\infty} d_X(x_n', x_n''). \qquad (9.19)$$

It follows from inequality (9.15) that this definition is unambiguous: the limit
written on the right exists (by the Cauchy criterion for a numerical sequence) and is
independent of the choice of the individual sequences $\{x_n';\ n \in \mathbb{N}\}$ and $\{x_n'';\ n \in \mathbb{N}\}$
from the classes $s'$ and $s''$.

The function $d(s', s'')$ satisfies all the axioms of a metric. The resulting metric
space (S, d) is the required completion of the space $(X, d_X)$. Indeed, $(X, d_X)$ is
isometric to the subspace $(S_X, d)$ of the space (S, d) consisting of the equivalence
classes of fundamental sequences that contain constant sequences $\{x_n = x \in X;\ n \in \mathbb{N}\}$.
It is natural to identify such a class $s \in S$ with the point $x \in X$. The mapping
$f : (X, d_X) \to (S_X, d)$ is obviously an isometry.

It remains to be verified that $(S_X, d)$ is everywhere dense in (S, d) and that (S, d)
is a complete metric space.

We first verify that $(S_X, d)$ is dense in (S, d). Let s be an arbitrary element of S
and $\{x_n;\ n \in \mathbb{N}\}$ a Cauchy sequence in $(X, d_X)$ belonging to the class $s \in S$. Taking
$\xi_n = f(x_n)$, $n \in \mathbb{N}$, we obtain a sequence $\{\xi_n;\ n \in \mathbb{N}\}$ of points of $(S_X, d)$ that has
precisely the element $s \in S$ as its limit, as one can see from (9.19).

We now prove that the space (S, d) is complete. Let $\{s_n;\ n \in \mathbb{N}\}$ be an arbitrary
Cauchy sequence in the space (S, d). For each $n \in \mathbb{N}$ we choose an element $\xi_n$ in
$(S_X, d)$ such that $d(s_n, \xi_n) < 1/n$. Then the sequence $\{\xi_n;\ n \in \mathbb{N}\}$, like the sequence
$\{s_n;\ n \in \mathbb{N}\}$, is a Cauchy sequence. But in that case the sequence $\{x_n = f^{-1}(\xi_n);\ n \in \mathbb{N}\}$
will also be a Cauchy sequence. The sequence $\{x_n;\ n \in \mathbb{N}\}$ defines an element
$s \in S$, to which the given sequence $\{s_n;\ n \in \mathbb{N}\}$ converges by virtue of relation
(9.19). □

Remark 1 Now that Propositions 1 and 2 have been proved, it becomes understand- 
able that the completion of a metric space in the sense of Definition 5' is indeed the 
smallest complete space containing (up to isometry) the given metric space. In this 
way we have justified the original Definition 4 and made it precise. 

Remark 2 The construction of the set $\mathbb{R}$ of real numbers, starting from the set $\mathbb{Q}$
of rational numbers, could have been carried out exactly as in the construction of
the completion of a metric space, which was done in full generality above. That is
exactly how the transition from $\mathbb{Q}$ to $\mathbb{R}$ was carried out by Cantor.
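
The idea of a real number as a class of equivalent Cauchy sequences of rationals can be made tangible in exact rational arithmetic. The sketch below is only an illustration: the recurrence $x_{k+1} = (x_k + 2/x_k)/2$ and the starting points 1 and 3 are our own hypothetical choices. Both sequences consist of rational numbers, the distance between their terms tends to zero (they are confinal), and no term satisfies $x^2 = 2$, so the class they define has no representative point in $\mathbb{Q}$.

```python
from fractions import Fraction


def sqrt2_sequence(start, n):
    """First n terms of x_{k+1} = (x_k + 2/x_k)/2, computed exactly in Q."""
    terms = [Fraction(start)]
    for _ in range(n - 1):
        x = terms[-1]
        terms.append((x + 2 / x) / 2)
    return terms


xs = sqrt2_sequence(1, 7)
ys = sqrt2_sequence(3, 7)
for x, y in zip(xs, ys):
    # |x - y| is the distance in Q between corresponding terms;
    # x*x - 2 measures how far the rational x is from being a square root of 2.
    print(float(abs(x - y)), float(x * x - 2))
```

Both printed columns tend to zero, although $x^2 - 2$ never vanishes for a rational x.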

Remark 3 In Example 6 we showed that the space $\mathcal{R}[a,b]$ of Riemann-integrable
functions is not complete in the natural integral metric. Its completion is the important
space $\mathcal{L}[a,b]$ of Lebesgue-integrable functions.


9.5.3 Problems and Exercises 

1. a) Prove the following nested ball lemma. Let (X, d) be a metric space and
$B(x_1, r_1) \supset \cdots \supset B(x_n, r_n) \supset \cdots$ a nested sequence of closed balls in X whose
radii tend to zero. The space (X, d) is complete if and only if for every such sequence
there exists a unique point belonging to all the balls of the sequence.

b) Show that if the condition $r_n \to 0$ as $n \to \infty$ is omitted from the lemma
stated above, the intersection of a nested sequence of balls may be empty, even in a
complete space.

2. a) A set $E \subset X$ of a metric space (X, d) is nowhere dense in X if it is not dense
in any ball, that is, if for every ball $B(x, r)$ there is a second ball $B(x_1, r_1) \subset B(x, r)$
containing no points of the set E.

A set E is of first category in X if it can be represented as a countable union of 
nowhere dense sets. 

A set that is not of first category is of second category in X. 

Show that a complete metric space is a set of second category (in itself). 

b) Show that if a function $f \in C^{(\infty)}[a,b]$ is such that
$\forall x \in [a,b]\ \exists n \in \mathbb{N}\ \forall m > n\ (f^{(m)}(x) = 0)$, then the function $f$ is a polynomial.


9.6 Continuous Mappings of Topological Spaces 

From the point of view of analysis, the present section and the one following contain 
the most important results in the present chapter. 

The basic concepts and propositions discussed here form a natural, sometimes
verbatim extension to the case of mappings of arbitrary topological or metric spaces,
of concepts and propositions that are already well known to us. In the process,
not only the statements but also the proofs of many facts turn out to be identical
with those already considered; in such cases the proofs are naturally omitted with a
reference to the corresponding propositions that were discussed in detail earlier.


9.6.1 The Limit of a Mapping 

a. The Basic Definition and Special Cases of It 

Definition 1 Let $f : X \to Y$ be a mapping of the set X with a fixed base $\mathcal{B} = \{B\}$
in X into a topological space Y. The point $A \in Y$ is the limit of the mapping
$f : X \to Y$ over the base $\mathcal{B}$, and we write $\lim_{\mathcal{B}} f(x) = A$, if for every neighborhood
$V(A)$ of A in Y there exists an element $B \in \mathcal{B}$ of the base $\mathcal{B}$ whose image under the
mapping $f$ is contained in $V(A)$.

In logical symbols Definition 1 has the form

$$\lim_{\mathcal{B}} f(x) = A := \forall V(A) \subset Y\ \exists B \in \mathcal{B}\ (f(B) \subset V(A)).$$

We shall most often encounter the case in which X, like Y, is a topological space
and $\mathcal{B}$ is the base of neighborhoods or deleted neighborhoods of some point $a \in X$.
Retaining our earlier notation $x \to a$ for the base of deleted neighborhoods $\{\mathring{U}(a)\}$
of the point a, we can specialize Definition 1 for this base:

$$\lim_{x \to a} f(x) = A := \forall V(A) \subset Y\ \exists \mathring{U}(a) \subset X\ (f(\mathring{U}(a)) \subset V(A)).$$

If $(X, d_X)$ and $(Y, d_Y)$ are metric spaces, this last definition can be restated in
ε-δ language:

$$\lim_{x \to a} f(x) = A := \forall \varepsilon > 0\ \exists \delta > 0\ \forall x \in X\ (0 < d_X(a, x) < \delta \Rightarrow d_Y(A, f(x)) < \varepsilon).$$

In other words,

$$\lim_{x \to a} f(x) = A \iff \lim_{x \to a} d_Y(A, f(x)) = 0.$$

Thus we see that, having the concept of a neighborhood, one can define the concept
of the limit of a mapping $f : X \to Y$ into a topological or metric space Y just
as was done in the case $Y = \mathbb{R}$ or, more generally, $Y = \mathbb{R}^n$.


b. Properties of the Limit of a Mapping 

We now make some remarks on the general properties of the limit. 

We first note that the uniqueness of the limit obtained earlier no longer holds
when Y is not a Hausdorff space. But if Y is a Hausdorff space, then the limit is
unique and the proof does not differ at all from the one given in the special cases
$Y = \mathbb{R}$ or $Y = \mathbb{R}^n$.

Next, if $f : X \to Y$ is a mapping into a metric space, it makes sense to speak of
the boundedness of the mapping (meaning the boundedness of the set $f(X)$ in Y),
and of ultimate boundedness of a mapping with respect to the base $\mathcal{B}$ in X (meaning
that there exists an element B of $\mathcal{B}$ on which $f$ is bounded).

It follows from the definition of a limit that if a mapping $f : X \to Y$ of a set X
with base $\mathcal{B}$ into a metric space Y has a limit over the base $\mathcal{B}$, then it is ultimately
bounded over that base.


c. Questions Involving the Existence of the Limit of a Mapping 

Proposition 1 (Limit of a composition of mappings) Let Y be a set with base $\mathcal{B}_Y$
and $g : Y \to Z$ a mapping of Y into a topological space Z having a limit over the
base $\mathcal{B}_Y$.

Let X be a set with base $\mathcal{B}_X$ and $f : X \to Y$ a mapping of X into Y such that for
every element $B_Y \in \mathcal{B}_Y$ there exists an element $B_X \in \mathcal{B}_X$ whose image is contained
in $B_Y$, that is, $f(B_X) \subset B_Y$.

Under these hypotheses the composition $g \circ f : X \to Z$ of the mappings f and
g is defined and has a limit over the base $\mathcal{B}_X$, and

$$\lim_{\mathcal{B}_X} g \circ f(x) = \lim_{\mathcal{B}_Y} g(y).$$

For the proof see Theorem 5 of Sect. 3.2.

Another important proposition on the existence of the limit is the Cauchy criterion,
to which we now turn. This time we will be discussing a mapping $f : X \to Y$
into a metric space, and in fact a complete metric space.

In the case of a mapping $f : X \to Y$ of the set X into a metric space (Y, d) it is
natural to adopt the following definition.

Definition 2 The oscillation of the mapping $f : X \to Y$ on a set $E \subset X$ is the
quantity

$$\omega(f, E) = \sup_{x_1, x_2 \in E} d(f(x_1), f(x_2)).$$


The following proposition holds. 

Proposition 2 (Cauchy criterion for existence of the limit of a mapping) Let X be
a set with a base $\mathcal{B}$, and let $f : X \to Y$ be a mapping of X into a complete metric
space (Y, d).

A necessary and sufficient condition for the mapping f to have a limit over the
base $\mathcal{B}$ is that for every $\varepsilon > 0$ there exists an element B in $\mathcal{B}$ on which the oscillation
of the mapping is less than $\varepsilon$.

More briefly:

$$\exists \lim_{\mathcal{B}} f(x) \iff \forall \varepsilon > 0\ \exists B \in \mathcal{B}\ (\omega(f, B) < \varepsilon).$$

For the proof see Theorem 4 of Sect. 3.2. 

It is useful to remark that the completeness of the space Y is needed only in the 
implication from the right-hand side to the left-hand side. Moreover, if Y is not a 
complete space, it is usually this implication that breaks down. 


9.6.2 Continuous Mappings 

a. Basic Definitions 

Definition 3 A mapping $f : X \to Y$ of a topological space $(X, \tau_X)$ into a topological
space $(Y, \tau_Y)$ is continuous at a point $a \in X$ if for every neighborhood
$V(f(a)) \subset Y$ of the point $f(a) \in Y$ there exists a neighborhood $U(a) \subset X$ of the
point $a \in X$ whose image $f(U(a))$ is contained in $V(f(a))$.

Thus,

$$f : X \to Y \text{ is continuous at } a \in X := \forall V(f(a))\ \exists U(a)\ (f(U(a)) \subset V(f(a))).$$





In the case when X and Y are metric spaces $(X, d_X)$ and $(Y, d_Y)$, Definition 3
can of course be stated in ε-δ language:

$$f : X \to Y \text{ is continuous at } a \in X := \forall \varepsilon > 0\ \exists \delta > 0\ \forall x \in X\ (d_X(a, x) < \delta \Rightarrow d_Y(f(a), f(x)) < \varepsilon).$$

Definition 4 The mapping $f : X \to Y$ is continuous if it is continuous at each point
$x \in X$.

The set of continuous mappings from X into Y will be denoted C(X,Y). 

Theorem 1 (Criterion for continuity) A mapping $f : X \to Y$ of a topological space
$(X, \tau_X)$ into a topological space $(Y, \tau_Y)$ is continuous if and only if the pre-image
of every open (resp. closed) subset of Y is open (resp. closed) in X.

Proof Since the pre-image of a complement is the complement of the pre-image, it
suffices to prove the assertions for open sets.

We first show that if $f \in C(X, Y)$ and $G_Y \in \tau_Y$, then $G_X = f^{-1}(G_Y)$ belongs
to $\tau_X$. If $G_X = \varnothing$, it is immediate that the pre-image is open. If $G_X \ne \varnothing$
and $a \in G_X$, then by definition of continuity of the mapping $f$ at the point a,
for the neighborhood $G_Y$ of the point $f(a)$ there exists a neighborhood $U_X(a)$
of $a \in X$ such that $f(U_X(a)) \subset G_Y$. Hence $U_X(a) \subset G_X = f^{-1}(G_Y)$. Since
$G_X = \bigcup_{a \in G_X} U_X(a)$, we conclude that $G_X$ is open, that is, $G_X \in \tau_X$.

We now prove that if the pre-image of every open set in Y is open in X, then
$f \in C(X, Y)$. But, taking any point $a \in X$ and any neighborhood $V_Y(f(a))$ of its
image $f(a)$ in Y, we discover that the set $U_X(a) = f^{-1}(V_Y(f(a)))$ is an open
neighborhood of $a \in X$, whose image is contained in $V_Y(f(a))$. Consequently we
have verified the definition of continuity of the mapping $f : X \to Y$ at an arbitrary
point $a \in X$. □
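
For finite topological spaces the criterion of Theorem 1 can be tested mechanically, since there are only finitely many open sets to examine. The sketch below is an illustration under our own assumptions: the spaces, the topologies `tau_x` and `tau_y`, and the two mappings are hypothetical examples chosen for the purpose, and a mapping is represented simply as a dictionary.

```python
def preimage(f, subset):
    """Pre-image of a subset of Y under a mapping f given as a dict X -> Y."""
    return frozenset(x for x, y in f.items() if y in subset)


def is_continuous(f, tau_x, tau_y):
    """Criterion of Theorem 1: every open set of Y has an open pre-image in X."""
    return all(preimage(f, g) in tau_x for g in tau_y)


# Hypothetical example: X = {0, 1, 2} and Y = {'a', 'b'} with nested open sets.
tau_x = {frozenset(), frozenset({0}), frozenset({0, 1}), frozenset({0, 1, 2})}
tau_y = {frozenset(), frozenset({'a'}), frozenset({'a', 'b'})}

f = {0: 'a', 1: 'a', 2: 'b'}  # pre-image of {'a'} is {0, 1}, which is open
g = {0: 'b', 1: 'a', 2: 'b'}  # pre-image of {'a'} is {1}, which is not open

print(is_continuous(f, tau_x, tau_y), is_continuous(g, tau_x, tau_y))
```

Here the first mapping passes the test and the second fails it, because the one-point set {1} does not belong to the chosen topology on X.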

Definition 5 A bijective mapping $f : X \to Y$ of one topological space $(X, \tau_X)$ onto
another $(Y, \tau_Y)$ is a homeomorphism if both the mapping itself and the inverse mapping
$f^{-1} : Y \to X$ are continuous.

Definition 6 Topological spaces that admit homeomorphisms onto one another are 
said to be homeomorphic. 

As Theorem 1 shows, under a homeomorphism $f : X \to Y$ of the topological
space $(X, \tau_X)$ onto $(Y, \tau_Y)$ the systems of open sets $\tau_X$ and $\tau_Y$ correspond to each
other in the sense that $G_X \in \tau_X \Leftrightarrow f(G_X) = G_Y \in \tau_Y$.

Thus, from the point of view of their topological properties homeomorphic 
spaces are absolutely identical. Consequently, homeomorphism is the same kind 
of equivalence relation in the set of all topological spaces as, for example, isometry 
is in the set of metric spaces.

b. Local Properties of Continuous Mappings

We now exhibit the local properties of continuous mappings. They follow immedi- 
ately from the corresponding properties of the limit. 

Proposition 3 (Continuity of a composition of continuous mappings) Let $(X, \tau_X)$,
$(Y, \tau_Y)$ and $(Z, \tau_Z)$ be topological spaces. If the mapping $g : Y \to Z$ is continuous
at a point $b \in Y$ and the mapping $f : X \to Y$ is continuous at a point $a \in X$ for
which $f(a) = b$, then the composition of these mappings $g \circ f : X \to Z$ is continuous
at $a \in X$.

This follows from the definition of continuity of a mapping and Proposition 1.

Proposition 4 (Boundedness of a mapping in a neighborhood of a point of continuity)
If a mapping $f : X \to Y$ of a topological space $(X, \tau)$ into a metric space
$(Y, d)$ is continuous at a point $a \in X$, then it is bounded in some neighborhood of
that point.

This proposition follows from the ultimate boundedness (over a base) of a map- 
ping that has a limit. 

Before stating the next proposition on properties of continuous mappings, we
recall that for mappings into $\mathbb{R}$ or $\mathbb{R}^n$ we defined the quantity

$$\omega(f; a) := \lim_{r \to +0} \omega(f, B(a, r))$$

to be the oscillation of f at the point a. Since both the concept of the oscillation
of a mapping on a set and the concept of a ball $B(a, r)$ make sense in any metric
space, the definition of the oscillation $\omega(f; a)$ of the mapping $f$ at the point a also
makes sense for a mapping $f : X \to Y$ of a metric space $(X, d_X)$ into a metric space
$(Y, d_Y)$.

Proposition 5 A mapping $f : X \to Y$ of a metric space $(X, d_X)$ into a metric space
$(Y, d_Y)$ is continuous at the point $a \in X$ if and only if $\omega(f; a) = 0$.

This proposition follows immediately from the definition of continuity of a map- 
ping at a point. 


c. Global Properties of Continuous Mappings 

We now discuss some of the important global properties of continuous mappings. 


Theorem 2 The image of a compact set under a continuous mapping is compact.

Proof Let $f : K \to Y$ be a continuous mapping of the compact space $(K, \tau_K)$ into
a topological space $(Y, \tau_Y)$, and let $\{G_Y^\alpha,\ \alpha \in A\}$ be a covering of $f(K)$ by sets
that are open in Y. By Theorem 1, the sets $\{G_X^\alpha = f^{-1}(G_Y^\alpha),\ \alpha \in A\}$ form an open
covering of K. Extracting a finite covering $G_X^{\alpha_1}, \ldots, G_X^{\alpha_n}$, we find a finite covering
$G_Y^{\alpha_1}, \ldots, G_Y^{\alpha_n}$ of $f(K) \subset Y$. Thus $f(K)$ is compact in Y. □

Corollary A continuous real-valued function $f : K \to \mathbb{R}$ on a compact set assumes
its maximal value at some point of the compact set (and also its minimal value, at
some point).

Proof Indeed, $f(K)$ is a compact set in $\mathbb{R}$, that is, it is closed and bounded. This
means that $\inf f(K) \in f(K)$ and $\sup f(K) \in f(K)$. □

In particular, if K is a closed interval [a, b] C R, we again obtain the classical 
theorem of Weierstrass. 

Cantor’s theorem on uniform continuity carries over verbatim to mappings that 
are continuous on compact sets. Before stating it, we must give a necessary defini- 
tion. 

Definition 7 A mapping $f : X \to Y$ of a metric space $(X, d_X)$ into a metric space
$(Y, d_Y)$ is uniformly continuous if for every $\varepsilon > 0$ there exists $\delta > 0$ such that the
oscillation $\omega(f, E)$ of $f$ on each set $E \subset X$ of diameter less than $\delta$ is less than $\varepsilon$.

Theorem 3 (Uniform continuity) A continuous mapping $f : K \to Y$ of a compact
metric space K into a metric space $(Y, d_Y)$ is uniformly continuous.

In particular, if K is a closed interval in $\mathbb{R}$ and $Y = \mathbb{R}$, we again have the classical
theorem of Cantor, the proof of which given in Sect. 4.2.2 carries over with almost
no changes to this general case.

Let us now consider continuous mappings of connected spaces. 

Theorem 4 The image of a connected topological space under a continuous map- 
ping is connected. 

Proof Let $f : X \to Y$ be a continuous mapping of a connected topological space
$(X, \tau_X)$ onto a topological space $(Y, \tau_Y)$. Let $E_Y$ be an open-closed subset of Y. By
Theorem 1, the pre-image $E_X = f^{-1}(E_Y)$ of the set $E_Y$ is open-closed in X. By the
connectedness of X, either $E_X = \varnothing$ or $E_X = X$. But this means that either $E_Y = \varnothing$
or $E_Y = Y = f(X)$. □

Corollary If a function $f : X \to \mathbb{R}$ is continuous on a connected topological space
$(X, \tau)$ and assumes values $f(a) = A \in \mathbb{R}$ and $f(b) = B \in \mathbb{R}$, then for any number
C between A and B there exists a point $c \in X$ at which $f(c) = C$.

Proof Indeed, by Theorem 4 $f(X)$ is a connected set in $\mathbb{R}$. But the only connected
subsets of $\mathbb{R}$ are intervals (see the Proposition in Sect. 9.4). Thus the point C belongs
to $f(X)$ along with A and B. □

In particular, if X is a closed interval, we again have the classical intermediate- 
value theorem for a continuous real-valued function. 


9.6.3 Problems and Exercises 

1. a) If the mapping $f : X \to Y$ is continuous, will the images of open (or closed)
sets in X be open (or closed) in Y?

b) If the image, as well as the inverse image, of an open set under the mapping
$f : X \to Y$ is open, does it necessarily follow that $f$ is a homeomorphism?

c) If the mapping $f : X \to Y$ is continuous and bijective, is it necessarily a
homeomorphism?

d) Is a mapping satisfying b) and c) simultaneously a homeomorphism? 

2. Show the following. 

a) Every continuous bijective mapping of a compact space into a Hausdorff 
space is a homeomorphism. 

b) Without the requirement that the range be a Hausdorff space, the preceding 
statement is in general not true. 

3. Determine whether the following subsets of $\mathbb{R}^n$ are (pairwise) homeomorphic as
topological spaces: a line, an open interval on the line, a closed interval on the line;
a sphere; a torus.

4. A topological space $(X, \tau)$ is arcwise connected or path connected if any two
of its points can be joined by a path lying in X. More precisely, this means that for
any points A and B in X there exists a continuous mapping $f : I \to X$ of a closed
interval $I = [a, b] \subset \mathbb{R}$ into X such that $f(a) = A$ and $f(b) = B$.

a) Show that every path connected space is connected.

b) Show that every convex set in $\mathbb{R}^n$ is path connected.

c) Verify that every connected open subset of $\mathbb{R}^n$ is path connected.

d) Show that a sphere $S(a, r)$ is path connected in $\mathbb{R}^n$, but that it may fail to be
connected in another metric space, endowed with a completely different topology.

e) Verify that in a topological space it is impossible to join an interior point of a 
set to an exterior point without intersecting the boundary of the set. 


9.7 The Contraction Mapping Principle 

Here we shall establish a principle that, despite its simplicity, turns out to be an 
effective way of proving many existence theorems.

Definition 1 A point $a \in X$ is a fixed point of a mapping $f : X \to X$ if $f(a) = a$.

Definition 2 A mapping $f : X \to X$ of a metric space (X, d) into itself is called a
contraction if there exists a number q, $0 < q < 1$, such that the inequality

$$d(f(x_1), f(x_2)) \le q\,d(x_1, x_2) \qquad (9.20)$$

holds for any points $x_1$ and $x_2$ in X.

Theorem (Picard⁶-Banach⁷ fixed-point principle) A contraction mapping $f : X \to X$
of a complete metric space (X, d) into itself has a unique fixed point a.

Moreover, for any point $x_0 \in X$ the recursively defined sequence
$x_0, x_1 = f(x_0), \ldots, x_{n+1} = f(x_n), \ldots$ converges to a. The rate of convergence is given by
the estimate

$$d(a, x_n) \le \frac{q^n}{1-q}\, d(x_1, x_0). \qquad (9.21)$$

Proof We shall take an arbitrary point $x_0 \in X$ and show that the sequence
$x_0, x_1 = f(x_0), \ldots, x_{n+1} = f(x_n), \ldots$ is a Cauchy sequence. The mapping $f$ is a contraction,
so that by Eq. (9.20)

$$d(x_{n+1}, x_n) \le q\,d(x_n, x_{n-1}) \le \cdots \le q^n d(x_1, x_0)$$

and

$$d(x_{n+k}, x_n) \le d(x_n, x_{n+1}) + \cdots + d(x_{n+k-1}, x_{n+k}) \le (q^n + q^{n+1} + \cdots + q^{n+k-1})\, d(x_1, x_0) \le \frac{q^n}{1-q}\, d(x_1, x_0).$$

From this one can see that the sequence $x_0, x_1, \ldots, x_n, \ldots$ is indeed a Cauchy
sequence.

The space (X, d) is complete, so that this sequence has a limit $\lim_{n\to\infty} x_n = a \in X$.

It is clear from the definition of a contraction mapping that a contraction is always
continuous, and therefore

$$a = \lim_{n\to\infty} x_{n+1} = \lim_{n\to\infty} f(x_n) = f\Bigl(\lim_{n\to\infty} x_n\Bigr) = f(a).$$

Thus a is a fixed point of the mapping $f$.


⁶ Ch.E. Picard (1856-1941) - French mathematician who obtained many important results in the
theory of differential equations and analytic function theory.

⁷ S. Banach (1892-1945) - Polish mathematician, one of the founders of functional analysis.

The mapping $f$ cannot have a second fixed point, since the relations $a_i = f(a_i)$,
$i = 1, 2$, imply, when we take account of (9.20), that

$$0 \le d(a_1, a_2) = d(f(a_1), f(a_2)) \le q\,d(a_1, a_2),$$

which is possible only if $d(a_1, a_2) = 0$, that is, $a_1 = a_2$.

Next, by passing to the limit as $k \to \infty$ in the relation

$$d(x_{n+k}, x_n) \le \frac{q^n}{1-q}\, d(x_1, x_0),$$

we find that

$$d(a, x_n) \le \frac{q^n}{1-q}\, d(x_1, x_0).$$ □
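
The iteration in the theorem and the a priori estimate (9.21) are easy to observe on a concrete contraction. The sketch below is an illustration under our own assumptions: we take the hypothetical example $f(x) = \cos x$ on the complete space X = [0, 1], which maps X into itself and satisfies (9.20) with $q = \sin 1$, since $|f'(x)| = |\sin x| \le \sin 1$ on [0, 1]. The limit a is replaced by the result of a long iteration.

```python
import math


def iterate(f, x0, n):
    """Return the list [x_0, x_1, ..., x_n] with x_{k+1} = f(x_k)."""
    xs = [x0]
    for _ in range(n):
        xs.append(f(xs[-1]))
    return xs


q = math.sin(1.0)                 # contraction constant of cos on [0, 1]
xs = iterate(math.cos, 0.0, 200)  # x_0 = 0 is an arbitrary starting point
a = xs[-1]                        # numerical stand-in for the fixed point

for n in (0, 5, 10, 20):
    bound = q ** n / (1 - q) * abs(xs[1] - xs[0])  # right-hand side of (9.21)
    print(n, abs(a - xs[n]), bound)
```

In each printed row the actual error does not exceed the bound. The bound is usually pessimistic, because q is a global constant while the convergence near a is governed by the local behaviour of f.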

The following proposition supplements this theorem. 

Proposition (Stability of the fixed point) Let (X, d) be a complete metric space
and $(\Omega, \tau)$ a topological space that will play the role of a parameter space in what
follows.

Suppose to each value of the parameter $t \in \Omega$ there corresponds a contraction
mapping $f_t : X \to X$ of the space X into itself and that the following conditions
hold.

a) The family $\{f_t;\ t \in \Omega\}$ is uniformly contracting, that is, there exists q,
$0 < q < 1$, such that each mapping $f_t$ is a q-contraction.

b) For each $x \in X$ the mapping $f_t(x) : \Omega \to X$ is continuous as a function of t
at some point $t_0 \in \Omega$, that is $\lim_{t \to t_0} f_t(x) = f_{t_0}(x)$.

Then the solution $a(t) \in X$ of the equation $x = f_t(x)$ depends continuously on t
at the point $t_0$, that is, $\lim_{t \to t_0} a(t) = a(t_0)$.

Proof As was shown in the proof of the theorem, the solution $a(t)$ of the equation
$x = f_t(x)$ can be obtained as the limit of the sequence $\{x_{n+1} = f_t(x_n);\ n = 0, 1, \ldots\}$
starting from any point $x_0 \in X$. Let $x_0 = a(t_0) = f_{t_0}(a(t_0))$.

Taking account of the estimate (9.21) and condition a), we obtain

$$d(a(t), a(t_0)) = d(a(t), x_0) \le \frac{1}{1-q}\, d(x_1, x_0) = \frac{1}{1-q}\, d\bigl(f_t(a(t_0)), f_{t_0}(a(t_0))\bigr).$$

By condition b), the last term in this relation tends to zero as $t \to t_0$. Thus it has
been proved that

$$\lim_{t \to t_0} d(a(t), a(t_0)) = 0, \quad \text{that is} \quad \lim_{t \to t_0} a(t) = a(t_0).$$

□ 
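
The estimate obtained in this proof can be tried out on a simple one-parameter family. The sketch below is an illustration under our own assumptions: the family $f_t(x) = \tfrac12 \cos x + t$ on $X = \mathbb{R}$ with parameter $t \in \mathbb{R}$ is a hypothetical example, uniformly contracting with q = 1/2, and the fixed point $a(t)$ is approximated by a fixed number of iterations.

```python
import math

Q = 0.5  # common contraction constant of the hypothetical family below


def f(t, x):
    """Hypothetical family f_t(x) = cos(x)/2 + t, a Q-contraction for every t."""
    return 0.5 * math.cos(x) + t


def fixed_point(t, x0=0.0, n=200):
    """Approximate a(t) by n iterations of the mapping x -> f_t(x)."""
    x = x0
    for _ in range(n):
        x = f(t, x)
    return x


t0 = 1.0
a0 = fixed_point(t0)
for t in (1.5, 1.1, 1.01, 1.001):
    # Bound from the proof: d(f_t(a(t0)), f_t0(a(t0))) / (1 - q).
    bound = abs(f(t, a0) - f(t0, a0)) / (1 - Q)
    print(t, abs(fixed_point(t) - a0), bound)
```

As t approaches $t_0$, both the bound and the actual displacement of the fixed point tend to zero, and the displacement stays below the bound.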
Example 1 As an important example of the application of the contraction mapping
principle we shall prove, following Picard, an existence theorem for the solution
of the differential equation $y'(x) = f(x, y(x))$ satisfying an initial condition
$y(x_0) = y_0$.

If the function $f \in C(\mathbb{R}^2, \mathbb{R})$ is such that

$$|f(u, v_1) - f(u, v_2)| \le M|v_1 - v_2|,$$

where M is a constant, then, for any initial condition

$$y(x_0) = y_0, \qquad (9.22)$$

there exists a neighborhood $U(x_0)$ of $x_0 \in \mathbb{R}$ and a unique function $y = y(x)$ defined
in $U(x_0)$ satisfying the equation

$$y' = f(x, y) \qquad (9.23)$$

and the initial condition (9.22).


Proof Equation (9.23) and the condition (9.22) can be jointly written as a single
relation

$$y(x) = y_0 + \int_{x_0}^{x} f(t, y(t))\,dt. \qquad (9.24)$$

Denoting the right-hand side of this equality by A(y), we find that
$A : C(V(x_0), \mathbb{R}) \to C(V(x_0), \mathbb{R})$ is a mapping of the set of continuous functions defined
on a neighborhood $V(x_0)$ of $x_0$ into itself. Regarding $C(V(x_0), \mathbb{R})$ as a metric
space with the uniform metric (see formula (9.6) from Sect. 9.1), we find that

$$d(Ay_1, Ay_2) = \max_{x \in V(x_0)} \left| \int_{x_0}^{x} f(t, y_1(t))\,dt - \int_{x_0}^{x} f(t, y_2(t))\,dt \right| \le \max_{x \in V(x_0)} \left| \int_{x_0}^{x} M|y_1(t) - y_2(t)|\,dt \right| \le M|x - x_0|\, d(y_1, y_2).$$

If we assume that $|x - x_0| \le \frac{1}{2M}$, then the inequality

$$d(Ay_1, Ay_2) \le \frac{1}{2}\, d(y_1, y_2)$$

is fulfilled on the corresponding closed interval I, where
$d(y_1, y_2) = \max_{x \in I} |y_1(x) - y_2(x)|$. Thus we have a contraction mapping

$$A : C(I, \mathbb{R}) \to C(I, \mathbb{R})$$

of the complete metric space $(C(I, \mathbb{R}), d)$ (see Example 4 of Sect. 9.5) into itself,
which by the contraction mapping principle must have a unique fixed point $y = Ay$.
But this means that the function in $C(I, \mathbb{R})$ just found is the unique function defined
on $I \ni x_0$ and satisfying Eq. (9.24). □

Example 2 As an illustration of what was just said, we shall seek a solution of the
familiar equation

$$y' = y$$

with the initial condition (9.22) on the basis of the contraction mapping principle.
In this case

$$Ay = y_0 + \int_{x_0}^{x} y(t)\,dt,$$

and the principle is applicable at least for $|x - x_0| \le q < 1$.

Starting from the initial approximation $y(x) \equiv 0$, we construct successively the
sequence $0, y_1 = A(0), \ldots, y_{n+1} = A(y_n), \ldots$ of approximations

$$\begin{aligned}
y_1(x) &= y_0,\\
y_2(x) &= y_0\bigl(1 + (x - x_0)\bigr),\\
y_3(x) &= y_0\Bigl(1 + (x - x_0) + \tfrac{1}{2}(x - x_0)^2\Bigr),\\
&\;\;\vdots\\
y_{n+1}(x) &= y_0\Bigl(1 + (x - x_0) + \tfrac{1}{2!}(x - x_0)^2 + \cdots + \tfrac{1}{n!}(x - x_0)^n\Bigr),
\end{aligned}$$

from which it is already clear that

$$y(x) = y_0 e^{x - x_0}.$$

The fixed-point principle stated in the theorem above also goes by the name of 
the contraction mapping principle. It arose as a generalization of Picard’s proof 
of the existence theorem for a solution of the differential equation (9.23), which 
was discussed in Example 1 . The contraction mapping principle was stated in full 
generality by Banach. 
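
The successive approximations of Example 2 can be reproduced mechanically. The following Python sketch is only an illustration under stated assumptions: each iterate is stored as the list of coefficients of a polynomial in $s = x - x_0$, and the helper names `picard_step` and `picard_iterates` are hypothetical, not taken from the text.

```python
from fractions import Fraction
from math import factorial


def picard_step(coeffs, y0):
    """Apply A(y) = y0 + (integral of y from x0 to x) to a polynomial in s = x - x0.

    coeffs[k] is the coefficient of s**k; the result uses the same convention.
    """
    # Integrating c * s**k from 0 to s gives c / (k + 1) * s**(k + 1).
    integrated = [Fraction(c) / (k + 1) for k, c in enumerate(coeffs)]
    return [Fraction(y0)] + integrated


def picard_iterates(y0, n):
    """Return the coefficients of y_n, starting from the zero function."""
    y = []  # the zero polynomial (initial approximation y = 0)
    for _ in range(n):
        y = picard_step(y, y0)
    return y


if __name__ == "__main__":
    # Coefficients of y_5 for y0 = 3: they should be 3 / k! for k = 0..4.
    print(picard_iterates(3, 5))
```

Each step shifts the coefficients one place and divides by the new exponent, which is exactly how the partial sums of $y_0 e^{x - x_0}$ appear.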

Example 3 (Newton’s method of finding a root of the equation $f(x) = 0$) Suppose a real-valued function $f$ that is convex and has a positive derivative on a closed interval $[\alpha, \beta]$ assumes values of opposite signs at the endpoints of the interval. Then there is a unique point $a$ in the interval at which $f(a) = 0$. In addition to the elementary method of finding the point $a$ by successive bisection of the interval, there also exist more sophisticated and rapid methods of finding it, using the properties of the function $f$. Thus, in the present case, one may use the following method, proposed by Newton and called Newton’s method or the method of tangents. Take an arbitrary point $x_0 \in [\alpha, \beta]$ and write the equation $y = f(x_0) + f'(x_0)(x - x_0)$ of the tangent to the graph of the function at the point $(x_0, f(x_0))$. We then find the point $x_1 = x_0 - [f'(x_0)]^{-1} \cdot f(x_0)$ where the tangent intersects the $x$-axis (Fig. 9.3). We take $x_1$ as the first approximation of the root $a$ and repeat this operation, replacing $x_0$ by $x_1$. In this way we obtain a sequence

$$x_{n+1} = x_n - [f'(x_n)]^{-1} \cdot f(x_n) \tag{9.25}$$

of points that, as one can verify, will tend monotonically to $a$ in the present case.

In particular, if $f(x) = x^k - a$, that is, when we are seeking $\sqrt[k]{a}$, where $a > 0$, the recurrence relation (9.25) has the form

$$x_{n+1} = x_n - \frac{x_n^k - a}{k x_n^{k-1}},$$

which for $k = 2$ becomes the familiar expression

$$x_{n+1} = \frac{1}{2}\Bigl(x_n + \frac{a}{x_n}\Bigr).$$

The method (9.25) for forming the sequence $\{x_n\}$ is called Newton’s method.

If instead of the sequence (9.25) we consider the sequence obtained by the recurrence relation

$$x_{n+1} = x_n - [f'(x_0)]^{-1} \cdot f(x_n), \tag{9.26}$$

we speak of the modified Newton’s method.⁸ The modification amounts to computing the derivative once and for all at the point $x_0$.

Consider the mapping

$$x \mapsto A(x) = x - [f'(x_0)]^{-1} \cdot f(x). \tag{9.27}$$

By Lagrange’s theorem

$$|A(x_2) - A(x_1)| = \bigl|1 - [f'(x_0)]^{-1} \cdot f'(\xi)\bigr| \cdot |x_2 - x_1|,$$

where $\xi$ is a point lying between $x_1$ and $x_2$.

⁸ In functional analysis it has numerous applications and is called the Newton-Kantorovich method. L.V. Kantorovich (1912-1986) - eminent Soviet mathematician, whose research in mathematical economics earned him the Nobel Prize.

Thus, if the conditions

$$A(I) \subset I \tag{9.28}$$

and

$$\bigl|1 - [f'(x_0)]^{-1} \cdot f'(x)\bigr| \le q < 1 \tag{9.29}$$

hold on some closed interval $I \subset \mathbb{R}$, then the mapping $A : I \to I$ defined by relation (9.27) is a contraction of this closed interval. Then by the general principle it has a unique fixed point on the interval. But, as can be seen from (9.27), the condition $A(a) = a$ is equivalent to $f(a) = 0$.

Hence, when conditions (9.28) and (9.29) hold for a function $f$, the modified Newton’s method (9.26) leads to the required solution $x = a$ of the equation $f(x) = 0$ by the contraction mapping principle.
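
The recurrences (9.25) and (9.26) are short enough to try out directly. The sketch below is illustrative only: the function names `newton_root` and `modified_newton`, the fixed step counts, and the sample data ($a = 2$, $k = 2$, $x_0 = 1.5$) are assumptions made for the example, not part of the text.

```python
def newton_root(a, k, x0, steps=50):
    """Newton's method (9.25) for f(x) = x**k - a, i.e. for the k-th root of a > 0."""
    x = x0
    for _ in range(steps):
        # x_{n+1} = x_n - (x_n**k - a) / (k * x_n**(k - 1))
        x = x - (x ** k - a) / (k * x ** (k - 1))
    return x


def modified_newton(f, derivative_at_x0, x0, steps=200):
    """Modified Newton's method (9.26): the derivative is frozen at x0."""
    x = x0
    for _ in range(steps):
        # x_{n+1} = x_n - f(x_n) / f'(x_0)
        x = x - f(x) / derivative_at_x0
    return x


if __name__ == "__main__":
    print(newton_root(2.0, 2, 1.5))
    # f(x) = x**2 - 2, f'(1.5) = 3
    print(modified_newton(lambda x: x * x - 2.0, 3.0, 1.5))
```

Both calls approximate $\sqrt{2}$; the modified method needs more iterations because it contracts only linearly, at a rate governed by the derivative of the mapping (9.27).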


9.7.1 Problems and Exercises 

1. Show that condition (9.20) in the contraction mapping principle cannot be replaced by the weaker condition

$$d\bigl(f(x_1), f(x_2)\bigr) < d(x_1, x_2).$$

2. a) Prove that if a mapping $f : X \to X$ of a complete metric space $(X, d)$ into itself is such that some iteration of it $f^n : X \to X$ is a contraction, then $f$ has a unique fixed point.

b) Verify that the mapping $A : C(I, \mathbb{R}) \to C(I, \mathbb{R})$ in Example 2 is such that for any closed interval $I \subset \mathbb{R}$ some iteration $A^n$ of the mapping $A$ is a contraction.

c) Deduce from b) that the local solution $y = y_0 e^{x - x_0}$ found in Example 2 is actually a solution of the original equation on the entire real line.

3. a) Show that in the case of a function on $[\alpha, \beta]$ that is convex and has a positive derivative and assumes values of opposite signs at the endpoints, Newton’s method really does give a sequence $\{x_n\}$ that converges to the point $a \in [\alpha, \beta]$ at which $f(a) = 0$.

b) Estimate the rate of convergence of the sequence (9.25) to the point $a$.


Chapter 10 

*Differential Calculus from a More General
Point of View


10.1 Normed Vector Spaces 

Differentiation is the process of finding the best local linear approximation of a func- 
tion. For that reason any reasonably general theory of differentiation must be based 
on elementary ideas connected with linear functions. From the course in algebra the 
reader is well acquainted with the concept of a vector space, as well as linear de- 
pendence and independence of systems of vectors, bases and dimension of a vector 
space, vector subspaces, and so forth. In the present section we shall present vec- 
tor spaces with a norm, or as they are described, normed vector spaces, which are 
widely used in analysis. We begin, however, with some examples of vector spaces. 


10.1.1 Some Examples of Vector Spaces in Analysis 

Example 1 The real vector space $\mathbb{R}^n$ and the complex vector space $\mathbb{C}^n$ are classical examples of vector spaces of dimension $n$ over the fields of real and complex numbers respectively.

Example 2 In analysis, besides the spaces $\mathbb{R}^n$ and $\mathbb{C}^n$ exhibited in Example 1, we encounter the space closest to them, which is the space $\ell$ of sequences $x = (x^1, \ldots, x^n, \ldots)$ of real or complex numbers. The vector-space operations in $\ell$, as in $\mathbb{R}^n$ and $\mathbb{C}^n$, are carried out coordinatewise. One peculiarity of this space, when compared with $\mathbb{R}^n$ or $\mathbb{C}^n$, is that any finite subsystem of the countable system of vectors $\{x_i = (0, \ldots, 0, x^i = 1, 0, \ldots),\ i \in \mathbb{N}\}$ is linearly independent, that is, $\ell$ is an infinite-dimensional vector space (of countable dimension in the present case).

The set of finite sequences (all of whose terms are zero from some point on) is a vector subspace $\ell_0$ of the space $\ell$, also infinite-dimensional.

Example 3 Let $F[a, b]$ be the set of numerical-valued (real- or complex-valued) functions defined on the closed interval $[a, b]$. This set is a vector space over the corresponding number field with respect to the operations of addition of functions and multiplication of a function by a number.

The set of functions of the form

$$e_\tau(x) = \begin{cases} 0, & \text{if } x \in [a, b] \text{ and } x \ne \tau,\\ 1, & \text{if } x \in [a, b] \text{ and } x = \tau \end{cases}$$

is a continuously indexed system of linearly independent vectors in $F[a, b]$.

The set $C[a, b]$ of continuous functions is obviously a subspace of the space $F[a, b]$ just constructed.

Example 4 If $X_1$ and $X_2$ are two vector spaces over the same field, there is a natural way of introducing a vector-space structure into their direct product $X_1 \times X_2$, namely by carrying out the vector-space operations on elements $x = (x_1, x_2) \in X_1 \times X_2$ coordinatewise.

Similarly one can introduce a vector-space structure into the direct product $X_1 \times \cdots \times X_n$ of any finite set of vector spaces. This is completely analogous to the cases of $\mathbb{R}^n$ and $\mathbb{C}^n$.


10.1.2 Norms in Vector Spaces 

We begin with the basic definition. 

Definition 1 Let X be a vector space over the field of real or complex numbers. 

A function $\|\ \| : X \to \mathbb{R}$ assigning to each vector $x \in X$ a real number $\|x\|$ is called a norm in the vector space $X$ if it satisfies the following three conditions:

a) $\|x\| = 0 \Leftrightarrow x = 0$ (nondegeneracy);

b) $\|\lambda x\| = |\lambda|\,\|x\|$ (homogeneity);

c) $\|x_1 + x_2\| \le \|x_1\| + \|x_2\|$ (the triangle inequality).

Definition 2 A vector space with a norm defined on it is called a normed vector 
space. 

Definition 3 The value of the norm at a vector is called the norm of that vector. 

The norm of a vector is always nonnegative and, as can be seen by a), equals zero 
only for the zero vector. 

Proof Indeed, by c), taking account of a) and b), we obtain for every $x \in X$,

$$0 = \|0\| = \|x + (-x)\| \le \|x\| + \|-x\| = \|x\| + |-1|\,\|x\| = 2\|x\|.$$

□

By induction, condition c) implies the following general inequality:

$$\|x_1 + \cdots + x_n\| \le \|x_1\| + \cdots + \|x_n\|, \tag{10.1}$$

and taking account of b), one can easily deduce from c) the following useful inequality:

$$\bigl|\,\|x_1\| - \|x_2\|\,\bigr| \le \|x_1 - x_2\|. \tag{10.2}$$

Every normed vector space has a natural metric 

$$d(x_1, x_2) = \|x_1 - x_2\|. \tag{10.3}$$

The fact that the function $d(x_1, x_2)$ just defined satisfies the axioms for a metric follows immediately from the properties of the norm. Because of the vector-space structure in $X$ the metric $d$ in $X$ has two additional special properties:

$$d(x_1 + x, x_2 + x) = \|(x_1 + x) - (x_2 + x)\| = \|x_1 - x_2\| = d(x_1, x_2),$$

that is, the metric is translation-invariant, and

$$d(\lambda x_1, \lambda x_2) = \|\lambda x_1 - \lambda x_2\| = \|\lambda(x_1 - x_2)\| = |\lambda|\,\|x_1 - x_2\| = |\lambda|\,d(x_1, x_2),$$


that is, it is homogeneous. 

Definition 4 If a normed vector space is complete as a metric space with the natural 
metric (10.3), it is called a complete normed vector space or Banach space. 


Example 5 If for $p \ge 1$ we set

$$\|x\|_p := \Bigl(\sum_{i=1}^{n} |x^i|^p\Bigr)^{1/p} \tag{10.4}$$

for $x = (x^1, \ldots, x^n) \in \mathbb{R}^n$, it follows from Minkowski’s inequality that we obtain a norm on $\mathbb{R}^n$. The space $\mathbb{R}^n$ endowed with this norm will be denoted $\mathbb{R}^n_p$.

One can verify that

$$\|x\|_{p_2} \le \|x\|_{p_1}, \quad \text{if } 1 \le p_1 \le p_2, \tag{10.5}$$

and that

$$\|x\|_p \to \max\bigl\{|x^1|, \ldots, |x^n|\bigr\} \tag{10.6}$$

as $p \to +\infty$. Thus, it is natural to set

$$\|x\|_\infty := \max\bigl\{|x^1|, \ldots, |x^n|\bigr\}. \tag{10.7}$$

It then follows from (10.4) and (10.5) that

$$\|x\|_\infty \le \|x\|_p \le \|x\|_1 \le n\|x\|_\infty \quad \text{for } p \ge 1. \tag{10.8}$$

It is clear from this inequality, as in fact it is from the very definition of the norm $\|x\|_p$ in Eq. (10.4), that $\mathbb{R}^n_p$ is a complete normed vector space.
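
Relations (10.5)-(10.8) are easy to spot-check numerically. The Python sketch below is an illustration under stated assumptions: `p_norm` is a hypothetical helper name, vectors are plain lists of numbers, and `float("inf")` stands for $p = +\infty$.

```python
def p_norm(x, p):
    """The norm (10.4) for finite p >= 1, and the max-norm (10.7) for p = inf."""
    if p == float("inf"):
        return max(abs(t) for t in x)
    return sum(abs(t) ** p for t in x) ** (1.0 / p)


if __name__ == "__main__":
    x = [3.0, -4.0, 1.0]
    inf = float("inf")
    # The printed values should decrease as p grows (inequality (10.5))
    # and approach the max-norm (relation (10.6)).
    for p in (1, 2, 4, 16, 200, inf):
        print(p, p_norm(x, p))
```

A check on a single vector proves nothing, of course; it only makes the chain $\|x\|_\infty \le \|x\|_p \le \|x\|_1 \le n\|x\|_\infty$ tangible.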


Example 6 The preceding example can be usefully generalized as follows. If $X = X_1 \times \cdots \times X_n$ is the direct product of normed vector spaces, one can introduce the norm of a vector $x = (x_1, \ldots, x_n)$ in the direct product by setting

$$\|x\|_p := \Bigl(\sum_{i=1}^{n} \|x_i\|^p\Bigr)^{1/p}, \quad p \ge 1, \tag{10.9}$$

where $\|x_i\|$ is the norm of the vector $x_i \in X_i$.

Naturally, inequalities (10.8) remain valid in this case as well. 

From now on, when the direct product of normed spaces is considered, unless 
the contrary is explicitly stated, it is assumed that the norm is defined in accordance 
with formula (10.9) (including the case p = +oo). 


Example 7 Let $p \ge 1$. We denote by $\ell_p$ the set of sequences $x = (x^1, \ldots, x^n, \ldots)$ of real or complex numbers such that the series $\sum_{n=1}^{\infty} |x^n|^p$ converges, and for $x \in \ell_p$ we set

$$\|x\|_p := \Bigl(\sum_{n=1}^{\infty} |x^n|^p\Bigr)^{1/p}. \tag{10.10}$$

Using Minkowski’s inequality, one can easily see that $\ell_p$ is a normed vector space with respect to the standard vector-space operations and the norm (10.10). This is an infinite-dimensional space with respect to which $\mathbb{R}^n_p$ is a vector subspace of finite dimension.

All the inequalities (10.8) except the last are valid for the norm (10.10). It is not difficult to verify that $\ell_p$ is a Banach space.


Example 8 In the vector space $C[a, b]$ of numerical-valued functions that are continuous on the closed interval $[a, b]$, one usually considers the following norm:

$$\|f\| := \max_{x \in [a, b]} |f(x)|. \tag{10.11}$$

We leave the verification of the norm axioms to the reader. We remark that this norm generates a metric on $C[a, b]$ that is already familiar to us (see Sect. 9.5), and we know that the metric space that thereby arises is complete. Thus the vector space $C[a, b]$ with the norm (10.11) is a Banach space.


Example 9 One can also introduce another norm in $C[a, b]$

$$\|f\|_p := \Bigl(\int_a^b |f|^p(x)\,dx\Bigr)^{1/p}, \quad p \ge 1, \tag{10.12}$$

which becomes (10.11) as $p \to +\infty$.

It is easy to see (for example, Sect. 9.5) that the space $C[a, b]$ with the norm (10.12) is not complete for $1 \le p < +\infty$.


10.1.3 Inner Products in Vector Spaces 

An important class of normed spaces is formed by the spaces with an inner product. 
They are a direct generalization of Euclidean spaces. 

We recall their definition. 

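Definition 5 We say that a Hermitian form is defined in a vector space $X$ (over the field of complex numbers) if there exists a mapping $\langle\,,\rangle : X \times X \to \mathbb{C}$ having the following properties:

a) $\langle x_1, x_2\rangle = \overline{\langle x_2, x_1\rangle}$,

b) $\langle \lambda x_1, x_2\rangle = \lambda\langle x_1, x_2\rangle$,

c) $\langle x_1 + x_2, x_3\rangle = \langle x_1, x_3\rangle + \langle x_2, x_3\rangle$,

where $x_1, x_2, x_3$ are vectors in $X$ and $\lambda \in \mathbb{C}$.

It follows from a), b), and c), for example, that

$$\langle x_1, \lambda x_2\rangle = \overline{\langle \lambda x_2, x_1\rangle} = \overline{\lambda\langle x_2, x_1\rangle} = \bar{\lambda}\,\overline{\langle x_2, x_1\rangle} = \bar{\lambda}\langle x_1, x_2\rangle;$$

$$\langle x_1, x_2 + x_3\rangle = \overline{\langle x_2 + x_3, x_1\rangle} = \overline{\langle x_2, x_1\rangle} + \overline{\langle x_3, x_1\rangle} = \langle x_1, x_2\rangle + \langle x_1, x_3\rangle;$$

$$\langle x, x\rangle = \overline{\langle x, x\rangle}, \text{ that is, } \langle x, x\rangle \text{ is a real number.}$$

A Hermitian form is called nonnegative if

d) $\langle x, x\rangle \ge 0$

and nondegenerate if

e) $\langle x, x\rangle = 0 \Leftrightarrow x = 0$.

If $X$ is a vector space over the field of real numbers, one must of course consider a real-valued form $\langle x_1, x_2\rangle$. In this case a) can be replaced by $\langle x_1, x_2\rangle = \langle x_2, x_1\rangle$, which means that the form is symmetric with respect to its vector arguments $x_1$ and $x_2$.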

An example of such a form is the dot product familiar from analytic geometry 
for vectors in three-dimensional Euclidean space. In connection with this analogy 
we make the following definition. 

Definition 6 A nondegenerate nonnegative Hermitian form in a vector space is 
called an inner product in the space.

Example 10 An inner product of vectors $x = (x^1, \ldots, x^n)$ and $y = (y^1, \ldots, y^n)$ in $\mathbb{R}^n$ can be defined by setting

$$\langle x, y\rangle := \sum_{i=1}^{n} x^i y^i, \tag{10.13}$$

and in $\mathbb{C}^n$ by setting

$$\langle x, y\rangle := \sum_{i=1}^{n} x^i \overline{y^i}. \tag{10.14}$$

Example 11 In $\ell_2$ the inner product of the vectors $x$ and $y$ can be defined as

$$\langle x, y\rangle := \sum_{i=1}^{\infty} x^i \overline{y^i}.$$

The series in this expression converges absolutely since

$$2\sum_{i=1}^{\infty} \bigl|x^i \overline{y^i}\bigr| \le \sum_{i=1}^{\infty} |x^i|^2 + \sum_{i=1}^{\infty} |y^i|^2.$$

Example 12 An inner product can be defined in $C[a, b]$ by the formula

$$\langle f, g\rangle := \int_a^b (f \cdot \bar{g})(x)\,dx. \tag{10.15}$$

It follows easily from properties of the integral that all the requirements for an inner product are satisfied in this case.


The following important inequality, known as the Cauchy-Bunyakovskii inequal- 
ity, holds for the inner product: 


$$|\langle x, y\rangle|^2 \le \langle x, x\rangle \cdot \langle y, y\rangle, \tag{10.16}$$

where equality holds if and only if the vectors $x$ and $y$ are collinear.

Proof Indeed, let $a = \langle x, x\rangle$, $b = \langle x, y\rangle$, and $c = \langle y, y\rangle$. By hypothesis $a \ge 0$ and $c \ge 0$. If $c > 0$, the inequalities

$$0 \le \langle x + \lambda y, x + \lambda y\rangle = a + \bar{b}\lambda + b\bar{\lambda} + c\lambda\bar{\lambda}$$

with $\lambda = -\frac{b}{c}$ imply

$$0 \le a - \frac{\bar{b}b}{c} - \frac{b\bar{b}}{c} + \frac{b\bar{b}}{c}$$

or

$$0 \le ac - b\bar{b} = ac - |b|^2, \tag{10.17}$$

which is the same as (10.16).

The case $a > 0$ can be handled similarly.

If $a = c = 0$, then, setting $\lambda = -b$ in the inequality $0 \le \langle x + \lambda y, x + \lambda y\rangle = \bar{b}\lambda + b\bar{\lambda}$, we find $0 \le -\bar{b}b - b\bar{b} = -2|b|^2$, that is, $b = 0$, and (10.16) is again true.

If $x$ and $y$ are not collinear, then $0 < \langle x + \lambda y, x + \lambda y\rangle$ and consequently inequality (10.16) is a strict inequality in this case. But if $x$ and $y$ are collinear, it becomes equality as one can easily verify. □
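
The inequality (10.16) and its equality case can be checked on concrete vectors in $\mathbb{C}^n$ with the inner product (10.14). The sketch below is illustrative only; the function name `inner` and the sample vectors are assumptions chosen for the example.

```python
def inner(x, y):
    """The Hermitian inner product (10.14): sum of x^i times the conjugate of y^i."""
    return sum(a * b.conjugate() for a, b in zip(x, y))


def cauchy_gap(x, y):
    """<x, x> * <y, y> - |<x, y>|**2, which (10.16) asserts is nonnegative."""
    return (inner(x, x) * inner(y, y)).real - abs(inner(x, y)) ** 2


if __name__ == "__main__":
    x = [1 + 2j, 3 - 1j]
    y = [2 - 1j, 1j]
    print(cauchy_gap(x, y))          # non-collinear vectors: a positive gap
    z = [2j * t for t in x]          # z is collinear with x
    print(cauchy_gap(x, z))          # collinear vectors: the gap vanishes
```

The second call illustrates the statement that equality holds exactly for collinear vectors.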

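A vector space with an inner product has a natural norm:

$$\|x\| := \sqrt{\langle x, x\rangle} \tag{10.18}$$

and metric

$$d(x, y) := \|x - y\|.$$

Using the Cauchy-Bunyakovskii inequality, we verify that if $\langle x, y\rangle$ is a nondegenerate nonnegative Hermitian form, then formula (10.18) does indeed define a norm.

Proof In fact,

$$\|x\| = \sqrt{\langle x, x\rangle} = 0 \Leftrightarrow x = 0,$$

since the form $\langle x, y\rangle$ is nondegenerate.

Next,

$$\|\lambda x\| = \sqrt{\langle \lambda x, \lambda x\rangle} = \sqrt{\lambda\bar{\lambda}\langle x, x\rangle} = |\lambda|\sqrt{\langle x, x\rangle} = |\lambda|\,\|x\|.$$

We verify finally that the triangle inequality holds:

$$\|x + y\| \le \|x\| + \|y\|.$$

Thus, we need to show that

$$\sqrt{\langle x + y, x + y\rangle} \le \sqrt{\langle x, x\rangle} + \sqrt{\langle y, y\rangle},$$

or, after we square and cancel, that

$$\langle x, y\rangle + \langle y, x\rangle \le 2\sqrt{\langle x, x\rangle \cdot \langle y, y\rangle}.$$

But

$$\langle x, y\rangle + \langle y, x\rangle = \langle x, y\rangle + \overline{\langle x, y\rangle} = 2\operatorname{Re}\langle x, y\rangle \le 2|\langle x, y\rangle|,$$

and the inequality to be proved now follows immediately from the Cauchy-Bunyakovskii inequality (10.16). □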

In conclusion we note that finite-dimensional vector spaces with an inner product are usually called Euclidean or Hermitian (unitary) spaces according as the field of scalars is $\mathbb{R}$ or $\mathbb{C}$ respectively. If a vector space with an inner product is infinite-dimensional, it is called a Hilbert space if it is complete in the metric induced by the natural norm and a pre-Hilbert space otherwise.


10.1.4 Problems and Exercises 

1. a) Show that if a translation-invariant homogeneous metric $d(x_1, x_2)$ is defined in a vector space $X$, then $X$ can be normed by setting $\|x\| = d(0, x)$.

b) Verify that the norm in a vector space $X$ is a continuous function with respect to the topology induced by the natural metric (10.3).

c) Prove that if $X$ is a finite-dimensional vector space and $\|x\|$ and $\|x\|'$ are two norms on $X$, then one can find positive numbers $M$, $N$ such that

$$M\|x\| \le \|x\|' \le N\|x\| \tag{10.19}$$

for any vector $x \in X$.

d) Using the example of the norms $\|x\|_1$ and $\|x\|_\infty$ in the space $\ell$, verify that the preceding inequality generally does not hold in infinite-dimensional spaces.

2. a) Prove inequality (10.5).

b) Verify relation (10.6).

c) Show that as $p \to +\infty$ the quantity $\|f\|_p$ defined by formula (10.12) tends to the quantity $\|f\|$ given by formula (10.11).

3. a) Verify that the normed space $\ell_p$ considered in Example 7 is complete.

b) Show that the subspace of $\ell_p$ consisting of finite sequences (ending in zeros) is not a Banach space.

4. a) Verify that relations (10.11) and (10.12) define a norm in the space $C[a, b]$ and convince yourself that a complete normed space is obtained in one of these cases but not in the other.

b) Does formula (10.12) define a norm in the space $\mathcal{R}[a, b]$ of Riemann-integrable functions?

c) What factorization (identification) must one make in $\mathcal{R}[a, b]$ so that the quantity defined by (10.12) will be a norm in the resulting vector space?

5. a) Verify that formulas (10.13)-(10.15) do indeed define an inner product in the corresponding vector spaces.

b) Is the form defined by formula (10.15) an inner product in the space $\mathcal{R}[a, b]$ of Riemann-integrable functions?

c) Which functions in $\mathcal{R}[a, b]$ must be identified so that the answer to part b) will be positive in the quotient space of equivalence classes?

6. Using the Cauchy-Bunyakovskii inequality, find the greatest lower bound of the values of the product $\bigl(\int_a^b f(x)\,dx\bigr)\bigl(\int_a^b (1/f)(x)\,dx\bigr)$ on the set of continuous real-valued functions that do not vanish on the closed interval $[a, b]$.


10.2 Linear and Multilinear Transformations 
10.2.1 Definitions and Examples 

We begin by recalling the basic definition. 

Definition 1 If $X$ and $Y$ are vector spaces over the same field (in our case, either $\mathbb{R}$ or $\mathbb{C}$), a mapping $A : X \to Y$ is linear if the equalities

$$A(x_1 + x_2) = A(x_1) + A(x_2),$$

$$A(\lambda x) = \lambda A(x)$$

hold for any vectors $x, x_1, x_2$ in $X$ and any number $\lambda$ in the field of scalars.

For a linear transformation $A : X \to Y$ we often write $Ax$ instead of $A(x)$.

Definition 2 A mapping $A : X_1 \times \cdots \times X_n \to Y$ of the direct product of the vector spaces $X_1, \ldots, X_n$ into the vector space $Y$ is multilinear ($n$-linear) if the mapping $y = A(x_1, \ldots, x_n)$ is linear with respect to each variable for all fixed values of the other variables.

The set of $n$-linear mappings $A : X_1 \times \cdots \times X_n \to Y$ will be denoted $\mathcal{L}(X_1, \ldots, X_n; Y)$.

In particular for $n = 1$ we obtain the set $\mathcal{L}(X; Y)$ of linear mappings from $X_1 = X$ into $Y$.

For $n = 2$ a multilinear mapping is called bilinear, for $n = 3$, trilinear, and so forth.

One should not confuse an $n$-linear mapping $A \in \mathcal{L}(X_1, \ldots, X_n; Y)$ with a linear mapping $A \in \mathcal{L}(X; Y)$ of the vector space $X = X_1 \times \cdots \times X_n$ (in this connection see Examples 9-11 below).

If $Y = \mathbb{R}$ or $Y = \mathbb{C}$, linear and multilinear mappings are usually called linear or multilinear functionals. When $Y$ is an arbitrary vector space, a linear mapping $A : X \to Y$ is usually called a linear transformation from $X$ into $Y$, and a linear operator in the special case when $X = Y$.

Let us consider some examples of linear mappings.

Example 1 Let $\ell_0$ be the vector space of finite numerical sequences. We define a transformation $A : \ell_0 \to \ell_0$ as follows:

$$A\bigl((x_1, x_2, \ldots, x_n, 0, \ldots)\bigr) := (1x_1, 2x_2, \ldots, nx_n, 0, \ldots).$$

Example 2 We define the functional $A : C[a, b] \to \mathbb{R}$ by the relation

$$A(f) := f(x_0),$$

where $f \in C([a, b], \mathbb{R})$ and $x_0$ is a fixed point of the closed interval $[a, b]$.

Example 3 We define the functional $A : C([a, b], \mathbb{R}) \to \mathbb{R}$ by the relation

$$A(f) := \int_a^b f(x)\,dx.$$

Example 4 We define the transformation $A : C([a, b], \mathbb{R}) \to C([a, b], \mathbb{R})$ by the formula

$$A(f) := \int_a^x f(t)\,dt,$$

where $x$ is a point ranging over the closed interval $[a, b]$.

All of these transformations are obviously linear.

Let us now consider some familiar examples of multilinear mappings.

Example 5 The usual product $(x_1, \ldots, x_n) \mapsto x_1 \cdot \ldots \cdot x_n$ of $n$ real numbers is a typical example of an $n$-linear functional $A \in \mathcal{L}(\underbrace{\mathbb{R}, \ldots, \mathbb{R}}_{n}; \mathbb{R})$.

Example 6 The inner product $(x_1, x_2) \overset{A}{\longmapsto} \langle x_1, x_2\rangle$ in a Euclidean vector space over the field $\mathbb{R}$ is a bilinear function.

Example 7 The cross product $(x_1, x_2) \overset{A}{\longmapsto} [x_1, x_2]$ of vectors in three-dimensional Euclidean space $E^3$ is a bilinear transformation, that is, $A \in \mathcal{L}(E^3, E^3; E^3)$.

Example 8 If $X$ is a finite-dimensional vector space over the field $\mathbb{R}$, $\{e_1, \ldots, e_n\}$ is a basis in $X$, and $x = x^i e_i$ is the coordinate representation of the vector $x \in X$, then, setting

$$A(x_1, \ldots, x_n) = \det \begin{pmatrix} x_1^1 & \cdots & x_1^n \\ \vdots & \ddots & \vdots \\ x_n^1 & \cdots & x_n^n \end{pmatrix},$$

we obtain an $n$-linear function $A : X^n \to \mathbb{R}$.

As a useful supplement to the examples just given, we investigate in addition the structure of the linear mappings of a product of vector spaces into a product of vector spaces.

Example 9 Let $X = X_1 \times \cdots \times X_m$ be the vector space that is the direct product of the spaces $X_1, \ldots, X_m$, and let $A : X \to Y$ be a linear mapping of $X$ into a vector space $Y$. Representing every vector $x = (x_1, \ldots, x_m) \in X$ in the form

$$x = (x_1, \ldots, x_m) = (x_1, 0, \ldots, 0) + (0, x_2, 0, \ldots, 0) + \cdots + (0, \ldots, 0, x_m) \tag{10.20}$$

and setting

$$A_i(x_i) := A\bigl((0, \ldots, 0, x_i, 0, \ldots, 0)\bigr) \tag{10.21}$$

for $x_i \in X_i$, $i = 1, \ldots, m$, we observe that the mappings $A_i : X_i \to Y$ are linear and that

$$A(x) = A_1(x_1) + \cdots + A_m(x_m). \tag{10.22}$$

Since the mapping $A : X = X_1 \times \cdots \times X_m \to Y$ is obviously linear for any linear mappings $A_i : X_i \to Y$, we have shown that formula (10.22) gives the general form of any linear mapping $A \in \mathcal{L}(X = X_1 \times \cdots \times X_m; Y)$.

Example 10 Starting from the definition of the direct product $Y = Y_1 \times \cdots \times Y_n$ of the vector spaces $Y_1, \ldots, Y_n$ and the definition of a linear mapping $A : X \to Y$, one can easily see that any linear mapping

$$A : X \to Y = Y_1 \times \cdots \times Y_n$$

has the form $x \mapsto Ax = (A_1 x, \ldots, A_n x) = (y_1, \ldots, y_n) = y \in Y$, where $A_i : X \to Y_i$ are linear mappings.

Example 11 Combining Examples 9 and 10, we conclude that any linear mapping

$$A : X_1 \times \cdots \times X_m = X \to Y = Y_1 \times \cdots \times Y_n$$

of the direct product $X = X_1 \times \cdots \times X_m$ of vector spaces into another direct product $Y = Y_1 \times \cdots \times Y_n$ has the form

$$y = \begin{pmatrix} y_1 \\ \vdots \\ y_n \end{pmatrix} = \begin{pmatrix} A_{11} & \cdots & A_{1m} \\ \vdots & \ddots & \vdots \\ A_{n1} & \cdots & A_{nm} \end{pmatrix} \begin{pmatrix} x_1 \\ \vdots \\ x_m \end{pmatrix} = Ax, \tag{10.23}$$

where $A_{ij} : X_j \to Y_i$ are linear mappings.

In particular, if $X_1 = X_2 = \cdots = X_m = \mathbb{R}$ and $Y_1 = Y_2 = \cdots = Y_n = \mathbb{R}$, then $A_{ij} : X_j \to Y_i$ are the linear mappings $\mathbb{R} \ni x \mapsto a_{ij}x \in \mathbb{R}$, each of which is given by a single number $a_{ij}$. Thus in this case relation (10.23) becomes the familiar numerical notation for a linear mapping $A : \mathbb{R}^m \to \mathbb{R}^n$.


10.2.2 The Norm of a Transformation


Definition 3 Let $A : X_1 \times \cdots \times X_n \to Y$ be a multilinear transformation mapping the direct product of the normed vector spaces $X_1, \ldots, X_n$ into a normed space $Y$. The quantity

$$\|A\| := \sup_{\substack{x_1, \ldots, x_n \\ x_i \ne 0}} \frac{|A(x_1, \ldots, x_n)|_Y}{|x_1|_{X_1} \times \cdots \times |x_n|_{X_n}}, \tag{10.24}$$

where the supremum is taken over all sets $x_1, \ldots, x_n$ of nonzero vectors in the spaces $X_1, \ldots, X_n$, is called the norm of the multilinear transformation $A$.


On the right-hand side of Eq. (10.24) we have denoted the norm of a vector x 
by the symbol | ■ | subscripted by the symbol for the normed vector space to which 
the vector belongs, rather than the usual symbol || ■ || for the norm of a vector. From 
now on we shall adhere to this notation for the norm of a vector; and, where no 
confusion can arise, we shall omit the symbol for the vector space, taking for granted 
that the norm (absolute value) of a vector is always computed in the space to which 
it belongs. In this way we hope to introduce for the time being some distinction 
in the notation for the norm of a vector and the norm of a linear or multilinear 
transformation acting on a normed vector space. 

Using the properties of the norm of a vector and the properties of a multilinear 
transformation, one can rewrite formula (10.24) as follows: 


$$\|A\| = \sup_{\substack{x_1, \ldots, x_n \\ x_i \ne 0}} \left| A\Bigl(\frac{x_1}{|x_1|}, \ldots, \frac{x_n}{|x_n|}\Bigr) \right| = \sup_{e_1, \ldots, e_n} |A(e_1, \ldots, e_n)|, \tag{10.25}$$

where the last supremum extends over all sets $e_1, \ldots, e_n$ of unit vectors in the spaces $X_1, \ldots, X_n$ respectively (that is, $|e_i| = 1$, $i = 1, \ldots, n$).

In particular, for a linear transformation $A : X \to Y$, from (10.24) and (10.25) we obtain

$$\|A\| = \sup_{x \ne 0} \frac{|Ax|}{|x|} = \sup_{|e| = 1} |Ae|. \tag{10.26}$$

It follows from Definition 3 for the norm of a multilinear transformation $A$ that if $\|A\| < \infty$, then the inequality

$$|A(x_1, \ldots, x_n)| \le \|A\|\,|x_1| \times \cdots \times |x_n| \tag{10.27}$$

holds for any vectors $x_i \in X_i$, $i = 1, \ldots, n$.

In particular, for a linear transformation we obtain

$$|Ax| \le \|A\|\,|x|. \tag{10.28}$$
In addition, it follows from Definition 3 that if the norm of a multilinear transformation is finite, it is the greatest lower bound of all numbers $M$ for which the inequality

$$|A(x_1, \ldots, x_n)| \le M|x_1| \times \cdots \times |x_n| \tag{10.29}$$

holds for all values of $x_i \in X_i$, $i = 1, \ldots, n$.

Definition 4 A multilinear transformation $A : X_1 \times \cdots \times X_n \to Y$ is bounded if there exists $M \in \mathbb{R}$ such that inequality (10.29) holds for all values of $x_1, \ldots, x_n$ in the spaces $X_1, \ldots, X_n$ respectively.

Thus the bounded transformations are precisely those that have a finite norm. 

On the basis of relation (10.26) one can easily understand the geometric meaning of the norm of a linear transformation in the familiar case $A : \mathbb{R}^m \to \mathbb{R}^n$. In this case the unit sphere in $\mathbb{R}^m$ maps under the transformation $A$ into some ellipsoid in $\mathbb{R}^n$ whose center is at the origin. Hence the norm of $A$ in this case is simply the largest of the semiaxes of the ellipsoid.

On the other hand, one can also interpret the norm of a linear transformation as 
the least upper bound of the coefficients of dilation of vectors under the mapping, 
as can be seen from the first equality in (10.26). 
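
For a map $A : \mathbb{R}^2 \to \mathbb{R}^2$ both descriptions can be compared numerically. The sketch below is illustrative and rests on explicit assumptions: both copies of $\mathbb{R}^2$ carry the Euclidean norm, the matrix is given as a pair of rows, and the function names are hypothetical. The first function approximates the supremum in (10.26) by sampling unit vectors; the second computes the largest semiaxis as the square root of the largest eigenvalue of $A^{T}A$.

```python
import math


def norm_by_sampling(matrix, samples=100_000):
    """Approximate sup |Ae| over Euclidean unit vectors e in the plane."""
    (a, b), (c, d) = matrix
    best = 0.0
    for k in range(samples):
        t = 2.0 * math.pi * k / samples
        e1, e2 = math.cos(t), math.sin(t)
        best = max(best, math.hypot(a * e1 + b * e2, c * e1 + d * e2))
    return best


def largest_semiaxis(matrix):
    """Square root of the largest eigenvalue of the 2x2 matrix (A^T)A."""
    (a, b), (c, d) = matrix
    trace = a * a + b * b + c * c + d * d      # trace of (A^T)A
    det = (a * d - b * c) ** 2                 # determinant of (A^T)A
    disc = max(trace * trace - 4.0 * det, 0.0)  # guard against rounding below zero
    return math.sqrt((trace + math.sqrt(disc)) / 2.0)


if __name__ == "__main__":
    shear = [[1.0, 1.0], [0.0, 1.0]]
    print(norm_by_sampling(shear), largest_semiaxis(shear))
```

Sampling only approximates the supremum from below, so the two numbers agree up to the sampling error rather than exactly.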

It is not difficult to prove that for mappings of finite-dimensional spaces the norm 
of a multilinear transformation is always finite, and hence in particular the norm of 
a linear transformation is always finite. This is no longer true in the case of infinite- 
dimensional spaces, as can be seen from the first of the following examples. 

Let us compute the norms of the transformations considered in Examples 1-8. 

Example 1' If we regard $\ell_0$ as a subspace of the normed space $\ell_p$, in which the vector $e_n = (\underbrace{0, \ldots, 0}_{n-1}, 1, 0, \ldots)$ has unit norm, then, since $Ae_n = ne_n$, it is clear that $\|A\| = \infty$.

Example 2' If $|f| = \max_{a \le x \le b} |f(x)| \le 1$, then $|Af| = |f(x_0)| \le 1$, and $|Af| = 1$ if $f(x_0) = 1$, so that $\|A\| = 1$.

We remark that if we introduce, for example, the integral norm

$$|f| = \int_a^b |f|(x)\,dx$$

on the same vector space $C([a, b], \mathbb{R})$, the result of computing $\|A\|$ may change considerably. Indeed, set $[a, b] = [0, 1]$ and $x_0 = 1$. The integral norm of the function $f_n = x^n$ on $[0, 1]$ is obviously $\frac{1}{n+1}$, while $Af_n = Ax^n = x^n|_{x=1} = 1$. It follows that $\|A\| = \infty$ in this case.
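
The growth of the ratio $|Af_n| / |f_n| = n + 1$ in the integral norm can also be observed numerically. The following sketch is only an illustration: the midpoint rule, the number of cells, and the names `integral_norm` and `dilation` are assumptions made for the example.

```python
def integral_norm(f, a, b, cells=10_000):
    """Midpoint-rule approximation of the integral of |f| over [a, b]."""
    h = (b - a) / cells
    return h * sum(abs(f(a + (k + 0.5) * h)) for k in range(cells))


def dilation(n):
    """|A f_n| / |f_n| for f_n(x) = x**n on [0, 1] and A f = f(1)."""
    def f(x):
        return x ** n
    return abs(f(1.0)) / integral_norm(f, 0.0, 1.0)


if __name__ == "__main__":
    # The ratios should be close to n + 1 and therefore unbounded in n.
    for n in (1, 9, 49):
        print(n, dilation(n))
```

Since the ratios are unbounded, no finite constant can serve as $\|A\|$ for the evaluation functional in this norm.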

Throughout what follows, unless the contrary is explicitly stated, the space 
C([a, b], R) is assumed to have the norm defined by the maximum of the absolute 
value of the function on the closed interval [a,b]. 




Example 3' If $|f| = \max_{a \le x \le b} |f(x)| \le 1$, then

$$|Af| = \left| \int_a^b f(x)\,dx \right| \le \int_a^b |f|(x)\,dx \le \int_a^b 1\,dx = b - a.$$

But for $f(x) \equiv 1$, we obtain $|A1| = b - a$, and therefore $\|A\| = b - a$.

Example 4' If $|f| = \max_{a \le x \le b} |f(x)| \le 1$, then

$$\max_{a \le x \le b} \left| \int_a^x f(t)\,dt \right| \le \max_{a \le x \le b} \int_a^x |f|(t)\,dt \le \max_{a \le x \le b} (x - a) = b - a.$$

But for $|f|(t) \equiv 1$, we obtain

$$\max_{a \le x \le b} \int_a^x 1\,dt = b - a,$$

and therefore in this example $\|A\| = b - a$.

Example 5' We obtain immediately from Definition 3 that $\|A\| = 1$ in this case.

Example 6' By the Cauchy-Bunyakovskii inequality

$$|\langle x_1, x_2\rangle| \le |x_1| \cdot |x_2|,$$

and if $x_1 = x_2$, this inequality becomes equality. Hence $\|A\| = 1$.

Example 7' We know that

$$|[x_1, x_2]| = |x_1|\,|x_2| \sin\varphi,$$

where $\varphi$ is the angle between the vectors $x_1$ and $x_2$, and therefore $\|A\| \le 1$. At the same time, if the vectors $x_1$ and $x_2$ are orthogonal, then $\sin\varphi = 1$. Thus $\|A\| = 1$.

Example 8' If we assume that the vectors lie in a Euclidean space of dimension $n$, we note that $A(x_1, \ldots, x_n) = \det(x_1, \ldots, x_n)$ is the volume of the parallelepiped spanned by the vectors $x_1, \ldots, x_n$, and this volume is maximal if the vectors $x_1, \ldots, x_n$ are made pairwise orthogonal while keeping their lengths constant.

Thus,

$$|\det(x_1, \ldots, x_n)| \le |x_1| \cdot \ldots \cdot |x_n|,$$

equality holding for orthogonal vectors. Hence in this case $\|A\| = 1$.
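
The estimate of Example 8' and the multilinearity of the determinant can be checked on three vectors in $\mathbb{R}^3$. The sketch below is illustrative only: the explicit $3 \times 3$ cofactor formula, the function names, and the sample vectors are assumptions chosen for the example.

```python
import math


def det3(x1, x2, x3):
    """Determinant of the 3x3 matrix whose rows are x1, x2, x3."""
    return (x1[0] * (x2[1] * x3[2] - x2[2] * x3[1])
            - x1[1] * (x2[0] * x3[2] - x2[2] * x3[0])
            + x1[2] * (x2[0] * x3[1] - x2[1] * x3[0]))


def length(x):
    """Euclidean length of a vector."""
    return math.sqrt(sum(t * t for t in x))


def volume_bound(x1, x2, x3):
    """Return (|det|, product of lengths); the first never exceeds the second."""
    return abs(det3(x1, x2, x3)), length(x1) * length(x2) * length(x3)


if __name__ == "__main__":
    # Pairwise orthogonal vectors of length 3: both numbers coincide.
    print(volume_bound((1, 2, 2), (2, 1, -2), (2, -2, 1)))
    # A skew triple: the volume is strictly smaller than the product of lengths.
    print(volume_bound((1, 0, 0), (1, 1, 0), (1, 1, 1)))
```

The orthogonal triple realizes equality, which is why the supremum in Definition 3 equals 1 here.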

Let us now estimate the norms of the operators studied in Examples 9-11. We shall assume that in the direct product $X = X_1 \times \cdots \times X_m$ of the normed spaces $X_1, \ldots, X_m$ the norm of the vector $x = (x_1, \ldots, x_m)$ is introduced in accordance with the convention in Sect. 10.1 (Example 6).

Example 9' Defining a linear transformation

$$A : X_1 \times \cdots \times X_m = X \to Y,$$

as has been shown, is equivalent to defining the $m$ linear transformations $A_i : X_i \to Y$ given by the relations $A_i x_i = A\bigl((0, \ldots, 0, x_i, 0, \ldots, 0)\bigr)$, $i = 1, \ldots, m$. When this is done, formula (10.22) holds, by virtue of which

$$|Ax|_Y \le \sum_{i=1}^{m} |A_i x_i|_Y \le \sum_{i=1}^{m} \|A_i\|\,|x_i|_{X_i} \le \Bigl(\sum_{i=1}^{m} \|A_i\|\Bigr)|x|_X.$$

Thus we have shown that

$$\|A\| \le \sum_{i=1}^{m} \|A_i\|.$$

On the other hand, since

$$|A_i x_i| = \bigl|A\bigl((0, \ldots, 0, x_i, 0, \ldots, 0)\bigr)\bigr| \le \|A\|\,\bigl|(0, \ldots, 0, x_i, 0, \ldots, 0)\bigr|_X = \|A\|\,|x_i|_{X_i},$$

we can conclude that the estimate

$$\|A_i\| \le \|A\|$$

also holds for all $i = 1, \ldots, m$.

Example 10' Taking account of the norm introduced in $Y = Y_1 \times \cdots \times Y_n$, in this case we immediately obtain the two-sided estimates

$$\|A_i\| \le \|A\| \le \sum_{i=1}^{n} \|A_i\|.$$

Example 11' Taking account of the results of Examples 9 and 10, one can conclude that

$$\|A_{ij}\| \le \|A\| \le \sum_{i=1}^{m} \sum_{j=1}^{n} \|A_{ij}\|.$$

10.2.3 The Space of Continuous Transformations 

From now on we shall not be interested in all linear or multilinear transformations, 
only continuous ones. In this connection it is useful to keep in mind the following 
proposition.

Proposition 1 For a multilinear transformation $A : X_1 \times \cdots \times X_n \to Y$ mapping a product of normed spaces $X_1, \ldots, X_n$ into a normed space $Y$ the following conditions are equivalent.

a) $A$ has a finite norm.

b) $A$ is a bounded transformation.

c) $A$ is a continuous transformation.

d) $A$ is continuous at the point $(0, \ldots, 0) \in X_1 \times \cdots \times X_n$.

Proof We prove a closed chain of implications a) $\Rightarrow$ b) $\Rightarrow$ c) $\Rightarrow$ d) $\Rightarrow$ a).

It is obvious from relation (10.27) that a) $\Rightarrow$ b).

Let us verify that b) $\Rightarrow$ c), that is, that (10.29) implies that the operator $A$ is continuous. Indeed, taking account of the multilinearity of $A$, we can write that

$$\begin{aligned}
A(x_1 + h_1, x_2 + h_2, \ldots, x_n + h_n) &- A(x_1, x_2, \ldots, x_n) = \\
&= A(h_1, x_2, \ldots, x_n) + \cdots + A(x_1, x_2, \ldots, x_{n-1}, h_n) + \\
&\quad + A(h_1, h_2, x_3, \ldots, x_n) + \cdots + A(x_1, \ldots, x_{n-2}, h_{n-1}, h_n) + \\
&\quad + \cdots + A(h_1, \ldots, h_n).
\end{aligned}$$

From (10.29) we now obtain the estimate

$$\begin{aligned}
\bigl|A(x_1 + h_1, x_2 + h_2, \ldots, x_n + h_n) &- A(x_1, x_2, \ldots, x_n)\bigr| \le \\
&\le M\bigl(|h_1| \cdot |x_2| \cdot \ldots \cdot |x_n| + \cdots + |x_1| \cdot |x_2| \cdot \ldots \cdot |x_{n-1}| \cdot |h_n| + \\
&\quad + \cdots + |h_1| \cdot \ldots \cdot |h_n|\bigr),
\end{aligned}$$

from which it follows that $A$ is continuous at each point $(x_1, \ldots, x_n) \in X_1 \times \cdots \times X_n$.

In particular, if $(x_1, \ldots, x_n) = (0, \ldots, 0)$ we obtain d) from c).

It remains to be shown that d) $\Rightarrow$ a).

Given $\varepsilon > 0$ we find $\delta = \delta(\varepsilon) > 0$ such that $|A(x_1, \ldots, x_n)| < \varepsilon$ when $\max\{|x_1|, \ldots, |x_n|\} \le \delta$. Then for any set $e_1, \ldots, e_n$ of unit vectors we obtain

$$|A(e_1, \ldots, e_n)| = \frac{1}{\delta^n}\,|A(\delta e_1, \ldots, \delta e_n)| < \frac{\varepsilon}{\delta^n},$$

that is, $\|A\| \le \frac{\varepsilon}{\delta^n} < \infty$. □


We have seen above (Example 1) that not every linear transformation has a finite 
norm, that is, a linear transformation is not always continuous. We have also pointed 
out that continuity can fail for a linear transformation only when the transformation 
is defined on an infinite-dimensional space. 
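The failure of continuity in infinite dimensions is easy to observe numerically. The sketch below is an illustration of ours and is not necessarily the example referred to above. It assumes the norm $|P| = \max_{0 \le x \le 1} |P(x)|$ on polynomials, approximated on a grid that contains both endpoints, and shows that differentiation sends the unit vectors $x^n$ to vectors of norm $n$, so that no finite constant can bound the operator.

```python
# Sketch: differentiation on polynomials is unbounded in the sup norm on [0, 1].
# Assumption: |P| = max |P(x)| over [0, 1], approximated on a uniform grid.

def evaluate(coeffs, x):
    """Horner evaluation; coeffs[k] is the coefficient of x**k."""
    value = 0.0
    for c in reversed(coeffs):
        value = value * x + c
    return value

def derivative(coeffs):
    """Coefficients of the derivative polynomial."""
    return [k * c for k, c in enumerate(coeffs)][1:] or [0.0]

def sup_norm(coeffs, samples=1001):
    """Grid approximation of the sup norm on [0, 1]; the grid contains 0 and 1."""
    return max(abs(evaluate(coeffs, i / (samples - 1))) for i in range(samples))

def norm_ratio(n):
    """|D(x**n)| / |x**n| for the monomial x**n."""
    monomial = [0.0] * n + [1.0]
    return sup_norm(derivative(monomial)) / sup_norm(monomial)

ratios = [norm_ratio(n) for n in range(1, 6)]
print(ratios)
```

The ratios grow without bound as the degree increases, which is exactly the statement that the operator has no finite norm with respect to this choice of norm.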
From here on $\mathcal{L}(X_1, \ldots, X_n; Y)$ will denote the set of continuous multilinear transformations mapping the direct product of the normed vector spaces $X_1, \ldots, X_n$ into the normed vector space $Y$.

In particular, $\mathcal{L}(X; Y)$ is the set of continuous linear transformations from $X$ into $Y$.

In the set $\mathcal{L}(X_1, \ldots, X_n; Y)$ we introduce a natural vector-space structure:

$$(A + B)(x_1, \ldots, x_n) := A(x_1, \ldots, x_n) + B(x_1, \ldots, x_n)$$

and

$$(\lambda A)(x_1, \ldots, x_n) := \lambda A(x_1, \ldots, x_n).$$

It is obvious that if $A, B \in \mathcal{L}(X_1, \ldots, X_n; Y)$, then $(A + B) \in \mathcal{L}(X_1, \ldots, X_n; Y)$ and $(\lambda A) \in \mathcal{L}(X_1, \ldots, X_n; Y)$.

Thus $\mathcal{L}(X_1, \ldots, X_n; Y)$ can be regarded as a vector space.

Proposition 2 The norm of a multilinear transformation is a norm in the vector space $\mathcal{L}(X_1, \ldots, X_n; Y)$ of continuous multilinear transformations.

Proof We observe first of all that by Proposition 1 the nonnegative number $\|A\| < \infty$ is defined for every transformation $A \in \mathcal{L}(X_1, \ldots, X_n; Y)$.

Inequality (10.27) shows that

$$\|A\| = 0 \Leftrightarrow A = 0.$$

Next, by definition of the norm of a multilinear transformation

$$\|\lambda A\| = \sup_{\substack{x_1, \ldots, x_n \\ x_i \ne 0}} \frac{|(\lambda A)(x_1, \ldots, x_n)|}{|x_1| \cdots |x_n|} = \sup_{\substack{x_1, \ldots, x_n \\ x_i \ne 0}} \frac{|\lambda|\,|A(x_1, \ldots, x_n)|}{|x_1| \cdots |x_n|} = |\lambda|\,\|A\|.$$

Finally, if $A$ and $B$ are elements of the space $\mathcal{L}(X_1, \ldots, X_n; Y)$, then

$$\begin{aligned}
\|A + B\| &= \sup_{\substack{x_1, \ldots, x_n \\ x_i \ne 0}} \frac{|(A + B)(x_1, \ldots, x_n)|}{|x_1| \cdots |x_n|}
= \sup_{\substack{x_1, \ldots, x_n \\ x_i \ne 0}} \frac{|A(x_1, \ldots, x_n) + B(x_1, \ldots, x_n)|}{|x_1| \cdots |x_n|} \le {} \\
&\le \sup_{\substack{x_1, \ldots, x_n \\ x_i \ne 0}} \frac{|A(x_1, \ldots, x_n)|}{|x_1| \cdots |x_n|} + \sup_{\substack{x_1, \ldots, x_n \\ x_i \ne 0}} \frac{|B(x_1, \ldots, x_n)|}{|x_1| \cdots |x_n|} = \|A\| + \|B\|.
\end{aligned}$$

□
From now on when we use the symbol $\mathcal{L}(X_1, \ldots, X_n; Y)$ we shall have in mind the vector space of continuous $n$-linear transformations normed by this transformation norm. In particular $\mathcal{L}(X; Y)$ is the normed space of continuous linear transformations from $X$ into $Y$.

We now prove the following useful supplement to Proposition 2.

Supplement If $X$, $Y$, and $Z$ are normed spaces and $A \in \mathcal{L}(X; Y)$ and $B \in \mathcal{L}(Y; Z)$, then

$$\|B \circ A\| \le \|B\| \cdot \|A\|.$$

Proof Indeed,

$$\|B \circ A\| = \sup_{x \ne 0} \frac{|(B \circ A)x|}{|x|} \le \sup_{x \ne 0} \frac{\|B\|\,|Ax|}{|x|} = \|B\| \sup_{x \ne 0} \frac{|Ax|}{|x|} = \|B\| \cdot \|A\|.$$

□
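The inequality of the Supplement can be tried out on matrices. The sketch below is only an illustration under an assumption of ours: the plane carries the norm $|x| = \max\{|x_1|, |x_2|\}$, for which the operator norm of a matrix is its largest absolute row sum. The two matrices and the helper names are hypothetical choices.

```python
# Sketch: ||B o A|| <= ||B|| * ||A|| for 2x2 matrices,
# assuming the max norm on the plane (operator norm = largest absolute row sum).

def matmul(a, b):
    """Matrix product of two lists of rows."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]

def operator_norm_max(matrix):
    """Operator norm induced by the max norm."""
    return max(sum(abs(entry) for entry in row) for row in matrix)

A = [[1.0, 2.0], [0.0, 1.0]]
B = [[1.0, -2.0], [0.0, 1.0]]   # B is the inverse of A

norm_of_composition = operator_norm_max(matmul(B, A))
product_of_norms = operator_norm_max(B) * operator_norm_max(A)
print(norm_of_composition, product_of_norms)
```

Here the composition is the identity mapping, whose norm is 1, while each factor has norm 3, so the inequality is far from being an equality.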


Proposition 3 If $Y$ is a complete normed space, then $\mathcal{L}(X_1, \ldots, X_n; Y)$ is also a complete normed space.

Proof We shall carry out the proof for the space $\mathcal{L}(X; Y)$ of continuous linear transformations. The general case, as will be clear from the reasoning below, differs only in requiring a more cumbersome notation.

Let $A_1, A_2, \ldots, A_n, \ldots$ be a Cauchy sequence in $\mathcal{L}(X; Y)$. Since for any $x \in X$ we have

$$|A_m x - A_n x| = |(A_m - A_n)x| \le \|A_m - A_n\|\,|x|,$$

it is clear that for any $x \in X$ the sequence $A_1 x, A_2 x, \ldots, A_n x, \ldots$ is a Cauchy sequence in $Y$. Since $Y$ is complete, it has a limit in $Y$, which we denote by $Ax$.

Thus,

$$Ax := \lim_{n \to \infty} A_n x.$$

We shall show that $A : X \to Y$ is a continuous linear transformation.

The linearity of $A$ follows from the relations

$$\lim_{n \to \infty} A_n(\lambda_1 x_1 + \lambda_2 x_2) = \lim_{n \to \infty} (\lambda_1 A_n x_1 + \lambda_2 A_n x_2) = \lambda_1 \lim_{n \to \infty} A_n x_1 + \lambda_2 \lim_{n \to \infty} A_n x_2.$$

Next, for any fixed $\varepsilon > 0$ and sufficiently large values of $m, n \in \mathbb{N}$ we have $\|A_m - A_n\| < \varepsilon$, and therefore

$$|A_m x - A_n x| \le \varepsilon |x|$$

at each vector $x \in X$. Letting $m$ tend to infinity in this last relation and using the continuity of the norm of a vector, we obtain

$$|Ax - A_n x| \le \varepsilon |x|.$$

Thus $\|A - A_n\| \le \varepsilon$, and since $A = A_n + (A - A_n)$, we conclude that

$$\|A\| \le \|A_n\| + \varepsilon.$$

Consequently, we have shown that $A \in \mathcal{L}(X; Y)$ and $\|A - A_n\| \to 0$ as $n \to \infty$, that is, $A = \lim_{n \to \infty} A_n$ in the sense of the norm of the space $\mathcal{L}(X; Y)$. □

In conclusion, we make one special remark relating to the space of multilinear 
transformations, which we shall need when studying higher-order differentials. 

Proposition 4 For each $m \in \{1, \ldots, n\}$ there is a bijection between the spaces

$$\mathcal{L}(X_1, \ldots, X_m; \mathcal{L}(X_{m+1}, \ldots, X_n; Y)) \quad \text{and} \quad \mathcal{L}(X_1, \ldots, X_n; Y)$$

that preserves the vector-space structure and the norm.

Proof We shall exhibit this isomorphism.

Let $\mathfrak{B} \in \mathcal{L}(X_1, \ldots, X_m; \mathcal{L}(X_{m+1}, \ldots, X_n; Y))$, that is, $\mathfrak{B}(x_1, \ldots, x_m) \in \mathcal{L}(X_{m+1}, \ldots, X_n; Y)$.

We set

$$A(x_1, \ldots, x_n) := \mathfrak{B}(x_1, \ldots, x_m)(x_{m+1}, \ldots, x_n). \tag{10.30}$$

Then

$$\begin{aligned}
\|\mathfrak{B}\| &= \sup_{\substack{x_1, \ldots, x_m \\ x_i \ne 0}} \frac{\|\mathfrak{B}(x_1, \ldots, x_m)\|}{|x_1| \cdots |x_m|}
= \sup_{\substack{x_1, \ldots, x_m \\ x_i \ne 0}} \frac{\displaystyle \sup_{\substack{x_{m+1}, \ldots, x_n \\ x_j \ne 0}} \frac{|\mathfrak{B}(x_1, \ldots, x_m)(x_{m+1}, \ldots, x_n)|}{|x_{m+1}| \cdots |x_n|}}{|x_1| \cdots |x_m|} = {} \\
&= \sup_{\substack{x_1, \ldots, x_n \\ x_k \ne 0}} \frac{|A(x_1, \ldots, x_n)|}{|x_1| \cdots |x_n|} = \|A\|.
\end{aligned}$$

We leave to the reader the verification that relation (10.30) defines an isomorphism of these vector spaces. □

Applying Proposition 4 $n$ times, we find that the space

$$\mathcal{L}(X_1; \mathcal{L}(X_2; \ldots; \mathcal{L}(X_n; Y) \cdots))$$

is isomorphic to the space $\mathcal{L}(X_1, \ldots, X_n; Y)$ of $n$-linear transformations.
10.2.4 Problems and Exercises

1. a) Prove that if $A : X \to Y$ is a linear transformation from the normed space $X$ into the normed space $Y$ and $X$ is finite-dimensional, then $A$ is a continuous operator.

b) Prove the proposition analogous to that stated in a) for a multilinear operator.

2. Two normed vector spaces are isomorphic if there exists an isomorphism between them (as vector spaces) that is continuous together with its inverse transformation.

a) Show that normed vector spaces of the same finite dimension are isomorphic.

b) Show that for the infinite-dimensional case assertion a) is generally no longer true.

c) Introduce two norms in the space $C([a, b], \mathbb{R})$ in such a way that the identity mapping of $C([a, b], \mathbb{R})$ is not a continuous mapping of the two resulting normed spaces.

3. Show that if a multilinear transformation of $n$-dimensional Euclidean space is continuous at some point, then it is continuous everywhere.

4. Let $A : E^n \to E^n$ be a linear transformation of $n$-dimensional Euclidean space and $A^* : E^n \to E^n$ the adjoint to this transformation.

Show the following.

a) All the eigenvalues of the operator $A \cdot A^* : E^n \to E^n$ are nonnegative.

b) If $\lambda_1 \le \cdots \le \lambda_n$ are the eigenvalues of the operator $A \cdot A^*$, then $\|A\| = \sqrt{\lambda_n}$.

c) If the operator $A$ has an inverse $A^{-1} : E^n \to E^n$, then $\|A^{-1}\| = \frac{1}{\sqrt{\lambda_1}}$.

d) If $(a_j^i)$ is the matrix of the operator $A : E^n \to E^n$ in some basis, then the estimates

$$\max_{1 \le i \le n} \sqrt{\sum_{j=1}^n (a_j^i)^2} \le \|A\| \le \sqrt{\sum_{i,j=1}^n (a_j^i)^2} \le \sqrt{n}\,\|A\|$$

hold.

5. Let $P[x]$ be the vector space of polynomials in the variable $x$ with real coefficients. We define the norm of the vector $P \in P[x]$ by the formula

1^1 = 

a) Is the operator $D : P[x] \to P[x]$ given by differentiation ($D(P(x)) := P'(x)$) continuous in the resulting space?

b) Find the norm of the operator $F : P[x] \to P[x]$ of multiplication by $x$, which acts according to the rule $F(P(x)) = x \cdot P(x)$.

6. Using the example of projection operators in $E^2$, show that the inequality $\|B \circ A\| \le \|B\| \cdot \|A\|$ may be a strict inequality.


10.3 The Differential of a Mapping 
10.3.1 Mappings Differentiable at a Point 


Definition 1 Let $X$ and $Y$ be normed spaces. A mapping $f : E \to Y$ of a set $E \subset X$ into $Y$ is differentiable at an interior point $x \in E$ if there exists a continuous linear transformation $L(x) : X \to Y$ such that

$$f(x + h) - f(x) = L(x)h + \alpha(x; h), \tag{10.31}$$

where $\alpha(x; h) = o(h)$ as $h \to 0$, $x + h \in E$.¹


Definition 2 The function $L(x) \in \mathcal{L}(X; Y)$ that is linear with respect to $h$ and satisfies relation (10.31) is called the differential, the tangent mapping, or the derivative of the mapping $f : E \to Y$ at the point $x$.

As before, we shall denote $L(x)$ by $df(x)$, $Df(x)$, or $f'(x)$.

We thus see that the general definition of differentiability of a mapping at a point is a nearly verbatim repetition of the one already familiar to us from Sect. 8.2, where it was considered in the case $X = \mathbb{R}^m$, $Y = \mathbb{R}^n$. For that reason, from now on we shall allow ourselves to use such concepts introduced there as increment of a function, increment of the argument, and tangent space at a point without repeating the explanations, preserving the corresponding notation.

We shall, however, verify the following proposition in general form. 

Proposition 1 If a mapping $f : E \to Y$ is differentiable at an interior point $x$ of a set $E \subset X$, its differential $L(x)$ at that point is uniquely determined.

Proof Thus we are verifying the uniqueness of the differential.

Let $L_1(x)$ and $L_2(x)$ be linear mappings satisfying relation (10.31), that is

$$\begin{aligned}
f(x + h) - f(x) - L_1(x)h &= \alpha_1(x; h), \\
f(x + h) - f(x) - L_2(x)h &= \alpha_2(x; h),
\end{aligned} \tag{10.32}$$

where $\alpha_i(x; h) = o(h)$ as $h \to 0$, $x + h \in E$, $i = 1, 2$.

Then, setting $L(x) = L_2(x) - L_1(x)$ and $\alpha(x; h) = \alpha_1(x; h) - \alpha_2(x; h)$ and subtracting the second equality in (10.32) from the first, we obtain

$$L(x)h = \alpha(x; h).$$

Here $L(x)$ is a mapping that is linear with respect to $h$, and $\alpha(x; h) = o(h)$ as $h \to 0$, $x + h \in E$. Taking an auxiliary numerical parameter $\lambda$, we can now write

$$|L(x)h| = \frac{|L(x)(\lambda h)|}{|\lambda|} = \frac{|\alpha(x; \lambda h)|}{|\lambda h|}\,|h| \to 0 \quad \text{as } \lambda \to 0.$$

Thus $L(x)h = 0$ for any $h \ne 0$ (we recall that $x$ is an interior point of $E$). Since $L(x)0 = 0$, we have shown that $L_1(x)h = L_2(x)h$ for every value of $h$. □

¹ The notation “$\alpha(x; h) = o(h)$ as $h \to 0$, $x + h \in E$”, of course, means that $\lim_{h \to 0,\ x + h \in E} |\alpha(x; h)|_Y \cdot |h|_X^{-1} = 0$.

If $E$ is an open subset of $X$ and $f : E \to Y$ is a mapping that is differentiable at each point $x \in E$, that is, differentiable on $E$, then by the uniqueness of the differential of a mapping at a point, which was just proved, a function $E \ni x \mapsto f'(x) \in \mathcal{L}(X; Y)$ arises on the set $E$, which we denote $f' : E \to \mathcal{L}(X; Y)$. This mapping is called the derivative of $f$, or the derivative mapping relative to the original mapping $f : E \to Y$. The value $f'(x)$ of this function at an individual point $x \in E$ is the continuous linear transformation $f'(x) \in \mathcal{L}(X; Y)$ that is the differential or derivative of the function $f$ at the particular point $x \in E$.

We note that by the requirement of continuity of the linear mapping L(x) 
Eq. (10.31) implies that a mapping that is differentiable at a point is necessarily 
continuous at that point. 

The converse is of course not true, as we have seen in the case of numerical 
functions. 

We now make one more important remark. 

Remark If the condition for differentiability of the mapping $f$ at some point $a$ is written as

$$f(x) - f(a) = L(a)(x - a) + \alpha(a; x),$$

where $\alpha(a; x) = o(x - a)$ as $x \to a$, it becomes clear that Definition 1 actually applies to a mapping $f : A \to B$ of any affine spaces $(A, X)$ and $(B, Y)$ whose vector spaces $X$ and $Y$ are normed. Such affine spaces, called normed affine spaces, are frequently encountered, so that it is useful to keep this remark in mind when using the differential calculus.

Everything that follows, unless specifically stated otherwise, applies equally to 
both normed vector spaces and normed affine spaces, and we use the notation for 
vector spaces only for the sake of simplicity. 
10.3.2 The General Rules for Differentiation

The following general properties of the operation of differentiation follow from Definition 1. In the statements below $X$, $Y$, and $Z$ are normed spaces and $U$ and $V$ are open sets in $X$ and $Y$ respectively.


a. Linearity of Differentiation

If the mappings $f_i : U \to Y$, $i = 1, 2$, are differentiable at a point $x \in U$, a linear combination of them $(\lambda_1 f_1 + \lambda_2 f_2) : U \to Y$ is also differentiable at $x$, and

$$(\lambda_1 f_1 + \lambda_2 f_2)'(x) = \lambda_1 f_1'(x) + \lambda_2 f_2'(x).$$

Thus the differential of a linear combination of mappings is the corresponding linear combination of their differentials.


b. Differentiation of a Composition of Mappings (Chain Rule) 

If the mapping $f : U \to V$ is differentiable at a point $x \in U \subset X$, and the mapping $g : V \to Z$ is differentiable at $f(x) = y \in V \subset Y$, then the composition $g \circ f$ of these mappings is differentiable at $x$, and

$$(g \circ f)'(x) = g'(f(x)) \circ f'(x).$$

Thus, the differential of a composition is the composition of the differentials.
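In finite dimensions the chain rule says that the Jacobian matrix of $g \circ f$ is the product of the Jacobian matrices of $g$ and $f$. The following Python sketch compares that product with a central-difference approximation. The two mappings of the plane, the evaluation point, and the step size are hypothetical choices of ours, made only for illustration.

```python
# Sketch: (g o f)'(x) = g'(f(x)) o f'(x) checked by central differences
# for two hypothetical smooth mappings of the plane.

def f(p):
    x, y = p
    return [x * y, x + y * y]

def g(q):
    u, v = q
    return [u * v, u - v]

def jacobian_f(p):
    x, y = p
    return [[y, x], [1.0, 2.0 * y]]

def jacobian_g(q):
    u, v = q
    return [[v, u], [1.0, -1.0]]

def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]

def numerical_jacobian(func, p, step=1e-6):
    """Central-difference Jacobian; entry [i][k] approximates d func_i / d p_k."""
    columns = []
    for k in range(len(p)):
        forward, backward = list(p), list(p)
        forward[k] += step
        backward[k] -= step
        columns.append([(a - b) / (2.0 * step)
                        for a, b in zip(func(forward), func(backward))])
    return [[columns[k][i] for k in range(len(p))] for i in range(len(columns[0]))]

point = [1.5, -0.5]
chain = matmul(jacobian_g(f(point)), jacobian_f(point))
numeric = numerical_jacobian(lambda p: g(f(p)), point)
max_error = max(abs(chain[i][j] - numeric[i][j]) for i in range(2) for j in range(2))
print(max_error)
```

The order of the factors matters: the derivative of the outer mapping is taken at $f(x)$ and is applied after $f'(x)$.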


c. Differentiation of the Inverse of a Mapping 

Let $f : U \to Y$ be a mapping that is continuous at $x \in U \subset X$ and has an inverse $f^{-1} : V \to X$ that is defined in a neighborhood of $y = f(x)$ and continuous at that point.

If the mapping $f$ is differentiable at $x$ and its tangent mapping $f'(x) \in \mathcal{L}(X; Y)$ has a continuous inverse $[f'(x)]^{-1} \in \mathcal{L}(Y; X)$, then the mapping $f^{-1}$ is differentiable at $y = f(x)$ and

$$[f^{-1}]'(f(x)) = [f'(x)]^{-1}.$$

Thus, the differential of an inverse mapping is the linear mapping inverse to the differential of the original mapping at the corresponding point.

We omit the proofs of a, b, and c, since they are analogous to the proofs given in Sect. 8.3 for the case $X = \mathbb{R}^m$, $Y = \mathbb{R}^n$.
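Rule c can be observed numerically in the one-dimensional case, where the inverse of the linear mapping $h \mapsto f'(x)h$ is division by $f'(x)$. The sketch below is ours: the function $f(x) = x + x^3$, the bisection routine used to invert it, and the step size are illustrative assumptions.

```python
# Sketch: derivative of the inverse of the hypothetical function f(x) = x + x**3.

def f(x):
    return x + x ** 3

def f_prime(x):
    return 1.0 + 3.0 * x ** 2

def f_inverse(y, lo=-10.0, hi=10.0, iterations=200):
    """Invert the increasing function f on [lo, hi] by bisection."""
    for _ in range(iterations):
        mid = 0.5 * (lo + hi)
        if f(mid) < y:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

def inverse_derivative(y, step=1e-5):
    """Central-difference approximation of the derivative of f_inverse at y."""
    return (f_inverse(y + step) - f_inverse(y - step)) / (2.0 * step)

x = 1.0
numeric = inverse_derivative(f(x))
predicted = 1.0 / f_prime(x)   # [f'(x)]^{-1} for a mapping of the real line
print(numeric, predicted)
```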
10.3.3 Some Examples


Example 1 If $f : U \to Y$ is a constant mapping of a neighborhood $U = U(x) \subset X$ of the point $x$, that is, $f(U) = y_0 \in Y$, then $f'(x) = 0 \in \mathcal{L}(X; Y)$.

Proof Indeed, in this case it is obvious that

$$f(x + h) - f(x) - 0h = y_0 - y_0 - 0 = 0 = o(h). \qquad \square$$

Example 2 If the mapping $f : X \to Y$ is a continuous linear mapping of a normed vector space $X$ into a normed vector space $Y$, then $f'(x) = f \in \mathcal{L}(X; Y)$ at any point $x \in X$.

Proof Indeed,

$$f(x + h) - f(x) - fh = fx + fh - fx - fh = 0. \qquad \square$$

We remark that strictly speaking $f'(x) \in \mathcal{L}(TX_x; TY_{f(x)})$ here and $h$ is a vector of the tangent space $TX_x$. But parallel translation of a vector to any point $x \in X$ is defined in a vector space, and this allows us to identify the tangent space $TX_x$ with the vector space $X$ itself. (Similarly, in the case of an affine space $(A, X)$ the space $TA_a$ of vectors “attached” to the point $a \in A$ can be identified with the vector space $X$ of the given affine space.) Consequently, after choosing a basis in $X$, we can extend it to all the tangent spaces $TX_x$. This means that if, for example, $X = \mathbb{R}^m$, $Y = \mathbb{R}^n$, and the mapping $f \in \mathcal{L}(\mathbb{R}^m; \mathbb{R}^n)$ is given by the matrix $(a_i^j)$, then at every point $x \in \mathbb{R}^m$ the tangent mapping $f'(x) : T\mathbb{R}^m_x \to T\mathbb{R}^n_{f(x)}$ will be given by the same matrix.

In particular, for a linear mapping $x \overset{f}{\mapsto} ax = y$ from $\mathbb{R}$ to $\mathbb{R}$ with $x \in \mathbb{R}$ and $h \in T\mathbb{R}_x \sim \mathbb{R}$, we obtain the corresponding mapping $T\mathbb{R}_x \ni h \overset{f'}{\mapsto} ah \in T\mathbb{R}_{f(x)}$.

Taking account of these conventions, we can provisionally state the result of Example 2 as follows: The mapping $f' : X \to \mathcal{L}(X; Y)$ that is the derivative of a linear mapping $f : X \to Y$ of normed spaces is constant, and $f'(x) = f$ at each point $x \in X$.

Example 3 From the chain rule for differentiating a composition of mappings and the result of Example 2 one can conclude that if $f : U \to Y$ is a mapping of a neighborhood $U = U(x) \subset X$ of the point $x \in X$ and is differentiable at $x$, while $A \in \mathcal{L}(Y; Z)$, then

$$(A \circ f)'(x) = A \circ f'(x).$$

For numerical functions, when $Y = Z = \mathbb{R}$, this is simply the familiar possibility of moving a constant factor outside the differentiation sign.
Example 4 Suppose once again that $U = U(x)$ is a neighborhood of the point $x$ in a normed space $X$, and let

$$f : U \to Y = Y_1 \times \cdots \times Y_n$$

be a mapping of $U$ into the direct product of the normed spaces $Y_1, \ldots, Y_n$.

Defining such a mapping is equivalent to defining the $n$ mappings $f_i : U \to Y_i$, $i = 1, \ldots, n$, connected with $f$ by the relation

$$x \mapsto f(x) = y = (y_1, \ldots, y_n) = (f_1(x), \ldots, f_n(x)),$$

which holds at every point of $U$.

If we now take account of the fact that in formula (10.31) we have

$$\begin{aligned}
f(x + h) - f(x) &= (f_1(x + h) - f_1(x), \ldots, f_n(x + h) - f_n(x)), \\
L(x)h &= (L_1(x)h, \ldots, L_n(x)h), \\
\alpha(x; h) &= (\alpha_1(x; h), \ldots, \alpha_n(x; h)),
\end{aligned}$$

then, referring to the results of Example 6 of Sect. 10.1 and Example 10 of Sect. 10.2, we can conclude that the mapping $f$ is differentiable at $x$ if and only if all of its components $f_i : U \to Y_i$ are differentiable at $x$, $i = 1, \ldots, n$; and when the mapping $f$ is differentiable, we have the equality

$$f'(x) = (f_1'(x), \ldots, f_n'(x)).$$


Example 5 Now let $A \in \mathcal{L}(X_1, \ldots, X_n; Y)$, that is, $A$ is a continuous $n$-linear transformation from the product $X_1 \times \cdots \times X_n$ of the normed vector spaces $X_1, \ldots, X_n$ into the normed vector space $Y$.

We shall prove that the mapping

$$A : X_1 \times \cdots \times X_n = X \to Y$$

is differentiable and find its differential.

Proof Using the multilinearity of $A$, we find that

$$\begin{aligned}
A(x + h) - A(x) &= A(x_1 + h_1, \ldots, x_n + h_n) - A(x_1, \ldots, x_n) = {} \\
&= A(x_1, \ldots, x_n) + A(h_1, x_2, \ldots, x_n) + \cdots + A(x_1, \ldots, x_{n-1}, h_n) + {} \\
&\quad + A(h_1, h_2, x_3, \ldots, x_n) + \cdots + A(x_1, \ldots, x_{n-2}, h_{n-1}, h_n) + {} \\
&\quad + \cdots + A(h_1, \ldots, h_n) - A(x_1, \ldots, x_n).
\end{aligned}$$

Since the norm in $X = X_1 \times \cdots \times X_n$ satisfies the inequalities

$$|x_i|_{X_i} \le |x|_X \le \sum_{i=1}^n |x_i|_{X_i}$$

and the norm $\|A\|$ of the transformation $A$ is finite and satisfies

$$|A(\xi_1, \ldots, \xi_n)| \le \|A\|\,|\xi_1| \times \cdots \times |\xi_n|,$$

we can conclude that

$$\begin{aligned}
A(x + h) - A(x) &= A(x_1 + h_1, \ldots, x_n + h_n) - A(x_1, \ldots, x_n) = {} \\
&= A(h_1, x_2, \ldots, x_n) + \cdots + A(x_1, \ldots, x_{n-1}, h_n) + \alpha(x; h),
\end{aligned}$$

where $\alpha(x; h) = o(h)$ as $h \to 0$.

But the transformation

$$L(x)h = A(h_1, x_2, \ldots, x_n) + \cdots + A(x_1, \ldots, x_{n-1}, h_n)$$

is a continuous transformation (because $A$ is continuous) that is linear in $h = (h_1, \ldots, h_n)$.

Thus we have established that

$$A'(x)h = A'(x_1, \ldots, x_n)(h_1, \ldots, h_n) = A(h_1, x_2, \ldots, x_n) + \cdots + A(x_1, \ldots, x_{n-1}, h_n),$$

or, more briefly,

$$dA(x_1, \ldots, x_n) = A(dx_1, x_2, \ldots, x_n) + \cdots + A(x_1, \ldots, x_{n-1}, dx_n). \qquad \square$$


In particular, if:

a) $x_1 \cdot \ldots \cdot x_n$ is the product of $n$ numerical variables, then

$$d(x_1 \cdot \ldots \cdot x_n) = dx_1 \cdot x_2 \cdot \ldots \cdot x_n + \cdots + x_1 \cdot \ldots \cdot x_{n-1} \cdot dx_n;$$

b) $\langle x_1, x_2 \rangle$ is the inner product in $E^3$, then

$$d\langle x_1, x_2 \rangle = \langle dx_1, x_2 \rangle + \langle x_1, dx_2 \rangle;$$

c) $[x_1, x_2]$ is the vector cross product in $E^3$, then

$$d[x_1, x_2] = [dx_1, x_2] + [x_1, dx_2];$$

d) $(x_1, x_2, x_3)$ is the scalar triple product in $E^3$, then

$$d(x_1, x_2, x_3) = (dx_1, x_2, x_3) + (x_1, dx_2, x_3) + (x_1, x_2, dx_3);$$

e) $\det(x_1, \ldots, x_n)$ is the determinant of the matrix formed from the coordinates of $n$ vectors $x_1, \ldots, x_n$ in an $n$-dimensional vector space $X$ with a fixed basis, then

$$d(\det(x_1, \ldots, x_n)) = \det(dx_1, x_2, \ldots, x_n) + \cdots + \det(x_1, \ldots, x_{n-1}, dx_n).$$
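The formula for the differential of a multilinear mapping can be tested on the determinant of three vectors. In the Python sketch below the vectors `x` and the increments `h` are arbitrary hypothetical data, and `remainder(t)` measures what is left of the increment of the determinant after the linear part is subtracted. For a trilinear mapping this remainder consists only of terms of second and third order in $t$, so the quotient `remainder(t) / t` should tend to zero.

```python
# Sketch: d(det(x1, x2, x3)) = det(dx1, x2, x3) + det(x1, dx2, x3) + det(x1, x2, dx3),
# checked on hypothetical data in three dimensions.

def det3(x1, x2, x3):
    """Determinant of the 3x3 matrix whose rows are x1, x2, x3."""
    return (x1[0] * (x2[1] * x3[2] - x2[2] * x3[1])
            - x1[1] * (x2[0] * x3[2] - x2[2] * x3[0])
            + x1[2] * (x2[0] * x3[1] - x2[1] * x3[0]))

def differential_of_det(x, h):
    """Value of the differential of det at x on the increment h."""
    x1, x2, x3 = x
    h1, h2, h3 = h
    return det3(h1, x2, x3) + det3(x1, h2, x3) + det3(x1, x2, h3)

x = ([1.0, 2.0, 0.0], [0.0, 1.0, 3.0], [2.0, 0.0, 1.0])
h = ([0.5, -1.0, 2.0], [1.0, 0.0, -0.5], [0.0, 2.0, 1.0])

def remainder(t):
    """det(x + t h) - det(x) - t * (differential of det at x applied to h)."""
    shifted = tuple([a + t * b for a, b in zip(xi, hi)] for xi, hi in zip(x, h))
    return det3(*shifted) - det3(*x) - t * differential_of_det(x, h)

quotients = [abs(remainder(t)) / t for t in (1e-1, 1e-2, 1e-3)]
print(quotients)
```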
Example 6 Let $U$ be the subset of $\mathcal{L}(X; Y)$ consisting of the continuous linear transformations $A : X \to Y$ having continuous inverse transformations $A^{-1} : Y \to X$ (belonging to $\mathcal{L}(Y; X)$). Consider the mapping

$$U \ni A \mapsto A^{-1} \in \mathcal{L}(Y; X),$$

which assigns to each transformation $A \in U$ its inverse $A^{-1} \in \mathcal{L}(Y; X)$.

Proposition 2 proved below makes it possible to determine whether this mapping is differentiable.

Proposition 2 If $X$ is a complete space and $A \in U$, then for any $h \in \mathcal{L}(X; Y)$ such that $\|h\| < \|A^{-1}\|^{-1}$, the transformation $A + h$ also belongs to $U$ and the following relation holds:

$$(A + h)^{-1} = A^{-1} - A^{-1}hA^{-1} + o(h) \quad \text{as } h \to 0. \tag{10.33}$$

Proof Since

$$(A + h)^{-1} = \bigl(A(E + A^{-1}h)\bigr)^{-1} = (E + A^{-1}h)^{-1}A^{-1}, \tag{10.34}$$

it suffices to find the operator $(E + A^{-1}h)^{-1}$ inverse to $(E + A^{-1}h) \in \mathcal{L}(X; X)$, where $E$ is the identity mapping $e_X$ of $X$ into itself.

Let $\Delta := -A^{-1}h$. Taking account of the supplement to Proposition 2 of Sect. 10.2, we can observe that $\|\Delta\| \le \|A^{-1}\| \cdot \|h\|$, so that by the assumptions made with respect to the operator $h$ we may assume that $\|\Delta\| \le q < 1$.

We now verify that

$$(E - \Delta)^{-1} = E + \Delta + \Delta^2 + \cdots + \Delta^n + \cdots, \tag{10.35}$$

where the series on the right-hand side is formed from the linear operators $\Delta^n = (\Delta \circ \cdots \circ \Delta) \in \mathcal{L}(X; X)$.

Since $X$ is a complete normed vector space, it follows from Proposition 3 of Sect. 10.2 that the space $\mathcal{L}(X; X)$ is also complete. It then follows immediately from the relation $\|\Delta^n\| \le \|\Delta\|^n \le q^n$ and the convergence of the series $\sum_{n=0}^{\infty} q^n$ for $|q| < 1$ that the series (10.35) formed from the vectors in that space converges.

The direct verification that

$$(E + \Delta + \Delta^2 + \cdots)(E - \Delta) = (E + \Delta + \Delta^2 + \cdots) - (\Delta + \Delta^2 + \Delta^3 + \cdots) = E$$

and

$$(E - \Delta)(E + \Delta + \Delta^2 + \cdots) = (E + \Delta + \Delta^2 + \cdots) - (\Delta + \Delta^2 + \Delta^3 + \cdots) = E$$

shows that we have indeed found $(E - \Delta)^{-1}$.
It is worth remarking that the freedom in carrying out arithmetic operations on series (rearranging the terms!) in this case is guaranteed by the absolute convergence (convergence in norm) of the series under consideration.
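The convergence of the series (10.35) can be watched on a small matrix. In the sketch below the matrix `delta` is a hypothetical example whose largest absolute row sum is 0.7, so its operator norm with respect to the max norm on the plane (an assumption of ours) is less than 1. The partial sum is multiplied by $E - \Delta$ on both sides and compared with the identity matrix.

```python
# Sketch: partial sums of E + D + D^2 + ... approximate the inverse of E - D
# when the norm of D is less than 1 (here D is a hypothetical 2x2 matrix).

def identity(size):
    return [[1.0 if i == j else 0.0 for j in range(size)] for i in range(size)]

def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]

def add(a, b, scale=1.0):
    """Return a + scale * b."""
    return [[x + scale * y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]

def neumann_partial_sum(delta, terms):
    """E + delta + delta^2 + ... + delta^terms."""
    total = identity(len(delta))
    power = identity(len(delta))
    for _ in range(terms):
        power = matmul(power, delta)
        total = add(total, power)
    return total

def max_abs_difference(a, b):
    return max(abs(x - y) for ra, rb in zip(a, b) for x, y in zip(ra, rb))

delta = [[0.2, 0.1], [-0.3, 0.4]]
E = identity(2)
partial = neumann_partial_sum(delta, 120)
e_minus_delta = add(E, delta, -1.0)
residual = max(max_abs_difference(matmul(partial, e_minus_delta), E),
               max_abs_difference(matmul(e_minus_delta, partial), E))
print(residual)
```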

Comparing relations (10.34) and (10.35), we conclude that

$$(A + h)^{-1} = A^{-1} - A^{-1}hA^{-1} + (A^{-1}h)^2A^{-1} - \cdots + (-1)^n(A^{-1}h)^nA^{-1} + \cdots \tag{10.36}$$

for $\|h\| < \|A^{-1}\|^{-1}$.

Since

$$\Bigl\|\sum_{n=2}^{\infty} (-A^{-1}h)^n A^{-1}\Bigr\| \le \sum_{n=2}^{\infty} \|A^{-1}h\|^n\,\|A^{-1}\| \le \|A^{-1}\|^3\,\|h\|^2 \sum_{m=0}^{\infty} q^m = \frac{\|A^{-1}\|^3}{1 - q}\,\|h\|^2,$$

Eq. (10.33) follows in particular from (10.36). □

Returning now to Example 6, we can say that when the space $X$ is complete the mapping $A \overset{f}{\mapsto} A^{-1}$ under consideration is necessarily differentiable, and

$$df(A)h = d(A^{-1})h = -A^{-1}hA^{-1}.$$

In particular, this means that if $A$ is a nonsingular square matrix and $A^{-1}$ is its inverse, then under a perturbation of the matrix $A$ by a matrix $h$ whose elements are close to zero, we can write the inverse matrix $(A + h)^{-1}$ in first approximation in the following form:

$$(A + h)^{-1} \approx A^{-1} - A^{-1}hA^{-1}.$$

More precise formulas can obviously be obtained starting from Eq. (10.36).
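The quality of the first approximation is easy to examine for $2 \times 2$ matrices. The sketch below uses a hypothetical matrix `A` and a hypothetical perturbation direction, scales the perturbation by $t$, and records the largest entry of the difference between the exact inverse of $A + h$ and $A^{-1} - A^{-1}hA^{-1}$. If the approximation is of first order, this error should behave like $t^2$.

```python
# Sketch: (A + h)^{-1} compared with A^{-1} - A^{-1} h A^{-1} for hypothetical 2x2 data.

def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def inverse2(a):
    """Inverse of a nonsingular 2x2 matrix by the cofactor formula."""
    det = a[0][0] * a[1][1] - a[0][1] * a[1][0]
    return [[a[1][1] / det, -a[0][1] / det],
            [-a[1][0] / det, a[0][0] / det]]

def add(a, b, scale=1.0):
    """Return a + scale * b."""
    return [[x + scale * y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]

A = [[2.0, 1.0], [1.0, 3.0]]
direction = [[0.3, -0.2], [0.1, 0.4]]
A_inv = inverse2(A)

def first_order_error(t):
    """Largest entry of (A + h)^{-1} - (A^{-1} - A^{-1} h A^{-1}) for h = t * direction."""
    h = [[t * v for v in row] for row in direction]
    exact = inverse2(add(A, h))
    linear = add(A_inv, matmul(matmul(A_inv, h), A_inv), -1.0)
    return max(abs(v) for row in add(exact, linear, -1.0) for v in row)

errors = [first_order_error(t) for t in (1e-1, 1e-2, 1e-3)]
print(errors)
```

Each tenfold reduction of the perturbation reduces the error roughly a hundredfold, in agreement with the quadratic estimate for the tail of the series (10.36).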
Example 7 Let $X$ be a complete normed vector space. The important mapping

$$\exp : \mathcal{L}(X; X) \to \mathcal{L}(X; X)$$

is defined as follows:

$$\exp A := E + \frac{1}{1!}A + \frac{1}{2!}A^2 + \cdots + \frac{1}{n!}A^n + \cdots, \tag{10.37}$$

if $A \in \mathcal{L}(X; X)$.

The series in (10.37) converges, since $\mathcal{L}(X; X)$ is a complete space and $\bigl\|\frac{1}{n!}A^n\bigr\| \le \frac{\|A\|^n}{n!}$, while the numerical series $\sum_{n=0}^{\infty} \frac{\|A\|^n}{n!}$ converges.

It is not difficult to verify that

$$\exp(A + h) = \exp A + L(A)h + o(h) \quad \text{as } h \to 0, \tag{10.38}$$

where

$$L(A)h = h + \frac{1}{2!}(Ah + hA) + \frac{1}{3!}(A^2h + AhA + hA^2) + \cdots + \frac{1}{n!}(A^{n-1}h + A^{n-2}hA + \cdots + AhA^{n-2} + hA^{n-1}) + \cdots$$

and $\|L(A)\| \le \exp\|A\| = e^{\|A\|}$, that is, $L(A) \in \mathcal{L}(\mathcal{L}(X; X); \mathcal{L}(X; X))$.

Thus, the mapping $\mathcal{L}(X; X) \ni A \mapsto \exp A \in \mathcal{L}(X; X)$ is differentiable at every value of $A$.

We remark that if the operators $A$ and $h$ commute, that is, $Ah = hA$, then, as one can see from the expression for $L(A)h$, in this case we have $L(A)h = (\exp A)h$. In particular, for $X = \mathbb{R}$ or $X = \mathbb{C}$, instead of (10.38) we again obtain

$$\exp(A + h) = \exp A + (\exp A)h + o(h) \quad \text{as } h \to 0. \tag{10.39}$$
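The series for $\exp A$ and for $L(A)h$ can both be summed numerically. The sketch below is an illustration of ours for $2 \times 2$ matrices. It writes $S_n = A^{n-1}h + A^{n-2}hA + \cdots + hA^{n-1}$ for the bracket in the $n$th term of $L(A)h$ and uses the recurrence $S_1 = h$, $S_{n+1} = AS_n + hA^n$, which follows directly from that definition. The matrices, the number of terms, and the step $t$ are hypothetical choices.

```python
# Sketch: exp A and its differential L(A)h by truncated series (hypothetical 2x2 data).

def identity(size):
    return [[1.0 if i == j else 0.0 for j in range(size)] for i in range(size)]

def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]

def add(a, b, scale=1.0):
    """Return a + scale * b."""
    return [[x + scale * y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]

def mat_exp(a, terms=30):
    """Partial sum E + A + A^2/2! + ... + A^terms/terms! of the series (10.37)."""
    result = identity(len(a))
    term = identity(len(a))
    for n in range(1, terms + 1):
        term = [[v / n for v in row] for row in matmul(term, a)]
        result = add(result, term)
    return result

def exp_differential(a, h, terms=30):
    """Partial sum of L(A)h, using S_1 = h and S_{n+1} = A S_n + h A^n."""
    total = [[0.0] * len(a) for _ in a]
    power = identity(len(a))   # holds A^(n-1) at the start of step n
    s = h                      # holds S_n at the start of step n
    factorial = 1.0
    for n in range(1, terms + 1):
        factorial *= n
        total = add(total, s, 1.0 / factorial)
        power = matmul(power, a)
        s = add(matmul(a, s), matmul(h, power))
    return total

def max_abs_difference(a, b):
    return max(abs(x - y) for ra, rb in zip(a, b) for x, y in zip(ra, rb))

A = [[0.0, 1.0], [-1.0, 0.0]]
h = [[1.0, 0.0], [0.0, 2.0]]   # does not commute with A
t = 1e-4
quotient = add(mat_exp(add(A, h, t)), mat_exp(add(A, h, -t)), -1.0)
quotient = [[v / (2.0 * t) for v in row] for row in quotient]
general_error = max_abs_difference(quotient, exp_differential(A, h))
commuting_error = max_abs_difference(exp_differential(A, A), matmul(mat_exp(A), A))
print(general_error, commuting_error)
```

The first number compares $L(A)h$ with a symmetric difference quotient of $\exp$ in a noncommuting direction. The second checks the commuting case $L(A)h = (\exp A)h$ with $h = A$.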


Example 8 We shall attempt to give a mathematical description of the instantaneous angular velocity of a rigid body with a fixed point $o$ (a top). Consider an orthonormal frame $\{e_1, e_2, e_3\}$ at the point $o$ rigidly attached to the body. It is clear that the position of the body is completely characterized by the position of this orthoframe, and the triple $\{\dot e_1, \dot e_2, \dot e_3\}$ of instantaneous velocities of the vectors of the frame obviously gives a complete characterization of the instantaneous angular velocity of the body. The position of the frame $\{e_1, e_2, e_3\}$ itself at time $t$ can be given by an orthogonal matrix $(\alpha_i^j)$, $i, j = 1, 2, 3$, composed of the coordinates of the vectors $e_1, e_2, e_3$ with respect to some fixed orthonormal frame in space. Thus, the motion of the top corresponds to a mapping $t \mapsto O(t)$ from $\mathbb{R}$ (the time axis) into the group $SO(3)$ of special orthogonal $3 \times 3$ matrices. Consequently, the angular velocity of the body, which we have agreed to describe by the triple $\{\dot e_1, \dot e_2, \dot e_3\}$, is the matrix $\dot O(t)$, which is the derivative of the matrix $O(t) = (\alpha_i^j)(t)$ with respect to time.

Since $O(t)$ is an orthogonal matrix, the relation

$$O(t)O^*(t) = E \tag{10.40}$$

holds at any time $t$, where $O^*(t)$ is the transpose of $O(t)$ and $E$ is the identity matrix.

We remark that the product $A \cdot B$ of matrices is a bilinear function of $A$ and $B$, and the derivative of the transposed matrix is obviously the transpose of the derivative of the original matrix. Differentiating (10.40) and taking account of these things, we find that

$$\dot O(t)O^*(t) + O(t)\dot O^*(t) = 0$$

or

$$\dot O(t) = -O(t)\dot O^*(t)O(t), \tag{10.41}$$

since $O^*(t)O(t) = E$.

In particular, if we assume that the frame $\{e_1, e_2, e_3\}$ coincides with the spatial frame of reference at time $t$, then $O(t) = E$, and it follows from (10.41) that

$$\dot O(t) = -\dot O^*(t), \tag{10.42}$$

that is, the matrix $\dot O(t) =: \Omega(t) = (\omega_i^j)$ of coordinates of the vectors $\{\dot e_1, \dot e_2, \dot e_3\}$ in the basis $\{e_1, e_2, e_3\}$ turns out to be skew-symmetric:

$$\Omega(t) = (\omega_i^j) = \begin{pmatrix} 0 & -\omega^3 & \omega^2 \\ \omega^3 & 0 & -\omega^1 \\ -\omega^2 & \omega^1 & 0 \end{pmatrix}.$$

Thus the instantaneous angular velocity of a top is actually characterized by three independent parameters, as follows in our line of reasoning from relation (10.40) and is natural from the physical point of view, since the position of the frame $\{e_1, e_2, e_3\}$, and hence the position of the body itself, can be described by three independent parameters (in mechanics these parameters may be, for example, the Euler angles).

If we associate with each vector $\omega = \omega^1 e_1 + \omega^2 e_2 + \omega^3 e_3$ in the tangent space at the point $o$ a right-handed rotation of space with angular velocity $|\omega|$ about the axis defined by this vector, it is not difficult to conclude from these results that at each instant of time $t$ the body has an instantaneous angular velocity and that the velocity at that time can be adequately described by the instantaneous angular velocity vector $\omega(t)$ (see Problem 5 below).
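Relations (10.41) and (10.42) can be checked on a concrete motion. The sketch below is an illustration of ours: the motion `frame(t)`, a rotation about the first axis followed by a rotation about the third axis, is a hypothetical choice with `frame(0)` equal to the identity matrix, and the time derivative is approximated by a central difference. The components of the angular velocity vector are read off from the skew-symmetric matrix in the positions occupied by $\omega^1$, $\omega^2$, $\omega^3$ in the matrix $\Omega(t)$ above.

```python
# Sketch: numerical check of (10.41) and (10.42) for a hypothetical motion in SO(3).
import math

def rot_z(angle):
    c, s = math.cos(angle), math.sin(angle)
    return [[c, -s, 0.0], [s, c, 0.0], [0.0, 0.0, 1.0]]

def rot_x(angle):
    c, s = math.cos(angle), math.sin(angle)
    return [[1.0, 0.0, 0.0], [0.0, c, -s], [0.0, s, c]]

def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]

def transpose(a):
    return [list(column) for column in zip(*a)]

def add(a, b):
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]

def max_abs(a):
    return max(abs(v) for row in a for v in row)

def frame(t):
    """A hypothetical motion O(t) in SO(3) with O(0) = E."""
    return matmul(rot_z(0.7 * t), rot_x(1.3 * t))

def frame_velocity(t, step=1e-5):
    """Central-difference approximation of the derivative of O(t)."""
    plus, minus = frame(t + step), frame(t - step)
    return [[(p - m) / (2.0 * step) for p, m in zip(rp, rm)]
            for rp, rm in zip(plus, minus)]

# Relation (10.41) at an arbitrary time: dO/dt + O (dO/dt)* O = 0.
O_t, O_dot = frame(0.4), frame_velocity(0.4)
residual_10_41 = max_abs(add(O_dot, matmul(matmul(O_t, transpose(O_dot)), O_t)))

# At t = 0 the moving frame coincides with the fixed one, so dO/dt is skew-symmetric.
Omega = frame_velocity(0.0)
symmetric_part = max_abs(add(Omega, transpose(Omega)))
omega = (Omega[2][1], Omega[0][2], Omega[1][0])   # (omega^1, omega^2, omega^3)
print(residual_10_41, symmetric_part, omega)
```

For this motion the vector `omega` at time zero is approximately $(1.3, 0, 0.7)$, the sum of the angular velocities of the two elementary rotations.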


10.3.4 The Partial Derivatives of a Mapping 

Let $U = U(a)$ be a neighborhood of the point $a \in X = X_1 \times \cdots \times X_m$ in the direct product of the normed spaces $X_1, \ldots, X_m$, and let $f : U \to Y$ be a mapping of $U$ into the normed space $Y$. In this case

$$y = f(x) = f(x_1, \ldots, x_m), \tag{10.43}$$

and hence, if we fix all the variables but $x_i$ in (10.43) by setting $x_k = a_k$ for $k \in \{1, \ldots, m\} \setminus \{i\}$, we obtain a function

$$f(a_1, \ldots, a_{i-1}, x_i, a_{i+1}, \ldots, a_m) =: \varphi_i(x_i), \tag{10.44}$$

defined in some neighborhood $U_i$ of $a_i$ in $X_i$.

Definition 3 Relative to the original mapping (10.43) the mapping $\varphi_i : U_i \to Y$ is called the partial mapping with respect to the variable $x_i$ at $a \in X$.

Definition 4 If the mapping (10.44) is differentiable at $x_i = a_i$, its derivative at that point is called the partial derivative or partial differential of $f$ at $a$ with respect to the variable $x_i$.

We usually denote this partial derivative by one of the symbols

$$\partial_i f(a), \quad D_i f(a), \quad \frac{\partial f}{\partial x_i}(a), \quad f'_{x_i}(a).$$

In accordance with these definitions $D_i f(a) \in \mathcal{L}(X_i; Y)$. More precisely, $D_i f(a) \in \mathcal{L}(TX_i(a_i); TY(f(a)))$.

The differential $df(a)$ of the mapping (10.43) at the point $a$ (if $f$ is differentiable at that point) is often called the total differential in this situation in order to distinguish it from the partial differentials with respect to the individual variables.

We have already encountered all these concepts in the case of real-valued functions of $m$ real variables, so that we shall not give a detailed discussion of them. We remark only that by repeating our earlier reasoning, taking account of Example 9 in Sect. 10.2, one can prove easily that the following proposition holds in general.

Proposition 3 If the mapping (10.43) is differentiable at the point $a = (a_1, \ldots, a_m) \in X_1 \times \cdots \times X_m = X$, it has partial derivatives with respect to each variable at that point, and the total differential and the partial differentials are related by the equation

$$df(a)h = \partial_1 f(a)h_1 + \cdots + \partial_m f(a)h_m, \tag{10.45}$$

where $h = (h_1, \ldots, h_m) \in TX_1(a_1) \times \cdots \times TX_m(a_m) = TX(a)$.


We have already shown by the example of numerical functions that the existence 
of partial derivatives does not in general guarantee the differentiability of the func- 
tion (10.43). 
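A standard illustration of this phenomenon can be reproduced in a few lines. The function below is our choice and is not necessarily the example referred to above: it equals $xy/(x^2 + y^2)$ away from the origin and $0$ at the origin. Both partial difference quotients at the origin vanish identically, so both partial derivatives exist and equal zero, yet the function takes the value $1/2$ at points of the diagonal arbitrarily close to the origin. It is therefore not even continuous there, and hence not differentiable.

```python
# Sketch: partial derivatives exist at the origin, but the function is discontinuous there.

def f(x, y):
    """Hypothetical example: xy / (x^2 + y^2), extended by f(0, 0) = 0."""
    if x == 0.0 and y == 0.0:
        return 0.0
    return x * y / (x * x + y * y)

def partial_x_quotient(step):
    """Difference quotient for the partial derivative in x at the origin."""
    return (f(step, 0.0) - f(0.0, 0.0)) / step

def partial_y_quotient(step):
    """Difference quotient for the partial derivative in y at the origin."""
    return (f(0.0, step) - f(0.0, 0.0)) / step

steps = [10.0 ** -k for k in range(1, 8)]
quotients_x = [partial_x_quotient(s) for s in steps]
quotients_y = [partial_y_quotient(s) for s in steps]
diagonal_values = [f(s, s) for s in steps]
print(quotients_x, quotients_y, diagonal_values)
```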


10.3.5 Problems and Exercises 


1. a) Let $A \in \mathcal{L}(X; X)$ be a nilpotent operator, that is, there exists $k \in \mathbb{N}$ such that $A^k = 0$. Show that the operator $(E - A)$ has an inverse in this case and that $(E - A)^{-1} = E + A + \cdots + A^{k-1}$.

b) Let $D : P[x] \to P[x]$ be the operator of differentiation on the vector space $P[x]$ of polynomials. Remarking that $D$ is a nilpotent operator, write the operator $\exp(aD)$, where $a \in \mathbb{R}$, and show that $\exp(aD)(P(x)) = P(x + a) =: T_a(P(x))$.
c) Write the matrices of the operators $D : P_n[x] \to P_n[x]$ and $T_a : P_n[x] \to P_n[x]$ from part b) in the basis $e_i =$ , $1 \le i \le n$, in the space $P_n[x]$ of real polynomials of degree $n$ in one variable.

2. a) If $A, B \in \mathcal{L}(X; X)$ and $\exists B^{-1} \in \mathcal{L}(X; X)$, then $\exp(B^{-1}AB) = B^{-1}(\exp A)B$.

b) If $AB = BA$, then $\exp(A + B) = \exp A \cdot \exp B$.

c) Verify that $\exp 0 = E$ and that $\exp A$ always has an inverse, namely $(\exp A)^{-1} = \exp(-A)$.

3. Let $A \in \mathcal{L}(X; X)$. Consider the mapping $\varphi_A : \mathbb{R} \to \mathcal{L}(X; X)$ defined by the correspondence $\mathbb{R} \ni t \mapsto \exp(tA) \in \mathcal{L}(X; X)$. Show the following.

a) The mapping $\varphi_A$ is continuous.

b) $\varphi_A$ is a homomorphism of $\mathbb{R}$ as an additive group into the multiplicative group of invertible operators in $\mathcal{L}(X; X)$.

4. Verify the following.

a) If $\lambda_1, \ldots, \lambda_n$ are the eigenvalues of the operator $A \in \mathcal{L}(\mathbb{C}^n; \mathbb{C}^n)$, then $\exp \lambda_1, \ldots, \exp \lambda_n$ are the eigenvalues of $\exp A$.

b) $\det(\exp A) = \exp(\operatorname{tr} A)$, where $\operatorname{tr} A$ is the trace of the operator $A \in \mathcal{L}(\mathbb{C}^n; \mathbb{C}^n)$.

c) If $A \in \mathcal{L}(\mathbb{R}^n; \mathbb{R}^n)$, then $\det(\exp A) > 0$.

d) If $A^*$ is the transpose of the matrix $A \in \mathcal{L}(\mathbb{C}^n; \mathbb{C}^n)$ and $\bar A$ is the matrix whose elements are the complex conjugates of those of $A$, then $(\exp A)^* = \exp A^*$ and $\overline{\exp A} = \exp \bar A$.

e) The matrix $\begin{pmatrix} -1 & 0 \\ 1 & -1 \end{pmatrix}$ is not of the form $\exp A$ for any $2 \times 2$ matrix $A$.

5. We recall that a set endowed with both a group structure and a topology is called a topological group or continuous group if the group operation is continuous. If there is a sense in which the group operation is even analytic, the topological group is called a Lie group.²

A Lie algebra is a vector space $X$ with an anticommutative bilinear operation $[\,,] : X \times X \to X$ satisfying the Jacobi identity $[[a, b], c] + [[b, c], a] + [[c, a], b] = 0$ for any vectors $a, b, c \in X$. Lie groups and algebras are closely connected with each other, and the mapping $\exp$ plays an important role in establishing this connection (see Problem 1 above).

An example of a Lie algebra is the oriented Euclidean space $E^3$ with the operation of the vector cross product. For the time being we shall denote this Lie algebra by $LA_1$.

a) Show that the real $3 \times 3$ skew-symmetric matrices form a Lie algebra (which we denote $LA_2$) if the product of the matrices $A$ and $B$ is defined as $[A, B] = AB - BA$.


² For the precise definition of a Lie group and the corresponding reference see Problem 8 in Sect. 15.2.

b) Show that the correspondence

$$\Omega = \begin{pmatrix} 0 & -\omega_3 & \omega_2 \\ \omega_3 & 0 & -\omega_1 \\ -\omega_2 & \omega_1 & 0 \end{pmatrix} \leftrightarrow (\omega_1, \omega_2, \omega_3) = \omega$$

is an isomorphism of the Lie algebras $LA_2$ and $LA_1$.

c) Verify that if the skew-symmetric matrix $\Omega$ and the vector $\omega$ correspond to each other as shown in b), then the equality $\Omega r = [\omega, r]$ holds for any vector $r \in E^3$, and the relation $P \Omega P^{-1} \leftrightarrow P\omega$ holds for any matrix $P \in SO(3)$.

d) Verify that if $\mathbb{R} \ni t \mapsto O(t) \in SO(3)$ is a smooth mapping, then the matrix $\Omega(t) = O^{-1}(t)\dot{O}(t)$ is skew-symmetric.

e) Show that if $r(t)$ is the radius vector of a point of a rotating top and $\Omega(t)$ is the matrix $(O^{-1}\dot{O})(t)$ found in d), then $\dot{r}(t) = (\Omega r)(t)$.

f) Let $r$ and $\omega$ be two vectors attached at the origin of $E^3$. Suppose a right-handed frame has been chosen in $E^3$, and that the space undergoes a right-handed rotation with angular velocity $|\omega|$ about the axis defined by $\omega$. Show that $\dot{r}(t) = [\omega, r(t)]$ in this case.

g) Summarize the results of d), e), and f) and exhibit the instantaneous angular velocity of the rotating top discussed in Example 8.

h) Using the result of c), verify that the velocity vector $\omega$ is independent of the choice of the fixed orthoframe in $E^3$, that is, it is independent of the coordinate system.
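
Parts b) and c) of Problem 5 lend themselves to a quick numerical sanity check. The sketch below assumes the usual convention in which the vector $\omega = (\omega_1, \omega_2, \omega_3)$ corresponds to the skew-symmetric matrix $\Omega$ acting by $\Omega r = [\omega, r]$; the function names `hat`, `cross`, and `commutator` are hypothetical helpers introduced only for this illustration.

```python
# Check that w -> hat(w) turns the cross product into the matrix commutator.
# Convention (an assumption): hat(w) r = w x r.

def hat(w):
    """Skew-symmetric 3x3 matrix corresponding to the vector w."""
    w1, w2, w3 = w
    return [[0.0, -w3, w2],
            [w3, 0.0, -w1],
            [-w2, w1, 0.0]]

def cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def mat_mul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]

def mat_vec(m, v):
    return tuple(sum(m[i][k] * v[k] for k in range(3)) for i in range(3))

def commutator(a, b):
    ab, ba = mat_mul(a, b), mat_mul(b, a)
    return [[ab[i][j] - ba[i][j] for j in range(3)] for i in range(3)]

a = (1.0, -2.0, 0.5)
b = (0.3, 4.0, -1.0)
r = (2.0, 1.0, -3.0)

# b) [hat(a), hat(b)] should equal hat(a x b)
lhs = commutator(hat(a), hat(b))
rhs = hat(cross(a, b))
bracket_error = max(abs(lhs[i][j] - rhs[i][j])
                    for i in range(3) for j in range(3))

# c) hat(a) r should equal a x r
action_error = max(abs(x - y) for x, y in zip(mat_vec(hat(a), r), cross(a, r)))

print(bracket_error, action_error)
```

Both errors vanish up to rounding, which is what the isomorphism of $LA_1$ and $LA_2$ predicts for this sample of vectors.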

6. Let $r = r(s) = (x^1(s), x^2(s), x^3(s))$ be the parametric equations of a smooth curve in $E^3$, the parameter being arc length along the curve (the natural parametrization of the curve).

a) Show that the vector $e_1(s) = \frac{dr}{ds}(s)$ tangent to the curve has unit length.

b) The vector $\frac{de_1}{ds}(s) = \frac{d^2 r}{ds^2}(s)$ is orthogonal to $e_1$. Let $e_2(s)$ be the unit vector formed from $\frac{de_1}{ds}(s)$. The coefficient $k(s)$ in the equality $\frac{de_1}{ds}(s) = k(s)e_2(s)$ is called the curvature of the curve at the corresponding point.

c) By constructing the vector $e_3(s) = [e_1(s), e_2(s)]$ we obtain a frame $\{e_1, e_2, e_3\}$ at each point, called the Frenet frame³ or companion trihedral of the curve. Verify the following Frenet formulas:

$$\frac{de_1}{ds}(s) = k(s)e_2(s),$$

$$\frac{de_2}{ds}(s) = -k(s)e_1(s) + \varkappa(s)e_3(s),$$

$$\frac{de_3}{ds}(s) = -\varkappa(s)e_2(s).$$

Explain the geometric meaning of the coefficient $\varkappa(s)$ called the torsion of the curve at the corresponding point.

³ J.F. Frenet (1816-1900) - French mathematician.


10.4 The Finite-Increment Theorem and Some Examples 
of Its Use 

10.4.1 The Finite-Increment Theorem 

In our study of numerical functions of one variable in Sect. 5.3.2 we proved the 
finite-increment theorem for them and discussed in detail various aspects of this 
important theorem of analysis. In the present section the finite-increment theorem 
will be proved in its general form. So that its meaning will be fully obvious, we 
advise the reader to recall the discussion in that subsection and also to pay attention 
to the geometric meaning of the norm of a linear operator (see Sect. 10.2.2). 

Theorem 1 (The finite-increment theorem) Let $f : U \to Y$ be a continuous mapping of an open set $U$ of a normed space $X$ into a normed space $Y$.

If the closed interval $[x, x + h] = \{\xi \in X \mid \xi = x + \theta h,\ 0 \le \theta \le 1\}$ is contained in $U$ and the mapping $f$ is differentiable at all points of the open interval $]x, x + h[ = \{\xi \in X \mid \xi = x + \theta h,\ 0 < \theta < 1\}$, then the following estimate holds:

$$|f(x + h) - f(x)|_Y \le \sup_{\xi \in ]x, x+h[} \|f'(\xi)\|_{\mathcal{L}(X;Y)} \, |h|_X. \tag{10.46}$$

Proof We remark first of all that if we could prove the inequality

$$|f(x'') - f(x')| \le \sup_{\xi \in [x', x'']} \|f'(\xi)\| \, |x'' - x'| \tag{10.47}$$

in which the supremum extends over the whole interval $[x', x'']$, for every closed interval $[x', x''] \subset ]x, x + h[$, then, using the continuity of $f$ and the norm together with the fact that

$$\sup_{\xi \in [x', x'']} \|f'(\xi)\| \le \sup_{\xi \in ]x, x+h[} \|f'(\xi)\|,$$

we would obtain inequality (10.46) in the limit as $x' \to x$ and $x'' \to x + h$.

Thus, it suffices to prove that

$$|f(x + h) - f(x)| \le M|h|, \tag{10.48}$$

where $M = \sup_{0 \le \theta \le 1} \|f'(x + \theta h)\|$ and the function $f$ is assumed differentiable on the entire closed interval $[x, x + h]$.

The very simple computation

$$|f(x_3) - f(x_1)| \le |f(x_3) - f(x_2)| + |f(x_2) - f(x_1)| \le M|x_3 - x_2| + M|x_2 - x_1| = M(|x_3 - x_2| + |x_2 - x_1|) = M|x_3 - x_1|,$$

which uses only the triangle inequality and the properties of a closed interval, shows that if an inequality of the form (10.48) holds on the portions $[x_1, x_2]$ and $[x_2, x_3]$ of the closed interval $[x_1, x_3]$, then it also holds on $[x_1, x_3]$.

Hence, if estimate (10.48) fails for the closed interval $[x, x + h]$, then by successive bisections, one can obtain a sequence of closed intervals $[a_k, b_k] \subset [x, x + h]$ contracting to some point $x_0 \in [x, x + h]$ such that (10.48) fails on each interval $[a_k, b_k]$. Since $x_0 \in [a_k, b_k]$, consideration of the closed intervals $[a_k, x_0]$ and $[x_0, b_k]$ enables us to assume that we have found a sequence of closed intervals of the form $[x_0, x_0 + h_k] \subset [x, x + h]$, where $h_k \to 0$ as $k \to \infty$, on which

$$|f(x_0 + h_k) - f(x_0)| > M|h_k|. \tag{10.49}$$

If we prove (10.48) with $M$ replaced by $M + \varepsilon$, where $\varepsilon$ is any positive number, we will still obtain (10.48) as $\varepsilon \to 0$, and hence we can also replace (10.49) by

$$|f(x_0 + h_k) - f(x_0)| > (M + \varepsilon)|h_k| \tag{10.49'}$$

and we can now show that this is incompatible with the assumption that $f$ is differentiable at $x_0$.

Indeed, by the assumption that $f$ is differentiable,

$$|f(x_0 + h_k) - f(x_0)| = |f'(x_0)h_k + o(h_k)| \le \|f'(x_0)\| \, |h_k| + o(|h_k|) \le (M + \varepsilon)|h_k|$$

as $h_k \to 0$. □

The finite-increment theorem has the following useful, purely technical corollary. 

Corollary If $A \in \mathcal{L}(X; Y)$, that is, $A$ is a continuous linear mapping of the normed space $X$ into the normed space $Y$ and $f : U \to Y$ is a mapping satisfying the hypotheses of the finite-increment theorem, then

$$|f(x + h) - f(x) - Ah| \le \sup_{\xi \in ]x, x+h[} \|f'(\xi) - A\| \, |h|.$$

Proof For the proof it suffices to apply the finite-increment theorem to the mapping

$$t \mapsto F(t) = f(x + th) - Ath$$

of the unit interval $[0, 1] \subset \mathbb{R}$ into $Y$, since

$$F(1) - F(0) = f(x + h) - f(x) - Ah,$$

$$F'(\theta) = f'(x + \theta h)h - Ah \quad \text{for } 0 < \theta < 1,$$

$$\|F'(\theta)\| \le \|f'(x + \theta h) - A\| \, |h|,$$

$$\sup_{0 < \theta < 1} \|F'(\theta)\| \le \sup_{\xi \in ]x, x+h[} \|f'(\xi) - A\| \, |h|. \qquad □$$

Remark As can be seen from the proof of Theorem 1, in its hypotheses there is no need to require that $f$ be differentiable as a mapping $f : U \to Y$; it suffices that its restriction to the closed interval $[x, x + h]$ be a continuous mapping of that interval and differentiable at the points of the open interval $]x, x + h[$.

This remark applies equally to the corollary of the finite-increment theorem just 
proved. 
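
The estimate (10.46) can be made concrete on a simple mapping of the plane into itself. The sketch below is an illustration under stated assumptions, not part of the text's argument: it takes $f(x, y) = (x^2 - y^2, 2xy)$ with the Euclidean norm in both copies of $\mathbb{R}^2$, computes the operator norm of the Jacobi matrix as its largest singular value, and approximates the supremum over the segment by sampling. The function names are hypothetical.

```python
import math

# Illustration of |f(x+h) - f(x)| <= sup ||f'(xi)|| * |h| for a map R^2 -> R^2.

def f(p):
    x, y = p
    return (x * x - y * y, 2.0 * x * y)

def jacobian(p):
    x, y = p
    return ((2.0 * x, -2.0 * y), (2.0 * y, 2.0 * x))

def operator_norm_2x2(j):
    """Largest singular value of a 2x2 matrix (Euclidean operator norm)."""
    (a, b), (c, d) = j
    s = a * a + b * b + c * c + d * d      # trace of J^T J
    det = a * d - b * c                    # det(J^T J) = det^2
    disc = max(s * s - 4.0 * det * det, 0.0)
    return math.sqrt((s + math.sqrt(disc)) / 2.0)

def norm(v):
    return math.hypot(v[0], v[1])

x = (1.0, 0.0)
h = (1.0, 1.0)
end = (x[0] + h[0], x[1] + h[1])

increment = norm((f(end)[0] - f(x)[0], f(end)[1] - f(x)[1]))

# Sample the segment, endpoints included; f' is continuous here, so this
# changes nothing in the supremum. Sampling only approximates it from below.
samples = 1000
sup_norm = max(
    operator_norm_2x2(jacobian((x[0] + k / samples * h[0],
                                x[1] + k / samples * h[1])))
    for k in range(samples + 1)
)
bound = sup_norm * norm(h)

print(f"|f(x+h) - f(x)| = {increment:.4f} <= {bound:.4f}")
```

For this mapping the inequality is strict; equality in (10.46) would require the derivative to stretch the direction $h$ maximally along the whole segment.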


10.4.2 Some Applications of the Finite-Increment Theorem 

a. Continuously Differentiable Mappings 


Let

$$f : U \to Y \tag{10.50}$$

be a mapping of an open subset $U$ of a normed vector space $X$ into a normed space $Y$. If $f$ is differentiable at each point $x \in U$, then, assigning to the point $x$ the mapping $f'(x) \in \mathcal{L}(X; Y)$ tangent to $f$ at that point, we obtain the derivative mapping

$$f' : U \to \mathcal{L}(X; Y). \tag{10.51}$$

Since the space $\mathcal{L}(X; Y)$ of continuous linear transformations from $X$ into $Y$ is, as we know, a normed space (with the transformation norm), it makes sense to speak of the continuity of the mapping (10.51).

Definition When the derivative mapping (10.51) is continuous in $U$, the mapping (10.50), in complete agreement with our earlier terminology, will be said to be continuously differentiable.

As before, the set of continuously differentiable mappings of type (10.50) will be denoted by the symbol $C^{(1)}(U, Y)$, or more briefly, $C^{(1)}(U)$, if it is clear from the context what the range of the mapping is.

Thus, by definition

$$f \in C^{(1)}(U, Y) \iff f' \in C(U, \mathcal{L}(X; Y)).$$

Let us see what continuous differentiability of a mapping means in different particular cases.

Example 1 Consider the familiar situation when $X = Y = \mathbb{R}$, and hence $f : U \to \mathbb{R}$ is a real-valued function of a real argument. Since any linear mapping $A \in \mathcal{L}(\mathbb{R}; \mathbb{R})$ reduces to multiplication by some number $a \in \mathbb{R}$, that is, $Ah = ah$ and obviously $\|A\| = |a|$, we find that $f'(x)h = a(x)h$, where $a(x)$ is the numerical derivative of the function $f$ at the point $x$.

Next, since

$$(f'(x + \delta) - f'(x))h = f'(x + \delta)h - f'(x)h = a(x + \delta)h - a(x)h = (a(x + \delta) - a(x))h, \tag{10.52}$$

it follows that

$$\|f'(x + \delta) - f'(x)\| = |a(x + \delta) - a(x)|$$

and hence in this case continuous differentiability of the mapping $f$ is equivalent to the concept of a continuously differentiable numerical function (of class $C^{(1)}(U, \mathbb{R})$) studied earlier.

Example 2 This time suppose that $X$ is the direct product $X_1 \times \cdots \times X_m$ of normed spaces. In this case the mapping (10.50) is a function $f(x) = f(x_1, \ldots, x_m)$ of $m$ variables $x_i \in X_i$, $i = 1, \ldots, m$, with values in $Y$.

If the mapping $f$ is differentiable at $x \in U$, its differential $df(x)$ at that point is an element of the space $\mathcal{L}(X_1 \times \cdots \times X_m = X; Y)$.

The action of $df(x)$ on a vector $h = (h_1, \ldots, h_m)$, by formula (10.45), can be represented as

$$df(x)h = \partial_1 f(x)h_1 + \cdots + \partial_m f(x)h_m,$$

where $\partial_i f(x) : X_i \to Y$, $i = 1, \ldots, m$, are the partial derivatives of the mapping $f$ at the point $x$ under consideration.

Next,

$$(df(x + \delta) - df(x))h = \sum_{i=1}^{m} (\partial_i f(x + \delta) - \partial_i f(x))h_i. \tag{10.53}$$

But by the properties of the standard norm in the direct product of normed spaces (see Example 6 in Sect. 10.1.2) and the definition of the norm of a transformation, we find that

$$\|\partial_i f(x + \delta) - \partial_i f(x)\|_{\mathcal{L}(X_i; Y)} \le \|df(x + \delta) - df(x)\|_{\mathcal{L}(X; Y)} \le \sum_{i=1}^{m} \|\partial_i f(x + \delta) - \partial_i f(x)\|_{\mathcal{L}(X_i; Y)}. \tag{10.54}$$

Thus in this case the differentiable mapping (10.50) is continuously differentiable in $U$ if and only if all its partial derivatives are continuous in $U$.

In particular, if $X = \mathbb{R}^m$ and $Y = \mathbb{R}$, we again obtain the familiar concept of a continuously differentiable numerical function of $m$ real variables (a function of class $C^{(1)}(U, \mathbb{R})$, where $U \subset \mathbb{R}^m$).

Remark It is worth noting that in writing (10.52) and (10.53) we have made essential use of the canonical identification $TX_x \sim X$, which makes it possible to compare or identify vectors lying in different tangent spaces.

We shall now show that continuously differentiable mappings satisfy a Lipschitz 
condition. 

Proposition 1 If $K$ is a convex compact set in a normed space $X$ and $f \in C^{(1)}(K, Y)$, where $Y$ is also a normed space, then the mapping $f : K \to Y$ satisfies a Lipschitz condition on $K$, that is, there exists a constant $M > 0$ such that the inequality

$$|f(x_2) - f(x_1)| \le M|x_2 - x_1| \tag{10.55}$$

holds for any points $x_1, x_2 \in K$.

Proof By hypothesis $f' : K \to \mathcal{L}(X; Y)$ is a continuous mapping of the compact set $K$ into the metric space $\mathcal{L}(X; Y)$. Since the norm is a continuous function on a normed space with its natural metric, the mapping $x \mapsto \|f'(x)\|$, being the composition of continuous functions, is itself a continuous mapping of the compact set $K$ into $\mathbb{R}$. But such a mapping is necessarily bounded. Let $M$ be a constant such that $\|f'(x)\| \le M$ at any point $x \in K$. Since $K$ is convex, for any two points $x_1 \in K$ and $x_2 \in K$ the entire interval $[x_1, x_2]$ is contained in $K$. Applying the finite-increment theorem to that interval, we immediately obtain relation (10.55). □

Proposition 2 Under the hypotheses of Proposition 1 there exists a non-negative function $\omega(\delta)$ tending to 0 as $\delta \to +0$ such that

$$|f(x + h) - f(x) - f'(x)h| \le \omega(\delta)|h| \tag{10.56}$$

at any point $x \in K$ for $|h| < \delta$ if $x + h \in K$.

Proof By the corollary to the finite-increment theorem we can write

$$|f(x + h) - f(x) - f'(x)h| \le \sup_{0 < \theta < 1} \|f'(x + \theta h) - f'(x)\| \, |h|$$

and, setting

$$\omega(\delta) = \sup_{\substack{x_1, x_2 \in K \\ |x_1 - x_2| < \delta}} \|f'(x_2) - f'(x_1)\|,$$

we obtain (10.56) in view of the uniform continuity of the function $x \mapsto f'(x)$, which is continuous on the compact set $K$. □


b. A Sufficient Condition for Differentiability 

We shall now show that by using the general finite-increment theorem, we can obtain 
a general sufficient condition for differentiability of a mapping in terms of its partial 
derivatives. 

Theorem 2 Let $U$ be a neighborhood of the point $x$ in a normed space $X = X_1 \times \cdots \times X_m$, which is the direct product of the normed spaces $X_1, \ldots, X_m$, and let $f : U \to Y$ be a mapping of $U$ into a normed space $Y$. If the mapping $f$ has partial derivatives with respect to all its variables in $U$, then it is differentiable at the point $x$ if the partial derivatives are all continuous at that point.

Proof To simplify the writing we carry out the proof for the case $m = 2$. We verify immediately that the mapping

$$Lh = \partial_1 f(x)h_1 + \partial_2 f(x)h_2,$$

which is linear in $h = (h_1, h_2)$, is the total differential of $f$ at $x$.

Making the elementary transformations

$$f(x + h) - f(x) - Lh = f(x_1 + h_1, x_2 + h_2) - f(x_1, x_2) - \partial_1 f(x)h_1 - \partial_2 f(x)h_2 =$$

$$= f(x_1 + h_1, x_2 + h_2) - f(x_1, x_2 + h_2) - \partial_1 f(x_1, x_2)h_1 + f(x_1, x_2 + h_2) - f(x_1, x_2) - \partial_2 f(x_1, x_2)h_2,$$

by the corollary to Theorem 1 we obtain

$$|f(x_1 + h_1, x_2 + h_2) - f(x_1, x_2) - \partial_1 f(x_1, x_2)h_1 - \partial_2 f(x_1, x_2)h_2| \le$$

$$\le \sup_{0 < \theta_1 < 1} \|\partial_1 f(x_1 + \theta_1 h_1, x_2 + h_2) - \partial_1 f(x_1, x_2)\| \, |h_1| + \sup_{0 < \theta_2 < 1} \|\partial_2 f(x_1, x_2 + \theta_2 h_2) - \partial_2 f(x_1, x_2)\| \, |h_2|. \tag{10.57}$$

Since $\max\{|h_1|, |h_2|\} \le |h|$, it follows obviously from the continuity of the partial derivatives $\partial_1 f$ and $\partial_2 f$ at the point $x = (x_1, x_2)$ that the right-hand side of inequality (10.57) is $o(h)$ as $h = (h_1, h_2) \to 0$. □

Corollary A mapping $f : U \to Y$ of an open subset $U$ of the normed space $X = X_1 \times \cdots \times X_m$ into a normed space $Y$ is continuously differentiable if and only if all the partial derivatives of the mapping $f$ are continuous.

Proof We have shown in Example 2 that when the mapping $f : U \to Y$ is differentiable, it is continuously differentiable if and only if its partial derivatives are continuous.

We now see that if the partial derivatives are continuous, then the mapping $f$ is automatically differentiable, and hence (by Example 2) also continuously differentiable. □


10.4.3 Problems and Exercises 

1. Let $f : I \to Y$ be a continuous mapping of the closed interval $I = [0, 1] \subset \mathbb{R}$ into a normed space $Y$ and $g : I \to \mathbb{R}$ a continuous real-valued function on $I$. Show that if $f$ and $g$ are differentiable in the open interval $]0, 1[$ and the relation $\|f'(t)\| \le g'(t)$ holds at points of this interval, then the inequality $|f(1) - f(0)| \le g(1) - g(0)$ also holds.

2. a) Let $f : I \to Y$ be a continuously differentiable mapping of the closed interval $I = [0, 1] \subset \mathbb{R}$ into a normed space $Y$. It defines a smooth path in $Y$. Define the length of that path.

b) Recall the geometric meaning of the norm of the tangent mapping and give an upper bound for the length of the path considered in a).

c) Give a geometric interpretation of the finite-increment theorem.

3. Let $f : U \to Y$ be a continuous mapping of a neighborhood $U$ of the point $a$ in a normed space $X$ into a normed space $Y$. Show that if $f$ is differentiable in $U \setminus a$ and $f'(x)$ has a limit $L \in \mathcal{L}(X; Y)$ as $x \to a$, then the mapping $f$ is differentiable at $a$ and $f'(a) = L$.

4. a) Let $U$ be an open convex subset of a normed space $X$ and $f : U \to Y$ a mapping of $U$ into a normed space $Y$. Show that if $f'(x) \equiv 0$ on $U$, then the mapping $f$ is constant.

b) Generalize the assertion of a) to the case of an arbitrary domain $U$ (that is, when $U$ is an open connected subset of $X$).

c) The partial derivative $\frac{\partial f}{\partial y}$ of a smooth function $f : D \to \mathbb{R}$ defined in a domain $D \subset \mathbb{R}^2$ of the $xy$-plane is identically zero. Is it true that $f$ is then independent of $y$ in this domain? For which domains $D$ is this true?


10.5 Higher-Order Derivatives 
10.5.1 Definition of the nth Differential 

Let $U$ be an open set in a normed space $X$ and

$$f : U \to Y \tag{10.58}$$

a mapping of $U$ into a normed space $Y$.

If the mapping (10.58) is differentiable in $U$, then the derivative of $f$, given by

$$f' : U \to \mathcal{L}(X; Y), \tag{10.59}$$

is defined in $U$.

The space $\mathcal{L}(X; Y) =: Y_1$ is a normed space relative to which the mapping (10.59) has the form (10.58), that is, $f' : U \to Y_1$, and it makes sense to speak of differentiability for it.

If the mapping (10.59) is differentiable, its derivative

$$(f')' : U \to \mathcal{L}(X; Y_1) = \mathcal{L}(X; \mathcal{L}(X; Y))$$

is called the second derivative or second differential of $f$ and denoted $f''$ or $f^{(2)}$. In general, we adopt the following inductive definition.

Definition 1 The derivative of order $n \in \mathbb{N}$ or $n$th differential of the mapping (10.58) at the point $x \in U$ is the mapping tangent to the derivative of $f$ of order $n - 1$ at that point.

If the derivative of order $k \in \mathbb{N}$ at the point $x \in U$ is denoted $f^{(k)}(x)$, Definition 1 means that

$$f^{(n)}(x) := (f^{(n-1)})'(x). \tag{10.60}$$

Thus, if $f^{(n)}(x)$ is defined, then, writing $Y_k := \mathcal{L}(X; Y_{k-1})$ in accordance with $Y_1 = \mathcal{L}(X; Y)$,

$$f^{(n)}(x) \in \mathcal{L}(X; Y_{n-1}) = \mathcal{L}(X; \mathcal{L}(X; Y_{n-2})) = \cdots = \mathcal{L}(X; \mathcal{L}(X; \ldots; \mathcal{L}(X; Y)) \ldots).$$

Consequently, by Proposition 4 of Sect. 10.2, $f^{(n)}(x)$, the differential of order $n$ of the mapping (10.58) at the point $x$, can be interpreted as an element of the space $\mathcal{L}(\underbrace{X, \ldots, X}_{n \text{ factors}}; Y)$ of continuous $n$-linear transformations.

We note once again that the tangent mapping $f'(x) : TX_x \to TY_{f(x)}$ is a mapping of tangent spaces, each of which, because of the affine or vector-space structure of the spaces being mapped, we have identified with the corresponding vector space and said on that basis that $f'(x) \in \mathcal{L}(X; Y)$. It is this device of regarding elements $f'(x_1) \in \mathcal{L}(TX_{x_1}, TY_{f(x_1)})$ and $f'(x_2) \in \mathcal{L}(TX_{x_2}, TY_{f(x_2)})$, which lie in different spaces, as vectors in the same space $\mathcal{L}(X; Y)$ that provides the basis for defining higher-order differentials of mappings of normed vector spaces. In the case of an affine or vector space there is a natural connection between vectors in the different tangent spaces corresponding to different points of the original space. In the final analysis, it is this connection that makes it possible to speak of the continuous differentiability of both the mapping (10.58) and its higher-order differentials.


10.5.2 Derivative with Respect to a Vector and Computation of the Values of the nth Differential

When we are making the abstract Definition 1 specific, the concept of the derivative with respect to a vector may be used to advantage. This concept is introduced for the general mapping (10.58) just as was done earlier in the case $X = \mathbb{R}^m$, $Y = \mathbb{R}$.

Definition 2 If $X$ and $Y$ are normed vector spaces over the field $\mathbb{R}$, the derivative of the mapping (10.58) with respect to the vector $h \in TX_x \sim X$ at the point $x \in U$ is defined as the limit

$$D_h f(x) := \lim_{\mathbb{R} \ni t \to 0} \frac{f(x + th) - f(x)}{t},$$

provided this limit exists.

It can be verified immediately that

$$D_{\lambda h} f(x) = \lambda D_h f(x) \tag{10.61}$$

and that if the mapping $f$ is differentiable at the point $x \in U$, it has a derivative at that point with respect to every vector; moreover

$$D_h f(x) = f'(x)h, \tag{10.62}$$

and, by the linearity of the tangent mapping,

$$D_{\lambda_1 h_1 + \lambda_2 h_2} f(x) = \lambda_1 D_{h_1} f(x) + \lambda_2 D_{h_2} f(x). \tag{10.63}$$

It can also be seen from Definition 2 that the value $D_h f(x)$ of the derivative of the mapping $f : U \to Y$ with respect to a vector is an element of the vector space $TY_{f(x)} \sim Y$, and that if $L$ is a continuous linear transformation from $Y$ to a normed space $Z$, then

$$D_h (L \circ f)(x) = L \circ D_h f(x). \tag{10.64}$$

We shall now try to give an interpretation to the value $f^{(n)}(x)(h_1, \ldots, h_n)$ of the $n$th differential of the mapping $f$ at the point $x$ on the set $(h_1, \ldots, h_n)$ of vectors $h_i \in TX_x \sim X$, $i = 1, \ldots, n$.

We begin with $n = 1$. In this case, by formula (10.62)

$$f'(x)(h) = f'(x)h = D_h f(x).$$

We now consider the case $n = 2$. Since $f^{(2)}(x) \in \mathcal{L}(X; \mathcal{L}(X; Y))$, fixing a vector $h_1 \in X$, we assign a linear transformation $(f^{(2)}(x)h_1) \in \mathcal{L}(X; Y)$ to it by the rule

$$h_1 \mapsto f^{(2)}(x)h_1.$$

Then, after computing the value of this operator at the vector $h_2 \in X$, we obtain an element of $Y$:

$$f^{(2)}(x)(h_1, h_2) := (f^{(2)}(x)h_1)h_2 \in Y. \tag{10.65}$$

But

$$f^{(2)}(x)h = (f')'(x)h = D_h f'(x),$$

and therefore

$$f^{(2)}(x)(h_1, h_2) = (D_{h_1} f'(x))h_2. \tag{10.66}$$

If $A \in \mathcal{L}(X; Y)$ and $h \in X$, the pairing $Ah$ can be regarded not only as a mapping $h \mapsto Ah$ from $X$ into $Y$, but as a mapping $A \mapsto Ah$ from $\mathcal{L}(X; Y)$ into $Y$, the latter mapping being linear, just like the former.

Comparing relations (10.62), (10.64), and (10.66), we can write

$$(D_{h_1} f'(x))h_2 = D_{h_1}(f'(x)h_2) = D_{h_1} D_{h_2} f(x).$$

Thus we finally obtain

$$f^{(2)}(x)(h_1, h_2) = D_{h_1} D_{h_2} f(x).$$

Similarly, one can show that the relation

$$f^{(n)}(x)(h_1, \ldots, h_n) := (\ldots(f^{(n)}(x)h_1) \ldots h_n) = D_{h_1} D_{h_2} \cdots D_{h_n} f(x) \tag{10.67}$$

holds for any $n \in \mathbb{N}$, the differentiation with respect to the vectors being carried out sequentially, starting with differentiation with respect to $h_n$ and ending with differentiation with respect to $h_1$.
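
Formula (10.67) for $n = 2$ can be tried out numerically. The following sketch rests on assumptions made only for illustration: the function $f(x, y) = x^2 y + \sin y$ on $\mathbb{R}^2$, central differences in place of the exact limit in Definition 2, and a hand-computed matrix of second partial derivatives for comparison. The names `directional` and `hessian_form` are hypothetical.

```python
import math

# Iterated derivatives with respect to vectors, approximated by central
# differences, compared with the second differential written in coordinates.

def f(p):
    x, y = p
    return x * x * y + math.sin(y)

def directional(g, h, eps=1e-4):
    """Return p -> central-difference approximation of D_h g(p)."""
    def dg(p):
        plus = g((p[0] + eps * h[0], p[1] + eps * h[1]))
        minus = g((p[0] - eps * h[0], p[1] - eps * h[1]))
        return (plus - minus) / (2.0 * eps)
    return dg

def hessian_form(p, h1, h2):
    """f''(p)(h1, h2) from the exact second partial derivatives of f."""
    x, y = p
    fxx, fxy, fyy = 2.0 * y, 2.0 * x, -math.sin(y)
    return (fxx * h1[0] * h2[0]
            + fxy * (h1[0] * h2[1] + h1[1] * h2[0])
            + fyy * h1[1] * h2[1])

p = (1.0, 2.0)
h1 = (1.0, 0.5)
h2 = (-1.0, 2.0)

# D_{h1} D_{h2} f(p): differentiate along h2 first, then along h1
d12 = directional(directional(f, h2), h1)(p)
# The opposite order, for comparison
d21 = directional(directional(f, h1), h2)(p)
exact = hessian_form(p, h1, h2)

print(d12, d21, exact)
```

The two orders of differentiation agree with each other and with the coordinate expression up to the discretization error of the difference quotients.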


10.5.3 Symmetry of the Higher-Order Differentials 

In connection with formula (10.67), which is perfectly adequate for computation as 
it now stands, the question naturally arises: To what extent does the result of the 
computation depend on the order of differentiation? 

Proposition If the form $f^{(n)}(x)$ is defined at the point $x$ for the mapping (10.58), it is symmetric with respect to any pair of its arguments.

Proof The main element in the proof is to verify that the proposition holds in the case $n = 2$.

Let $h_1$ and $h_2$ be two arbitrary fixed vectors in the space $TX_x \sim X$. Since $U$ is open in $X$, the following auxiliary function of $t$ is defined for all values of $t \in \mathbb{R}$ sufficiently close to zero:

$$F_t(h_1, h_2) = f(x + t(h_1 + h_2)) - f(x + th_1) - f(x + th_2) + f(x).$$

We consider also the following auxiliary function:

$$g(v) = f(x + t(h_1 + v)) - f(x + tv),$$

which is certainly defined for vectors $v$ that are collinear with the vector $h_2$ and such that $|v| \le |h_2|$.

We observe that

$$F_t(h_1, h_2) = g(h_2) - g(0).$$

We further observe that, since the function $f : U \to Y$ has a second differential $f''(x)$ at the point $x \in U$, it must be differentiable at least in some neighborhood of $x$. We shall assume that the parameter $t$ is sufficiently small that the arguments on the right-hand side of the equality that defines $F_t(h_1, h_2)$ lie in that neighborhood.

We now make use of these observations and the corollary of the mean-value theorem in the following computations:

$$|F_t(h_1, h_2) - t^2 f''(x)(h_1, h_2)| = |g(h_2) - g(0) - t^2 f''(x)(h_1, h_2)| \le$$

$$\le \sup_{0 < \theta_2 < 1} \|g'(\theta_2 h_2) - t^2 f''(x)h_1\| \, |h_2| =$$

$$= \sup_{0 < \theta_2 < 1} \|(f'(x + t(h_1 + \theta_2 h_2)) - f'(x + t\theta_2 h_2))t - t^2 f''(x)h_1\| \, |h_2|.$$

By definition of the derivative mapping we can write that

$$f'(x + t(h_1 + \theta_2 h_2)) = f'(x) + f''(x)(t(h_1 + \theta_2 h_2)) + o(t)$$

and

$$f'(x + t\theta_2 h_2) = f'(x) + f''(x)(t\theta_2 h_2) + o(t)$$

as $t \to 0$. Taking this relation into account, one can continue the preceding computation, finding after cancellation that

$$|F_t(h_1, h_2) - t^2 f''(x)(h_1, h_2)| = o(t^2)$$

as $t \to 0$. But this equality means that

$$f''(x)(h_1, h_2) = \lim_{t \to 0} \frac{F_t(h_1, h_2)}{t^2}.$$

Since it is obvious that $F_t(h_1, h_2) = F_t(h_2, h_1)$, it follows from this relation that $f''(x)(h_1, h_2) = f''(x)(h_2, h_1)$.

One can now complete the proof of the proposition by induction, repeating verbatim what was said in the proof that the values of the mixed partial derivatives are independent of the order of differentiation. □

Thus we have shown that the $n$th differential of the mapping (10.58) at the point $x \in U$ is a symmetric $n$-linear transformation

$$f^{(n)}(x) \in \mathcal{L}(TX_x, \ldots, TX_x; TY_{f(x)}) \sim \mathcal{L}(X, \ldots, X; Y)$$

whose value on the set $(h_1, \ldots, h_n)$ of vectors $h_i \in TX_x \sim X$, $i = 1, \ldots, n$, can be computed by formula (10.67).

If $X$ is a finite-dimensional space having a basis $\{e_1, \ldots, e_k\}$ and $h_j = h_j^i e_i$ is the expansion of the vector $h_j$, $j = 1, \ldots, n$, with respect to that basis, then by the multilinearity of $f^{(n)}(x)$ we can write

$$f^{(n)}(x)(h_1, \ldots, h_n) = f^{(n)}(x)(h_1^{i_1} e_{i_1}, \ldots, h_n^{i_n} e_{i_n}) = f^{(n)}(x)(e_{i_1}, \ldots, e_{i_n}) h_1^{i_1} \cdots h_n^{i_n}.$$

Using our earlier notation $\partial_{i_1 \ldots i_n} f(x)$ for $D_{e_{i_1}} \cdots D_{e_{i_n}} f(x)$, we find finally that

$$f^{(n)}(x)(h_1, \ldots, h_n) = \partial_{i_1 \ldots i_n} f(x) h_1^{i_1} \cdots h_n^{i_n},$$

where as usual summation extends over the repeated indices on the right-hand side within their range of variation, that is, from 1 to $k$.

Let us agree to use the following abbreviation:

$$f^{(n)}(x)(h, \ldots, h) =: f^{(n)}(x)h^n. \tag{10.68}$$

In particular, if we are discussing a finite-dimensional space $X$ and $h = h^i e_i$, then

$$f^{(n)}(x)h^n = \partial_{i_1 \ldots i_n} f(x) h^{i_1} \cdots h^{i_n},$$

which is already very familiar to us from the theory of numerical functions of several variables.
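
The limit $f''(x)(h_1, h_2) = \lim_{t \to 0} F_t(h_1, h_2)/t^2$ from the proof of symmetry, together with the coordinate formula for $n = 2$, can be watched at work on a concrete function. The sketch below assumes $f(x, y) = e^x \cos y$ at the origin of $\mathbb{R}^2$ and uses hand-computed second partial derivatives; the function names are hypothetical and the chosen values of $t$ are arbitrary.

```python
import math

# The symmetric second difference F_t(h1, h2) divided by t^2 approaches
# the second differential f''(x)(h1, h2) as t -> 0.

def f(p):
    x, y = p
    return math.exp(x) * math.cos(y)

def second_difference(x, h1, h2, t):
    """F_t(h1, h2) = f(x + t(h1 + h2)) - f(x + t h1) - f(x + t h2) + f(x)."""
    def shifted(v):
        return (x[0] + t * v[0], x[1] + t * v[1])
    both = (h1[0] + h2[0], h1[1] + h2[1])
    return f(shifted(both)) - f(shifted(h1)) - f(shifted(h2)) + f(x)

def second_differential(x, h1, h2):
    """Coordinate formula: sum over i, j of d_ij f(x) h1^i h2^j."""
    ex, c, s = math.exp(x[0]), math.cos(x[1]), math.sin(x[1])
    fxx, fxy, fyy = ex * c, -ex * s, -ex * c
    return (fxx * h1[0] * h2[0]
            + fxy * (h1[0] * h2[1] + h1[1] * h2[0])
            + fyy * h1[1] * h2[1])

x0 = (0.0, 0.0)
h1 = (1.0, 2.0)
h2 = (3.0, 1.0)
steps = (1e-1, 1e-2, 1e-3)

exact = second_differential(x0, h1, h2)
ratios = [second_difference(x0, h1, h2, t) / t ** 2 for t in steps]
errors = [abs(r - exact) for r in ratios]

for t, r in zip(steps, ratios):
    print(f"t = {t:g}: F_t/t^2 = {r:.6f}, f''(x)(h1, h2) = {exact:.6f}")
```

Since $F_t(h_1, h_2)$ is visibly unchanged when $h_1$ and $h_2$ are swapped, the convergence of the quotient is exactly what forces the symmetry of $f''(x)$.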


10.5.4 Some Remarks 

In connection with the notation (10.68) consider the following example, which is 
quite useful and will be used in the next section. 

Example Let $A \in \mathcal{L}(X_1, \ldots, X_n; Y)$, that is, $y = A(x_1, \ldots, x_n)$ is a continuous $n$-linear transformation from the product of the normed vector spaces $X_1, \ldots, X_n$ into the normed vector space $Y$.

It was shown in Example 5 of Sect. 10.3 that $A$ is a differentiable mapping $A : X_1 \times \cdots \times X_n \to Y$ and

$$A'(x_1, \ldots, x_n)(h_1, \ldots, h_n) = A(h_1, x_2, \ldots, x_n) + \cdots + A(x_1, \ldots, x_{n-1}, h_n).$$

Thus, if $X_1 = \cdots = X_n = X$ and $A$ is symmetric, then

$$A'(x, \ldots, x)(h, \ldots, h) = nA(\underbrace{x, \ldots, x}_{n-1}, h) =: (nAx^{n-1})h.$$

Hence, if we consider the function $F : X \to Y$ defined by the condition

$$F(x) = A(x, \ldots, x) =: Ax^n,$$

it turns out to be differentiable and

$$F'(x)h = (nAx^{n-1})h,$$

that is, in this case

$$F'(x) = nAx^{n-1},$$

where $Ax^{n-1} := A(\underbrace{x, \ldots, x}_{n-1}, \cdot)$.

In particular, if the mapping (10.58) has a differential $f^{(n)}(x)$ at a point $x \in U$, then the function $F(h) = f^{(n)}(x)h^n$ is differentiable, and

$$F'(h) = n f^{(n)}(x)h^{n-1}. \tag{10.69}$$
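
The rule $F'(x)h = (nAx^{n-1})h$ can be tested on a small case. The sketch below assumes $n = 3$ and $X = \mathbb{R}^2$, builds a symmetric trilinear form $A$ by averaging arbitrary illustrative coefficients over all permutations of the indices, and compares a central-difference approximation of $F'(x)h$ for $F(x) = A(x, x, x)$ with $3A(x, x, h)$. The coefficients and all names are hypothetical.

```python
from itertools import permutations, product

# F(x) = A(x, x, x) for a symmetric trilinear form A on R^2;
# check F'(x)h = 3 A(x, x, h) by a central difference.

DIM = 2

def symmetrize(raw):
    """Average the coefficients over all permutations of the three indices."""
    sym = {}
    for idx in product(range(DIM), repeat=3):
        perms = list(permutations(idx))
        sym[idx] = sum(raw[p] for p in perms) / len(perms)
    return sym

# Arbitrary (non-symmetric) coefficients, used only for illustration
RAW = {idx: 1.0 + idx[0] + 2.0 * idx[1] * idx[2]
       for idx in product(range(DIM), repeat=3)}
T = symmetrize(RAW)

def A(x, y, z):
    """Symmetric trilinear form with coefficient tensor T."""
    return sum(T[(i, j, k)] * x[i] * y[j] * z[k]
               for i, j, k in product(range(DIM), repeat=3))

def F(x):
    return A(x, x, x)

def derivative_fd(x, h, eps=1e-5):
    """Central-difference approximation of F'(x)h."""
    plus = tuple(a + eps * b for a, b in zip(x, h))
    minus = tuple(a - eps * b for a, b in zip(x, h))
    return (F(plus) - F(minus)) / (2.0 * eps)

x = (0.7, -1.2)
h = (0.4, 0.9)

numeric = derivative_fd(x, h)
formula = 3.0 * A(x, x, h)

print(numeric, formula)
```

The symmetry of $A$ matters here: for a non-symmetric form the three terms $A(h, x, x)$, $A(x, h, x)$, $A(x, x, h)$ would have to be kept separate.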

To conclude our discussion of the concept of an $n$th-order derivative, it is useful to add the remark that if the original function (10.58) is defined on a set $U$ in a space $X$ that is the direct product of normed spaces $X_1, \ldots, X_m$, one can speak of the first-order partial derivatives $\partial_1 f(x), \ldots, \partial_m f(x)$ of $f$ with respect to the variables $x_i \in X_i$, $i = 1, \ldots, m$, and the higher-order partial derivatives $\partial_{i_1 \ldots i_n} f(x)$.

On the basis of Theorem 2 of Sect. 10.4, we obtain by induction in this case that if all the partial derivatives $\partial_{i_1 \ldots i_n} f(x)$ of a mapping $f : U \to Y$ are continuous at a point $x \in X = X_1 \times \cdots \times X_m$, then the mapping $f$ has an $n$th order differential $f^{(n)}(x)$ at that point.

If we also take account of the result of Example 2 from the same section, we can conclude that the mapping $U \ni x \mapsto f^{(n)}(x) \in \mathcal{L}(\underbrace{X, \ldots, X}_{n \text{ factors}}; Y)$ is continuous if and only if all the $n$th-order partial derivatives $U \ni x \mapsto \partial_{i_1 \ldots i_n} f(x) \in \mathcal{L}(X_{i_1}, \ldots, X_{i_n}; Y)$ of the original mapping $f : U \to Y$ are continuous (or, what is the same, the partial derivatives of all orders up to $n$ inclusive are continuous).

The class of mappings (10.58) having continuous derivatives up to order $n$ inclusive in $U$ is denoted $C^{(n)}(U, Y)$, or, where no confusion can arise, by the briefer symbol $C^{(n)}(U)$ or even $C^{(n)}$.

In particular, if $X = X_1 \times \cdots \times X_m$, the conclusion reached above can be written in abbreviated form as

$$(f \in C^{(n)}) \iff (\partial_{i_1 \ldots i_n} f \in C,\ i_1, \ldots, i_n = 1, \ldots, m),$$

where $C$, as always, denotes the corresponding set of continuous functions.


10.5.5 Problems and Exercises

1. Carry out the proof of Eq. (10.64) in full.

2. Give the details at the end of the proof that $f^{(n)}(x)$ is symmetric.

3. a) Show that if the functions $D_{h_1} D_{h_2} f$ and $D_{h_2} D_{h_1} f$ are defined and continuous at a point $x \in U$ for a pair of vectors $h_1, h_2$ and the mapping (10.58) in the domain $U$, then the equality $D_{h_1} D_{h_2} f(x) = D_{h_2} D_{h_1} f(x)$ holds.

b) Show using the example of a numerical function $f(x, y)$ that, although the continuity of the mixed partial derivatives $\frac{\partial^2 f}{\partial x \partial y}$ and $\frac{\partial^2 f}{\partial y \partial x}$ implies by a) that they are equal at this point, it does not in general imply that the second differential of the function exists at the point.

c) Show that, although the existence of $f^{(2)}(x, y)$ guarantees that the mixed partial derivatives $\frac{\partial^2 f}{\partial x \partial y}$ and $\frac{\partial^2 f}{\partial y \partial x}$ exist and are equal, it does not in general guarantee that they are continuous at that point.

4. Let $A \in \mathcal{L}(X, \ldots, X; Y)$ where $A$ is a symmetric $n$-linear transformation. Find the successive derivatives of the function $x \mapsto Ax^n := A(x, \ldots, x)$ up to order $n + 1$ inclusive.


10.6 Taylor’s Formula and the Study of Extrema 
10.6.1 Taylor’s Formula for Mappings


Theorem 1 If a mapping $f : U \to Y$ from a neighborhood $U = U(x)$ of a point $x$ in a normed space $X$ into a normed space $Y$ has derivatives up to order $n - 1$ inclusive in $U$ and has an $n$th order derivative $f^{(n)}(x)$ at the point $x$, then

$$f(x + h) = f(x) + f'(x)h + \cdots + \frac{1}{n!} f^{(n)}(x)h^n + o(|h|^n) \tag{10.70}$$

as $h \to 0$.

Equality (10.70) is one of the varieties of Taylor’s formula, written here for rather general classes of mappings.

Proof We prove Taylor’s formula by induction.

For $n = 1$ it is true by definition of $f'(x)$.

Assume formula (10.70) is true for some $(n - 1) \in \mathbb{N}$.

Then by the mean-value theorem, formula (10.69) of Sect. 10.5, and the induction hypothesis, we obtain

$$\left| f(x + h) - \left( f(x) + f'(x)h + \cdots + \frac{1}{n!} f^{(n)}(x)h^n \right) \right| \le$$

$$\le \sup_{0 < \theta < 1} \left\| f'(x + \theta h) - \left( f'(x) + f''(x)(\theta h) + \cdots + \frac{1}{(n-1)!} f^{(n)}(x)(\theta h)^{n-1} \right) \right\| |h| = o(|\theta h|^{n-1})|h| = o(|h|^n)$$

as $h \to 0$. □

We shall not take the time here to discuss other versions of Taylor’s formula, which are sometimes quite useful. They were discussed earlier in detail for numerical functions. At this point we leave it to the reader to derive them (see, for example, Problem 1 below).
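
The content of the remainder term $o(|h|^n)$ in (10.70) can be seen numerically. The following sketch makes assumptions purely for illustration: $f(x, y) = e^x \sin y$ on $\mathbb{R}^2$, expansion at the origin, $n = 2$, and increments $h = (s, s)$ with $s$ shrinking. Here $f(0) = 0$, $f'(0)h = h_y$, and $f''(0)h^2 = 2h_x h_y$, all computed by hand; the function names are hypothetical.

```python
import math

# Second-order Taylor remainder of f(x, y) = exp(x) * sin(y) at the origin:
# the ratio |f(h) - T_2(h)| / |h|^2 should tend to 0 as h -> 0.

def f(p):
    x, y = p
    return math.exp(x) * math.sin(y)

def taylor2(h):
    """f(0) + f'(0)h + (1/2) f''(0)h^2 = h_y + h_x * h_y."""
    hx, hy = h
    return hy + hx * hy

def remainder_ratio(h):
    """|f(h) - T_2(h)| divided by |h|^2 (Euclidean norm)."""
    return abs(f(h) - taylor2(h)) / (h[0] ** 2 + h[1] ** 2)

scales = (1e-1, 1e-2, 1e-3)
ratios = [remainder_ratio((s, s)) for s in scales]

for s, r in zip(scales, ratios):
    print(f"|h| = {math.hypot(s, s):.2e}: remainder / |h|^2 = {r:.3e}")
```

The ratio decreases roughly in proportion to $|h|$, which reflects the fact that this particular $f$ is smooth, so that its remainder is actually $O(|h|^3)$; the theorem itself only promises $o(|h|^2)$.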


10.6.2 Methods of Studying Interior Extrema 

Using Taylor’s formula, we shall exhibit necessary conditions and also sufficient 
conditions for an interior local extremum of real-valued functions defined on an 
open subset of a normed space. As we shall see, these conditions are analogous to 
the differential conditions already known to us for an extremum of a real-valued 
function of a real variable. 

Theorem 2 Let f : U — > R. be a real-valued function defined on an open set U in a 
normed space X and having continuous derivatives up to order k — 1 > 1 inclusive 
in a neighborhood of a point x £ U and a derivative (x) of order k at the point 
x itself. 

If f(x) = 0, . . . , f^-^Hx) — 0 and /®(x) ^ 0, then for x to be an extremum 
of the function f it is: 

necessary that k be even and that the form f^ k fx)h k be semidefinitef and 
sufficient that the values of the form f^ k \x)h k on the unit sphere \h\ = 1 be 
bounded away from zero ; moreover, x is a local minimum if the inequalities 

f (k) {x)h k > 8 > 0 

hold on that sphere, and a local maximum if 

f (k) (x)h k < S < 0. 


4 This means that the form f^\x)h k cannot take on values of opposite signs, although it may 
vanish for some values h f 0. The equality f^fx) = 0, as usual, is understood to mean that 
f^ l \x)h = 0 for every vector h . 
Proof For the proof we consider the Taylor expansion (10.70) of $f$ in a neighborhood
of $x$. The assumptions enable us to write

$$f(x+h) - f(x) = \frac{1}{k!} f^{(k)}(x)h^k + \alpha(h)|h|^k,$$

where $\alpha(h)$ is a real-valued function, and $\alpha(h) \to 0$ as $h \to 0$.

We first prove the necessary conditions. 

Since $f^{(k)}(x) \ne 0$, there exists a vector $h_0 \ne 0$ on which $f^{(k)}(x)h_0^k \ne 0$. Then
for values of the real parameter $t$ sufficiently close to zero,

$$f(x+th_0) - f(x) = \frac{1}{k!} f^{(k)}(x)(th_0)^k + \alpha(th_0)|th_0|^k =$$

$$= \Bigl( \frac{1}{k!} f^{(k)}(x)h_0^k \pm \alpha(th_0)|h_0|^k \Bigr) t^k,$$

and the expression in the outer parentheses has the same sign as $f^{(k)}(x)h_0^k$.

For x to be an extremum it is necessary for the left-hand side (and hence also the 
right-hand side) of this last equality to be of constant sign when t changes sign. But 
this is possible only if k is even. 

This reasoning shows that if $x$ is an extremum, then the sign of the difference
$f(x+th_0) - f(x)$ is the same as that of $f^{(k)}(x)h_0^k$ for sufficiently small $t$; hence
in that case there cannot be two vectors $h_0$, $h_1$ at which the form $f^{(k)}(x)$ assumes
values with opposite signs.

We now turn to the proof of the sufficiency conditions. For definiteness we consider
the case when $f^{(k)}(x)h^k \ge \delta > 0$ for $|h| = 1$. Then

$$f(x+h) - f(x) = \frac{1}{k!} f^{(k)}(x)h^k + \alpha(h)|h|^k =$$

$$= \Bigl( \frac{1}{k!} f^{(k)}(x)\Bigl(\frac{h}{|h|}\Bigr)^k + \alpha(h) \Bigr)|h|^k \ge \Bigl( \frac{1}{k!}\delta + \alpha(h) \Bigr)|h|^k,$$

and, since $\alpha(h) \to 0$ as $h \to 0$, the last term in this inequality is positive for all
vectors $h \ne 0$ sufficiently close to zero. Thus, for all such vectors $h$,

$$f(x+h) - f(x) > 0,$$

that is, x is a strict local minimum. 

The sufficient condition for a strict local maximum is verified similarly. □ 

Remark 1 If the space $X$ is finite-dimensional, the unit sphere $S(x, 1)$ with center at
$x \in X$, being a closed bounded subset of $X$, is compact. Then the continuous function
$f^{(k)}(x)h^k = \partial_{i_1 \ldots i_k} f(x)\, h^{i_1} \cdots h^{i_k}$ (a $k$-form) has both a maximal and a minimal
value on $S(x, 1)$. If these values are of opposite sign, then $f$ does not have an
extremum at $x$. If they are both of the same sign, then, as was shown in Theorem 2,
there is an extremum. In the latter case, a sufficient condition for an extremum can
obviously be stated as the equivalent requirement that the form $f^{(k)}(x)h^k$ be either
positive- or negative-definite.

It was this form of the condition that we encountered in studying real-valued
functions on $\mathbb{R}^n$.

Remark 2 As we have seen in the example of functions $f : \mathbb{R}^n \to \mathbb{R}$, the
semidefiniteness of the form $f^{(k)}(x)h^k$ exhibited in the necessary conditions for an
extremum is not a sufficient criterion for an extremum.
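
A small worked illustration of Remark 2 may help. The function below is chosen here purely for illustration and is not taken from the text. Its second differential at the origin is semidefinite (it vanishes on the vector $h = (0, 1)$), yet the function changes sign along the $x^2$-axis, so the origin is not an extremum.

```latex
f(x^1, x^2) = (x^1)^2 + (x^2)^3, \qquad f'(0) = 0, \qquad
f''(0)h^2 = 2\,(h^1)^2 \ge 0 ,
\\
f(0, t) - f(0, 0) = t^3
\quad \text{is negative for } t < 0 \text{ and positive for } t > 0 .
```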

Remark 3 In practice, when studying extrema of differentiable functions one normally
uses only the first or second differentials. If the uniqueness and type of extremum
are obvious from the meaning of the problem being studied, one can restrict
attention to the first differential when seeking an extremum, simply finding the point
$x$ where $f'(x) = 0$.


10.6.3 Some Examples 

Example 1 Let $L \in C^{(1)}(\mathbb{R}^3, \mathbb{R})$ and $f \in C^{(1)}([a,b], \mathbb{R})$. In other words,
$(u^1, u^2, u^3) \mapsto L(u^1, u^2, u^3)$ is a continuously differentiable real-valued function
defined in $\mathbb{R}^3$ and $x \mapsto f(x)$ a smooth real-valued function defined on the closed
interval $[a,b] \subset \mathbb{R}$.

Consider the function

$$F : C^{(1)}([a,b], \mathbb{R}) \to \mathbb{R} \tag{10.71}$$

defined by the relation

$$C^{(1)}([a,b], \mathbb{R}) \ni f \mapsto F(f) = \int_a^b L\bigl(x, f(x), f'(x)\bigr)\,dx \in \mathbb{R}. \tag{10.72}$$

Thus, (10.71) is a real-valued functional defined on the set of functions
$f \in C^{(1)}([a,b], \mathbb{R})$.

The basic variational principles connected with motion are known in physics 
and mechanics. According to these principles, the actual motions are distinguished 
among all the conceivable motions in that they proceed along trajectories along 
which certain functionals have an extremum. Questions connected with the extrema 
of functionals are central in optimal control theory. Thus, finding and studying the 
extrema of functionals is a problem of intrinsic importance, and the theory associ- 
ated with it is the subject of a large area of analysis - the calculus of variations. 
We have already done a few things to make the transition from the analysis of the 
extrema of numerical functions to the problem of finding and studying extrema of 
functionals seem natural to the reader. However, we shall not go deeply into the 
special problems of variational calculus, but rather use the example of the func- 
tional (10.72) to illustrate only the general ideas of differentiation and study of local 
extrema considered above. 
We shall show that the functional (10.72) is a differentiable mapping and find its
differential.

We remark that the function (10.72) can be regarded as the composition of the
mappings

$$F_1 : C^{(1)}([a,b], \mathbb{R}) \to C([a,b], \mathbb{R}) \tag{10.73}$$

defined by the formula

$$F_1(f)(x) = L\bigl(x, f(x), f'(x)\bigr) \tag{10.74}$$

followed by the mapping

$$C([a,b], \mathbb{R}) \ni g \mapsto F_2(g) = \int_a^b g(x)\,dx \in \mathbb{R}. \tag{10.75}$$

By properties of the integral, the mapping $F_2$ is obviously linear and continuous,
so that its differentiability is clear.

We shall show that the mapping $F_1$ is also differentiable, and that

$$F_1'(f)h(x) = \partial_2 L\bigl(x, f(x), f'(x)\bigr)h(x) + \partial_3 L\bigl(x, f(x), f'(x)\bigr)h'(x) \tag{10.76}$$

for $h \in C^{(1)}([a,b], \mathbb{R})$.

Indeed, by the corollary to the mean-value theorem, we can write in the present
case

$$\Bigl| L(u^1+\Delta^1, u^2+\Delta^2, u^3+\Delta^3) - L(u^1, u^2, u^3) - \sum_{i=1}^{3} \partial_i L(u^1, u^2, u^3)\Delta^i \Bigr| \le$$

$$\le \sup_{0<\theta<1} \bigl\| \bigl( \partial_1 L(u+\theta\Delta) - \partial_1 L(u),\ \partial_2 L(u+\theta\Delta) - \partial_2 L(u),\ \partial_3 L(u+\theta\Delta) - \partial_3 L(u) \bigr) \bigr\| \cdot |\Delta| \le$$

$$\le 3 \max_{\substack{0 \le \theta \le 1 \\ i=1,2,3}} \bigl| \partial_i L(u+\theta\Delta) - \partial_i L(u) \bigr| \cdot \max_{i=1,2,3} |\Delta^i|, \tag{10.77}$$

where $u = (u^1, u^2, u^3)$ and $\Delta = (\Delta^1, \Delta^2, \Delta^3)$.

If we now recall that the norm $|f|_{C^{(1)}}$ of the function $f$ in $C^{(1)}([a,b], \mathbb{R})$ is
$\max\{|f|_C, |f'|_C\}$ (where $|f|_C$ is the maximum absolute value of the function on the
closed interval $[a,b]$), then, setting $u^1 = x$, $u^2 = f(x)$, $u^3 = f'(x)$, $\Delta^1 = 0$, $\Delta^2 = h(x)$,
and $\Delta^3 = h'(x)$, we obtain from inequality (10.77), taking account of the
uniform continuity of the functions $\partial_i L(u^1, u^2, u^3)$, $i = 1, 2, 3$, on bounded subsets
of $\mathbb{R}^3$, that

$$\max_{a \le x \le b} \bigl| L\bigl(x, f(x)+h(x), f'(x)+h'(x)\bigr) - L\bigl(x, f(x), f'(x)\bigr) -$$

$$- \partial_2 L\bigl(x, f(x), f'(x)\bigr)h(x) - \partial_3 L\bigl(x, f(x), f'(x)\bigr)h'(x) \bigr| =$$

$$= o\bigl(|h|_{C^{(1)}}\bigr) \quad \text{as } |h|_{C^{(1)}} \to 0.$$
But this means that Eq. (10.76) holds.

By the chain rule for differentiating a composite function, we now conclude that
the functional (10.72) is indeed differentiable, and

$$F'(f)h = \int_a^b \Bigl( \partial_2 L\bigl(x, f(x), f'(x)\bigr)h(x) + \partial_3 L\bigl(x, f(x), f'(x)\bigr)h'(x) \Bigr)\,dx. \tag{10.78}$$
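
Formula (10.78) can be checked numerically on a concrete case. The sketch below is only an illustration under stated assumptions: the Lagrangian `L`, the point `f = sin`, the increment `h(x) = x**2`, the interval `[0, 1]` and the midpoint quadrature are all choices made here, not taken from the text. It compares the integral on the right-hand side of (10.78) with a difference quotient of the functional (10.72) along the direction `h`.

```python
import math

def integrate(g, a, b, n=2000):
    """Composite midpoint rule for the integral of g over [a, b]."""
    step = (b - a) / n
    return step * sum(g(a + (i + 0.5) * step) for i in range(n))

# Hypothetical Lagrangian L(u1, u2, u3) = u1*u2**2 + u3**3 and its partials.
def L(u1, u2, u3):
    return u1 * u2**2 + u3**3

def d2L(u1, u2, u3):   # partial derivative in the second argument
    return 2.0 * u1 * u2

def d3L(u1, u2, u3):   # partial derivative in the third argument
    return 3.0 * u3**2

a, b = 0.0, 1.0
f, df = math.sin, math.cos          # the point f and its derivative f'
h = lambda x: x * x                 # the increment h
dh = lambda x: 2.0 * x              # its derivative h'

def F(eps):
    """Value of the functional (10.72) at f + eps*h."""
    return integrate(lambda x: L(x, f(x) + eps * h(x), df(x) + eps * dh(x)), a, b)

# Right-hand side of (10.78).
variation = integrate(
    lambda x: d2L(x, f(x), df(x)) * h(x) + d3L(x, f(x), df(x)) * dh(x), a, b)

eps = 1e-5
quotient = (F(eps) - F(-eps)) / (2.0 * eps)   # central difference in eps
print(variation, quotient)
```

The two printed numbers should agree to many digits, which is what differentiability of $F$ with differential (10.78) predicts for this particular choice of data.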

We often consider the restriction of the functional (10.72) to the affine space
consisting of the functions $f \in C^{(1)}([a,b], \mathbb{R})$ that assume fixed values $f(a) = A$,
$f(b) = B$ at the endpoints of the closed interval $[a,b]$. In this case, the functions $h$
in the tangent space $TC_f^{(1)}$ must have the value zero at the endpoints of the closed
interval $[a,b]$. Taking this fact into account, we may integrate by parts in (10.78)
and bring it into the form

$$F'(f)h = \int_a^b \Bigl( \partial_2 L\bigl(x, f(x), f'(x)\bigr) - \frac{d}{dx}\,\partial_3 L\bigl(x, f(x), f'(x)\bigr) \Bigr) h(x)\,dx, \tag{10.79}$$

of course under the assumption that $L$ and $f$ belong to the corresponding class $C^{(2)}$.

In particular, if $f$ is an extremum (extremal) of such a functional, then by
Theorem 2 we have $F'(f)h = 0$ for every function $h \in C^{(1)}([a,b], \mathbb{R})$ such that
$h(a) = h(b) = 0$. From this and relation (10.79) one can easily conclude (see Problem 3
below) that the function $f$ must satisfy the equation

$$\partial_2 L\bigl(x, f(x), f'(x)\bigr) - \frac{d}{dx}\,\partial_3 L\bigl(x, f(x), f'(x)\bigr) = 0. \tag{10.80}$$

This is a frequently encountered form of the equation known in the calculus of
variations as the Euler-Lagrange equation.

Let us now consider some specific examples. 

Example 2 (The shortest-path problem) Among all the curves in a plane joining two 
fixed points, find the curve that has minimal length. 

The answer in this case is obvious, and it rather serves as a check on the formal 
computations we will be doing later. 

We shall assume that a fixed Cartesian coordinate system has been chosen in
the plane, in which the two points are, for example, $(0, 0)$ and $(1, 0)$. We confine
ourselves to just the curves that are the graphs of functions $f \in C^{(1)}([0,1], \mathbb{R})$
assuming the value zero at both ends of the closed interval $[0, 1]$. The length of such
a curve

$$F(f) = \int_0^1 \sqrt{1 + (f')^2(x)}\,dx \tag{10.81}$$

depends on the function $f$ and is a functional of the type considered in Example 1.
In this case the function $L$ has the form

$$L(u^1, u^2, u^3) = \sqrt{1 + (u^3)^2},$$
and therefore the necessary condition (10.80) for an extremal here reduces to the
equation

$$\frac{d}{dx}\Bigl( \frac{f'(x)}{\sqrt{1 + (f')^2(x)}} \Bigr) = 0,$$

from which it follows that

$$\frac{f'(x)}{\sqrt{1 + (f')^2(x)}} = \text{const} \tag{10.82}$$

on the closed interval $[0, 1]$.

Since the function $\dfrac{u}{\sqrt{1+u^2}}$ is not constant on any interval, Eq. (10.82) is possible
only if $f'(x) = \text{const}$ on $[0, 1]$. Thus a smooth extremal of this problem must be a
linear function whose graph passes through the points $(0, 0)$ and $(1, 0)$. It follows
that $f(x) \equiv 0$, and we arrive at the closed line segment joining the two given
points.
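
As a sanity check on this conclusion, the length functional (10.81) can be evaluated numerically for the extremal and for a few competing curves. This is a minimal sketch: the competitors $f(x) = \varepsilon \sin \pi x$, the values of $\varepsilon$ and the midpoint quadrature are assumptions made here for illustration.

```python
import math

def length(df, n=2000):
    """Midpoint-rule approximation of (10.81) for a curve with derivative df on [0, 1]."""
    step = 1.0 / n
    return step * sum(math.sqrt(1.0 + df((i + 0.5) * step) ** 2) for i in range(n))

# Derivative of the extremal f(x) = 0.
straight = length(lambda x: 0.0)

# Derivatives of competitors f(x) = eps*sin(pi*x), which also vanish at both endpoints.
bumps = {eps: length(lambda x, e=eps: e * math.pi * math.cos(math.pi * x))
         for eps in (0.5, 0.1, 0.01)}

print(straight, bumps)
```

Every competitor comes out longer than the straight segment, and the excess shrinks as $\varepsilon \to 0$, in agreement with the extremal being a minimum.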

Example 3 (The brachistochrone problem) The classical brachistochrone problem,
posed by Johann Bernoulli I in 1696, was to find the shape of a track along which
a point mass would pass from a prescribed point $P_0$ to another fixed point $P_1$ at a
lower level under the action of gravity in the shortest time.

We neglect friction, of course. In addition, we shall assume that the trivial case
in which both points lie on the same vertical line is excluded.

In the vertical plane passing through the points $P_0$ and $P_1$ we introduce a rectangular
coordinate system such that $P_0$ is at the origin, the $x$-axis is directed vertically
downward, and the point $P_1$ has positive coordinates $(x_1, y_1)$. We shall find
the shape of the track among the graphs of smooth functions defined on the closed
interval $[0, x_1]$ and satisfying the condition $f(0) = 0$, $f(x_1) = y_1$. At the moment
we shall not take time to discuss this by no means uncontroversial assumption (see
Problem 4 below).

If the particle began its descent from the point $P_0$ with zero velocity, the law of
variation of its velocity in these coordinates can be written as

$$v = \sqrt{2gx}. \tag{10.83}$$

Recalling that the differential of the arc length is computed by the formula

$$ds = \sqrt{(dx)^2 + (dy)^2} = \sqrt{1 + (f')^2(x)}\,dx, \tag{10.84}$$

we find the time of descent

$$F(f) = \frac{1}{\sqrt{2g}} \int_0^{x_1} \sqrt{\frac{1 + (f')^2(x)}{x}}\,dx \tag{10.85}$$

along the trajectory defined by the graph of the function $y = f(x)$ on the closed
interval $[0, x_1]$.
For the functional (10.85)

$$L(u^1, u^2, u^3) = \sqrt{\frac{1 + (u^3)^2}{u^1}},$$

and therefore the condition (10.80) for an extremum reduces in this case to the
equation

$$\frac{d}{dx}\Bigl( \frac{f'(x)}{\sqrt{x\bigl(1 + (f')^2(x)\bigr)}} \Bigr) = 0,$$

from which it follows that

$$\frac{f'(x)}{\sqrt{1 + (f')^2(x)}} = c\sqrt{x}, \tag{10.86}$$

where $c$ is a nonzero constant, since the points are not both on the same vertical line.
Taking account of (10.84), we can rewrite (10.86) in the form

$$\frac{dy}{ds} = c\sqrt{x}. \tag{10.87}$$

However, from the geometric point of view

$$\frac{dx}{ds} = \cos\varphi, \qquad \frac{dy}{ds} = \sin\varphi, \tag{10.88}$$

where $\varphi$ is the angle between the tangent to the trajectory and the positive $x$-axis.
By comparing Eq. (10.87) with the second equation in (10.88), we find

$$x = \frac{1}{c^2}\sin^2\varphi. \tag{10.89}$$

But it follows from (10.88) and (10.89) that

$$\frac{dy}{d\varphi} = \frac{dy}{dx} \cdot \frac{dx}{d\varphi} = \tan\varphi\,\frac{dx}{d\varphi} = \tan\varphi\,\frac{d}{d\varphi}\Bigl(\frac{\sin^2\varphi}{c^2}\Bigr) = 2\,\frac{\sin^2\varphi}{c^2},$$

from which we find

$$y = \frac{1}{2c^2}(2\varphi - \sin 2\varphi) + b. \tag{10.90}$$

Setting $\dfrac{1}{2c^2} =: a$ and $2\varphi =: t$, we write relations (10.89) and (10.90) as

$$x = a(1 - \cos t), \qquad y = a(t - \sin t) + b. \tag{10.91}$$

Since $a \ne 0$, it follows that $x = 0$ only for $t = 2k\pi$, $k \in \mathbb{Z}$. It follows from the
form of the function (10.91) that we may assume without loss of generality that the
parameter value $t = 0$ corresponds to the point $P_0 = (0, 0)$. In this case Eq. (10.90)
implies $b = 0$, and we arrive at the simpler form

$$x = a(1 - \cos t), \qquad y = a(t - \sin t) \tag{10.92}$$

for the parametric definition of this curve.

Thus the brachistochrone is a cycloid having a cusp at the initial point $P_0$ where
the tangent is vertical. The constant $a$, which is a scaling coefficient, must be chosen
so that the curve (10.92) also passes through the point $P_1$. Such a choice, as one can
see by sketching the curve (10.92), is by no means always unique, and this shows
that the necessary condition (10.80) for an extremum is in general not sufficient.
However, from physical considerations it is clear which of the possible values of
the parameter $a$ should be preferred (and this, of course, can be confirmed by direct
computation).
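
The choice of $a$ and the resulting descent time can be illustrated numerically. The following is a sketch under explicit assumptions: the endpoint $P_1 = (1, 1)$, the value of `G`, and the restriction to the first arch $0 < t < 2\pi$ of the cycloid (10.92) are choices made here; the helper names are hypothetical. Along (10.92) one has $ds/v = \sqrt{a/g}\,dt$, so the descent time is $\sqrt{a/g}\,t_1$, which is compared with the time along the straight chord $P_0P_1$.

```python
import math

G = 9.81  # gravitational acceleration in m/s^2 (an assumed value)

def cycloid_through(x1, y1, tol=1e-12):
    """Find (a, t1) with x1 = a(1 - cos t1), y1 = a(t1 - sin t1), 0 < t1 < 2*pi.

    The ratio (t - sin t)/(1 - cos t) increases from 0 to infinity on ]0, 2*pi[,
    so bisection locates t1; only the first arch of the cycloid is considered.
    """
    ratio = y1 / x1
    lo, hi = 0.0, 2.0 * math.pi
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        # Compare without dividing, to avoid a zero denominator near t = 0.
        if mid - math.sin(mid) < ratio * (1.0 - math.cos(mid)):
            lo = mid
        else:
            hi = mid
    t1 = 0.5 * (lo + hi)
    return x1 / (1.0 - math.cos(t1)), t1

def time_on_cycloid(a, t1):
    # Along (10.92), ds / v = sqrt(a / G) dt, so the descent time is sqrt(a / G) * t1.
    return math.sqrt(a / G) * t1

def time_on_line(x1, y1):
    # Uniform acceleration G * x1 / length along the straight chord from P0 to P1.
    length = math.hypot(x1, y1)
    return length * math.sqrt(2.0 / (G * x1))

x1, y1 = 1.0, 1.0                 # P1: one unit down (x) and one unit across (y)
a, t1 = cycloid_through(x1, y1)
print(a, t1, time_on_cycloid(a, t1), time_on_line(x1, y1))
```

For this endpoint the cycloid time is smaller than the chord time, as expected. Other arches of the cycloid passing through $P_1$ (values $t_1 > 2\pi$) are deliberately ignored in this sketch.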


10.6.4 Problems and Exercises 

1. Let $f : U \to Y$ be a mapping of class $C^{(n)}(U; Y)$ from an open set $U$ in a normed
space $X$ into a normed space $Y$. Suppose the closed interval $[x, x+h]$ is entirely
contained in $U$, that $f$ has a differential of order $(n+1)$ at the points of the open
interval $]x, x+h[$, and that $\|f^{(n+1)}(\xi)\| \le M$ at every point $\xi \in\, ]x, x+h[$.

a) Show that the function

$$g(t) = f(x+th) - \Bigl( f(x) + f'(x)(th) + \cdots + \frac{1}{n!} f^{(n)}(x)(th)^n \Bigr)$$

is defined on the closed interval $[0, 1] \subset \mathbb{R}$ and differentiable on the open interval
$]0, 1[$, and that the estimate

$$\|g'(t)\| \le \frac{1}{n!} M |th|^n |h|$$

holds for every $t \in\, ]0, 1[$.

b) Show that $|g(1) - g(0)| \le \frac{1}{(n+1)!} M |h|^{n+1}$.

c) Prove the following version of Taylor's formula:

$$\Bigl| f(x+h) - \Bigl( f(x) + f'(x)h + \cdots + \frac{1}{n!} f^{(n)}(x)h^n \Bigr) \Bigr| \le \frac{M}{(n+1)!} |h|^{n+1}.$$

d) What can be said about the mapping $f : U \to Y$ if it is known that
$f^{(n+1)}(x) \equiv 0$ in $U$?

2. a) If a symmetric $n$-linear operator $A$ is such that $Ax^n = 0$ for every vector
$x \in X$, then $A(x_1, \ldots, x_n) \equiv 0$, that is, $A$ equals zero on every set $x_1, \ldots, x_n$ of
vectors in $X$.

b) If a mapping $f : U \to Y$ has an $n$th-order differential $f^{(n)}(x)$ at a point $x \in U$
and satisfies the condition

$$f(x+h) = L_0 + L_1 h + \cdots + \frac{1}{n!} L_n h^n + \alpha(h)|h|^n,$$

where $L_i$, $i = 0, 1, \ldots, n$, are $i$-linear operators, and $\alpha(h) \to 0$ as $h \to 0$, then
$L_i = f^{(i)}(x)$, $i = 0, 1, \ldots, n$.

c) Show that the existence of the expansion for $f$ given in the preceding problem
does not in general imply the existence of the $n$th-order differential $f^{(n)}(x)$ (for
$n > 1$) for the function at the point $x$.

d) Prove that the mapping $\mathcal{L}(X; Y) \ni A \mapsto A^{-1} \in \mathcal{L}(Y; X)$ is infinitely
differentiable in its domain of definition, and that $(A^{-1})^{(n)}(A)(h_1, \ldots, h_n) =
(-1)^n A^{-1}h_1 A^{-1}h_2 \cdots A^{-1}h_n A^{-1}$.

3. a) Let $\varphi \in C([a,b], \mathbb{R})$. Show that if the condition

$$\int_a^b \varphi(x)h(x)\,dx = 0$$

holds for every function $h \in C^{(2)}([a,b], \mathbb{R})$ such that $h(a) = h(b) = 0$, then
$\varphi(x) \equiv 0$ on $[a,b]$.

b) Derive the Euler-Lagrange equation (10.80) as a necessary condition for
an extremum of the functional (10.72) restricted to the set of functions
$f \in C^{(2)}([a,b], \mathbb{R})$ assuming prescribed values at the endpoints of the closed interval
$[a,b]$.

4. Find the shape $y = f(x)$, $a \le x \le b$, of a meridian of the surface of revolution
(about the $x$-axis) having minimal area among all surfaces of revolution having
circles of prescribed radius $r_a$ and $r_b$ as their sections by the planes $x = a$ and $x = b$
respectively.

5. a) The function L in the brachistochrone problem does not satisfy the condi- 
tions of Example 1, so that we cannot justify a direct application of the results of 
Example 1 in this case. Show by repeating the derivation of formula (10.79) with 
necessary modifications that this equation and Eq. (10.80) remain valid in this case. 

b) Does the equation of the brachistochrone change if the particle starts from the
point $P_0$ with a nonzero initial velocity (the motion is frictionless in a closed pipe)?

c) Show that if $P$ is an arbitrary point of the brachistochrone corresponding
to the pair of points $P_0$, $P_1$, the arc of that brachistochrone from $P_0$ to $P$ is the
brachistochrone of the pair $P_0$, $P$.

d) The assumption that the brachistochrone corresponding to a pair of points
$P_0$, $P_1$ can be written as $y = f(x)$ is not always justified, as was revealed by the
final formulas (10.92). Show by using the result of c) that the derivation of (10.92)
can be carried out without any such assumption as to the global structure of the
brachistochrone.

e) Locate a point $P_1$ such that the brachistochrone corresponding to the pair of
points $P_0$, $P_1$ in the coordinate system introduced in Example 3 cannot be written
in the form $y = f(x)$.

f) Locate a point $P_1$ such that the brachistochrone corresponding to the pair
of points $P_0$, $P_1$ in the coordinate system introduced in Example 3 has the form
$y = f(x)$, and $f \notin C^{(1)}([a,b], \mathbb{R})$. Thus it turns out that in this case the functional
(10.85) we are interested in has a greatest lower bound on the set $C^{(1)}([a,b], \mathbb{R})$,
but not a minimum.

g) Show that the brachistochrone of a pair of points $P_0$, $P_1$ of space is a smooth
curve.

6. Let us measure the distance $d(P_0, P_1)$ of the point $P_0$ of space from the point $P_1$
in a homogeneous gravitational field by the time required for a point mass to move
from one point to the other along the brachistochrone corresponding to the points.

a) Find the distance from the point $P_0$ to a fixed vertical line, measured in this
sense.

b) Find the asymptotic behavior of the function $d(P_0, P_1)$ as the point $P_1$ is
raised along a vertical line, approaching the height of the point $P_0$.

c) Determine whether the function $d(P_0, P_1)$ is a metric.


10.7 The General Implicit Function Theorem 

In this concluding section of the chapter we shall illustrate practically all of the
machinery we have developed by studying an implicitly defined function. The reader
already has some idea of the content of the implicit function theorem, its place in
analysis, and its applications from Chap. 8. For that reason, we shall not go into detail
here with preliminary explanations of the essence of the matter preceding the formalism.
We note only that this time the implicitly defined function will be constructed by an
entirely different method, one that relies on the contraction mapping principle. This
method is often used in analysis and is quite useful because of its computational
efficiency.

Theorem Let $X$, $Y$, and $Z$ be normed spaces (for example, $\mathbb{R}^m$, $\mathbb{R}^n$, and $\mathbb{R}^k$), $Y$ being
a complete space. Let $W = \{(x, y) \in X \times Y \mid |x - x_0| < \alpha \wedge |y - y_0| < \beta\}$ be a
neighborhood of the point $(x_0, y_0)$ in the product $X \times Y$ of the spaces $X$ and $Y$.
Suppose that the mapping $F : W \to Z$ satisfies the following conditions:

1. $F(x_0, y_0) = 0$;

2. $F(x, y)$ is continuous at $(x_0, y_0)$;

3. $F'_y(x, y)$ is defined in $W$ and continuous at $(x_0, y_0)$;

4. $F'_y(x_0, y_0)$ is an invertible⁵ transformation.

Then there exists a neighborhood $U = U(x_0)$ of $x_0 \in X$, a neighborhood $V = V(y_0)$
of $y_0 \in Y$, and a mapping $f : U \to V$ such that:

1'. $U \times V \subset W$;

2'. ($F(x, y) = 0$ in $U \times V$) $\Leftrightarrow$ ($y = f(x)$, where $x \in U$ and $f(x) \in V$);

3'. $y_0 = f(x_0)$;

4'. $f$ is continuous at $x_0$.

In essence, this theorem asserts that if the linear mapping $F'_y$ is invertible at a
point (hypothesis 4), then in a neighborhood of this point the relation $F(x, y) = 0$
is equivalent to the functional dependence $y = f(x)$ (conclusion 2').

⁵ That is, $\exists\, [F'_y(x_0, y_0)]^{-1} \in \mathcal{L}(Z; Y)$.

Proof 1° To simplify the notation and obviously with no loss of generality, we may
assume that $x_0 = 0$, $y_0 = 0$, and consequently

$$W = \{(x, y) \in X \times Y \mid |x| < \alpha \wedge |y| < \beta\}.$$

2° The main role in the proof is played by the auxiliary family of functions

$$g_x(y) := y - \bigl(F'_y(0, 0)\bigr)^{-1} \cdot F(x, y), \tag{10.93}$$

which depend on the parameter $x \in X$, $|x| < \alpha$, and are defined on the set
$\{y \in Y \mid |y| < \beta\}$.


Let us discuss formula (10.93). We first determine whether the mappings $g_x$ are
unambiguously defined and where their values lie.

The mapping $F$ is defined for $(x, y) \in W$, and its value $F(x, y)$ at the pair $(x, y)$
lies in $Z$. The partial derivative $F'_y(x, y)$ at any point $(x, y) \in W$, as we know, is a
continuous linear mapping from $Y$ into $Z$.

By hypothesis 4 the mapping $F'_y(0, 0) : Y \to Z$ has a continuous inverse
$(F'_y(0, 0))^{-1} : Z \to Y$. Hence the composition $(F'_y(0, 0))^{-1} \cdot F(x, y)$ really is
defined, and its values lie in $Y$.

Thus, for any $x$ in the $\alpha$-neighborhood $B_X(0, \alpha) := \{x \in X \mid |x| < \alpha\}$ of the point
$0 \in X$, the function $g_x$ is a mapping $g_x : B_Y(0, \beta) \to Y$ from the $\beta$-neighborhood
$B_Y(0, \beta) := \{y \in Y \mid |y| < \beta\}$ of the point $0 \in Y$ into $Y$.

The connection of the mappings (10.93) with the problem of solving the equation
$F(x, y) = 0$ for $y$ obviously consists of the following: the point $y_x$ is a fixed point
of $g_x$ if and only if $F(x, y_x) = 0$.

Let us state this important observation firmly:

$$g_x(y_x) = y_x \iff F(x, y_x) = 0. \tag{10.94}$$

Thus, finding and studying the implicitly defined function $y = y_x = f(x)$ reduces
to finding the fixed points of the mappings (10.93) and studying the way in which
they depend on the parameter $x$.
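
The reduction to fixed points also suggests a way of computing the implicit function. The sketch below is an illustration only: the equation $x^2 + y^2 - 1 = 0$, the base point $(x_0, y_0) = (0, 1)$ and the fixed number of iterations are assumptions chosen here. It iterates the mapping (10.93), written without shifting the base point to the origin, and compares the limit with the explicit solution $y = \sqrt{1 - x^2}$.

```python
import math

def F(x, y):
    """Hypothetical example: the unit circle x^2 + y^2 - 1 = 0 near (x0, y0) = (0, 1)."""
    return x * x + y * y - 1.0

X0, Y0 = 0.0, 1.0
FY_AT_BASE = 2.0 * Y0          # F'_y(x0, y0); nonzero, hence invertible

def g(x, y):
    """The auxiliary mapping (10.93), without moving (x0, y0) to the origin."""
    return y - F(x, y) / FY_AT_BASE

def implicit_value(x, steps=60):
    """Iterate y -> g_x(y) starting from y0; the limit is the fixed point y_x = f(x)."""
    y = Y0
    for _ in range(steps):
        y = g(x, y)
    return y

x = 0.3
y = implicit_value(x)
print(y, math.sqrt(1.0 - x * x), F(x, y))
```

Note that only the single inverse $(F'_y(x_0, y_0))^{-1}$ is used throughout the iteration. In this example the derivative of $g_x$ in $y$ equals $1 - y$, which is small near $y_0 = 1$, so the iteration is a contraction there.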

3° We shall show that there exists a positive number $\gamma < \min\{\alpha, \beta\}$ such that for
each $x \in X$ satisfying the condition $|x| < \gamma < \alpha$, the mapping $g_x : B_Y(0, \gamma) \to Y$ of
the ball $B_Y(0, \gamma) := \{y \in Y \mid |y| < \gamma < \beta\}$ into $Y$ is a contraction with a coefficient
of contraction that does not exceed, say, 1/2. Indeed, for each fixed $x \in B_X(0, \alpha)$
the mapping $g_x : B_Y(0, \beta) \to Y$ is differentiable, as follows from hypothesis 3 and
the theorem on differentiation of a composite mapping. Moreover,

$$g'_x(y) = e_Y - \bigl(F'_y(0, 0)\bigr)^{-1} \cdot \bigl(F'_y(x, y)\bigr) = \bigl(F'_y(0, 0)\bigr)^{-1}\bigl(F'_y(0, 0) - F'_y(x, y)\bigr). \tag{10.95}$$

By the continuity of $F'_y(x, y)$ at the point $(0, 0)$ (hypothesis 3), there exists a
neighborhood $\{(x, y) \in X \times Y \mid |x| < \gamma < \alpha \wedge |y| < \gamma < \beta\}$ of $(0, 0) \in X \times Y$ in
which

$$\|g'_x(y)\| \le \bigl\|\bigl(F'_y(0, 0)\bigr)^{-1}\bigr\| \cdot \bigl\|F'_y(0, 0) - F'_y(x, y)\bigr\| < \frac{1}{2}. \tag{10.96}$$

Here we are using the relation

$\bigl(F'_y(0, 0)\bigr)^{-1} \in \mathcal{L}(Z; Y)$, that is, $\bigl\|\bigl(F'_y(0, 0)\bigr)^{-1}\bigr\| < \infty$.

Throughout the following we shall assume that $|x| < \gamma$ and $|y| < \gamma$, so that
estimate (10.96) holds.

Thus, at any $x \in B_X(0, \gamma)$ and for any $y_1, y_2 \in B_Y(0, \gamma)$, by the mean-value
theorem, we indeed now find that

$$|g_x(y_1) - g_x(y_2)| \le \sup_{\xi \in\, ]y_1, y_2[} \|g'_x(\xi)\|\,|y_1 - y_2| < \frac{1}{2}|y_1 - y_2|. \tag{10.97}$$

4°. In order to assert the existence of a fixed point y x for the mapping g x , we 
need a complete metric space that maps into (but not necessarily onto) itself under 
this mapping. 

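We shall verify that for any $\varepsilon$ satisfying $0 < \varepsilon < \gamma$ there exists $\delta = \delta(\varepsilon)$ in the
open interval $]0, \gamma[$ such that for any $x \in B_X(0, \delta)$ the mapping $g_x$ maps the closed
ball $\bar{B}_Y(0, \varepsilon)$ into itself, that is, $g_x(\bar{B}_Y(0, \varepsilon)) \subset \bar{B}_Y(0, \varepsilon)$.

Indeed, we first choose a number $\delta \in\, ]0, \gamma[$ depending on $\varepsilon$ such that

$$|g_x(0)| = \bigl|\bigl(F'_y(0, 0)\bigr)^{-1} \cdot F(x, 0)\bigr| \le \bigl\|\bigl(F'_y(0, 0)\bigr)^{-1}\bigr\|\,|F(x, 0)| < \frac{1}{2}\varepsilon \tag{10.98}$$

for $|x| < \delta$.

This can be done by hypotheses 1 and 2, which guarantee that $F(0, 0) = 0$ and
$F(x, y)$ is continuous at $(0, 0)$.

Now if $|x| < \delta(\varepsilon) < \gamma$ and $|y| \le \varepsilon < \gamma$, we find by (10.97) and (10.98) that

$$|g_x(y)| \le |g_x(y) - g_x(0)| + |g_x(0)| < \frac{1}{2}|y| + \frac{1}{2}\varepsilon \le \varepsilon,$$

and hence for $|x| < \delta(\varepsilon)$

$$g_x\bigl(\bar{B}_Y(0, \varepsilon)\bigr) \subset B_Y(0, \varepsilon). \tag{10.99}$$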
Being a closed subset of the complete metric space $Y$, the closed ball $\bar{B}_Y(0, \varepsilon)$ is
itself a complete metric space.

5° Comparing relations (10.97) and (10.99), we can now assert by the fixed-point
principle (Sect. 9.7) that for each $x \in B_X(0, \delta(\varepsilon)) =: U$ there exists a unique
point $y = y_x =: f(x) \in B_Y(0, \varepsilon) =: V$ that is a fixed point of the mapping
$g_x : \bar{B}_Y(0, \varepsilon) \to \bar{B}_Y(0, \varepsilon)$.

By the basic relation (10.94), it follows from this that the function $f : U \to V$
so constructed has property 2' and hence also property 3', since $F(0, 0) = 0$ by
hypothesis 1.

Property 1' of the neighborhoods $U$ and $V$ follows from the fact that, by
construction, $U \times V \subset B_X(0, \alpha) \times B_Y(0, \beta) = W$.

Finally, the continuity of the function $y = f(x)$ at $x = 0$, that is, property 4',
follows from 2' and the fact that, as was shown in part 4° of the proof, for every
$\varepsilon > 0$ ($\varepsilon < \gamma$) there exists $\delta(\varepsilon) > 0$ ($\delta(\varepsilon) < \gamma$) such that $g_x(\bar{B}_Y(0, \varepsilon)) \subset B_Y(0, \varepsilon)$
for any $x \in B_X(0, \delta(\varepsilon))$, that is, the unique fixed point $y_x = f(x)$ of the mapping
$g_x : \bar{B}_Y(0, \varepsilon) \to \bar{B}_Y(0, \varepsilon)$ satisfies the condition $|f(x)| < \varepsilon$ for $|x| < \delta(\varepsilon)$. □

We have now proved the existence of the implicit function. We now prove a 
number of extensions of these properties of the function, generated by properties of 
the original function F. 

Extension 1 (Continuity of the implicit function) If in addition to hypotheses 2
and 3 of the theorem it is known that the mappings $F : W \to Z$ and $F'_y$ are continuous
not only at the point $(x_0, y_0)$ but in some neighborhood of this point, then the
function $f : U \to V$ will be continuous not only at $x_0 \in U$ but in some neighborhood
of this point.

Proof By properties of the mapping $\mathcal{L}(Y; Z) \ni A \mapsto A^{-1} \in \mathcal{L}(Z; Y)$ it follows from
hypotheses 3 and 4 of the theorem (see Example 6 of Sect. 10.3) that at each point
$(x, y)$ in some neighborhood of $(x_0, y_0)$ the transformation $F'_y(x, y) \in \mathcal{L}(Y; Z)$ is
invertible. Thus under the additional hypothesis that $F$ is continuous all points $(\tilde{x}, \tilde{y})$
of the form $(\tilde{x}, f(\tilde{x}))$ in some neighborhood of $(x_0, y_0)$ satisfy hypotheses 1-4,
previously satisfied only by the point $(x_0, y_0)$.

Repeating the construction of the implicit function in a neighborhood of these
points $(\tilde{x}, \tilde{y})$, we would obtain a function $y = \tilde{f}(x)$ that is continuous at $\tilde{x}$ and by 2'
would coincide with the function $y = f(x)$ in some neighborhood of $\tilde{x}$. But that
means that $f$ itself is continuous at $\tilde{x}$. □

Extension 2 (Differentiability of the implicit function) If in addition to the hypotheses
of the theorem it is known that a partial derivative $F'_x(x, y)$ exists in some
neighborhood $W$ of $(x_0, y_0)$ and is continuous at $(x_0, y_0)$, then the function $y = f(x)$ is
differentiable at $x_0$, and

$$f'(x_0) = -\bigl(F'_y(x_0, y_0)\bigr)^{-1} \cdot \bigl(F'_x(x_0, y_0)\bigr). \tag{10.100}$$
Proof We verify immediately that the linear transformation $L \in \mathcal{L}(X; Y)$ on the
right-hand side of formula (10.100) is indeed the differential of the function
$y = f(x)$ at $x_0$.

As before, to simplify the notation, we shall assume that $x_0 = 0$ and $y_0 = 0$, so
that $f(0) = 0$.

We begin with a preliminary computation.

$$|f(x) - f(0) - Lx| = |f(x) - Lx| = \bigl| f(x) + \bigl(F'_y(0, 0)\bigr)^{-1} \cdot \bigl(F'_x(0, 0)\bigr)x \bigr| =$$

$$= \bigl| \bigl(F'_y(0, 0)\bigr)^{-1}\bigl(F'_x(0, 0)x + F'_y(0, 0)f(x)\bigr) \bigr| =$$

$$= \bigl| \bigl(F'_y(0, 0)\bigr)^{-1}\bigl(F(x, f(x)) - F(0, 0) - F'_x(0, 0)x - F'_y(0, 0)f(x)\bigr) \bigr| \le$$

$$\le \bigl\| \bigl(F'_y(0, 0)\bigr)^{-1} \bigr\|\,\bigl| F(x, f(x)) - F(0, 0) - F'_x(0, 0)x - F'_y(0, 0)f(x) \bigr| \le$$

$$\le \bigl\| \bigl(F'_y(0, 0)\bigr)^{-1} \bigr\| \cdot \alpha(x, f(x))\bigl(|x| + |f(x)|\bigr),$$

where $\alpha(x, y) \to 0$ as $(x, y) \to (0, 0)$.

These relations have been written taking account of the relation $F(x, f(x)) \equiv 0$
and the fact that the continuity of the partial derivatives $F'_x$ and $F'_y$ at $(0, 0)$ guarantees
the differentiability of the function $F(x, y)$ at that point.

For convenience in writing we set $a := \|L\|$ and $b := \|(F'_y(0, 0))^{-1}\|$.

Taking account of the relations

$$|f(x)| = |f(x) - Lx + Lx| \le |f(x) - Lx| + |Lx| \le |f(x) - Lx| + a|x|,$$

we can extend the preliminary computation just done and obtain the relation

$$|f(x) - Lx| \le b\,\alpha(x, f(x))\bigl((a+1)|x| + |f(x) - Lx|\bigr),$$

or

$$|f(x) - Lx| \le \frac{(a+1)b}{1 - b\,\alpha(x, f(x))}\,\alpha(x, f(x))\,|x|.$$

Since $f$ is continuous at $x = 0$ and $f(0) = 0$, we also have $f(x) \to 0$ as $x \to 0$,
and therefore $\alpha(x, f(x)) \to 0$ as $x \to 0$.

It therefore follows from the last inequality that

$$|f(x) - f(0) - Lx| = |f(x) - Lx| = o(|x|) \quad \text{as } x \to 0. \qquad □$$


Extension 3 (Continuous differentiability of the implicit function) If in addition to
the hypotheses of the theorem it is known that the mapping $F$ has continuous partial
derivatives $F'_x$ and $F'_y$ in some neighborhood $W$ of $(x_0, y_0)$, then the function
$y = f(x)$ is continuously differentiable in some neighborhood of $x_0$, and its derivative
is given by the formula

$$f'(x) = -\bigl(F'_y(x, f(x))\bigr)^{-1} \cdot \bigl(F'_x(x, f(x))\bigr). \tag{10.101}$$

Proof We already know from formula (10.100) that the derivative $f'(x)$ exists and
can be expressed in the form (10.101) at an individual point $x$ at which the
transformation $F'_y(x, f(x))$ is invertible.

It remains to be verified that under the present hypotheses the function $f'(x)$ is
continuous in some neighborhood of $x = x_0$.

The bilinear mapping $(A, B) \mapsto A \cdot B$ - the product of linear transformations $A$
and $B$ - is a continuous function.

The transformation $B = -F'_x(x, f(x))$ is a continuous function of $x$, being the
composition of the continuous functions $x \mapsto (x, f(x)) \mapsto -F'_x(x, f(x))$.

The same can be said about the linear transformation $A^{-1} = F'_y(x, f(x))$.

It remains only to recall (see Example 6 of Sect. 10.3) that the mapping $A^{-1} \mapsto A$
is also continuous in its domain of definition.

Thus the function $f'(x)$ defined by formula (10.101) is continuous in some
neighborhood of $x = x_0$, being the composition of continuous functions. □
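
Formula (10.101) is easy to test in the scalar case $X = Y = Z = \mathbb{R}$. The following sketch rests on assumptions made here for illustration: the equation $x^2 + y^2 - 1 = 0$, its upper branch $f(x) = \sqrt{1 - x^2}$ as the implicit function near $(0, 1)$, and a central difference quotient as an independent check.

```python
import math

def Fx(x, y):      # partial derivative of F(x, y) = x^2 + y^2 - 1 in x
    return 2.0 * x

def Fy(x, y):      # partial derivative of F in y
    return 2.0 * y

def f(x):
    """The implicit function near (0, 1): the upper branch of the unit circle."""
    return math.sqrt(1.0 - x * x)

def derivative_by_formula(x):
    # Scalar case of (10.101): f'(x) = -(F'_y)^{-1} * F'_x at the point (x, f(x)).
    return -Fx(x, f(x)) / Fy(x, f(x))

def derivative_by_difference(x, step=1e-6):
    # Central difference quotient, used only as an independent check.
    return (f(x + step) - f(x - step)) / (2.0 * step)

x = 0.3
print(derivative_by_formula(x), derivative_by_difference(x))
```

Both values approximate $-x/\sqrt{1 - x^2}$, the derivative obtained by differentiating the explicit branch directly.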

We can now summarize and state the following general proposition. 

Proposition If in addition to the hypotheses of the implicit function theorem it
is known that the function $F$ belongs to the class $C^{(k)}(W; Z)$, then the function
$y = f(x)$ defined by the equation $F(x, y) = 0$ belongs to $C^{(k)}(U; Y)$ in some
neighborhood $U$ of $x_0$.

Proof The proposition has already been proved for $k = 0$ and $k = 1$. The general
case can now be obtained by induction from formula (10.101) if we observe that
the mapping $\mathcal{L}(Y; Z) \ni A \mapsto A^{-1} \in \mathcal{L}(Z; Y)$ is (infinitely) differentiable and that
when Eq. (10.101) is differentiated, the right-hand side always contains a derivative
of $f$ one order less than the left-hand side. Thus, successive differentiation of
Eq. (10.101) can be carried out a number of times equal to the order of smoothness of
the function $F$. □

In particular, if

$$f'(x)h_1 = -\bigl(F'_y(x, f(x))\bigr)^{-1} \cdot \bigl(F'_x(x, f(x))\bigr)h_1,$$

then

$$f''(x)(h_1, h_2) = -d\bigl(F'_y(x, f(x))\bigr)^{-1}h_2\,F'_x(x, f(x))h_1 - \bigl(F'_y(x, f(x))\bigr)^{-1} d\bigl(F'_x(x, f(x))h_1\bigr)h_2 =$$

$$= \bigl(F'_y(x, f(x))\bigr)^{-1}\,dF'_y(x, f(x))h_2\,\bigl(F'_y(x, f(x))\bigr)^{-1} F'_x(x, f(x))h_1\; -$$

$$-\ \bigl(F'_y(x, f(x))\bigr)^{-1}\bigl(\bigl(F''_{xx}(x, f(x)) + F''_{xy}(x, f(x))f'(x)\bigr)h_1\bigr)h_2 =$$

$$= \bigl(F'_y(x, f(x))\bigr)^{-1}\bigl(\bigl(F''_{yx}(x, f(x)) + F''_{yy}(x, f(x))f'(x)\bigr)h_2\bigr)\bigl(F'_y(x, f(x))\bigr)^{-1} F'_x(x, f(x))h_1\; -$$

$$-\ \bigl(F'_y(x, f(x))\bigr)^{-1}\bigl(\bigl(F''_{xx}(x, f(x)) + F''_{xy}(x, f(x))f'(x)\bigr)h_1\bigr)h_2.$$

In less detailed, but more readable notation, this means that

$$f''(x)(h_1, h_2) = (F'_y)^{-1}\Bigl[ \bigl((F''_{yx} + F''_{yy}f')h_2\bigr)(F'_y)^{-1}F'_x h_1 - \bigl((F''_{xx} + F''_{xy}f')h_1\bigr)h_2 \Bigr]. \tag{10.102}$$
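
As a check on (10.102), consider the scalar case $X = Y = Z = \mathbb{R}$, where all the operators are numbers and commute. The worked reduction below is a sketch that assumes $F \in C^{(2)}$ (so that $F''_{xy} = F''_{yx}$) and substitutes $f' = -F'_x / F'_y$; the circle $F(x, y) = x^2 + y^2 - 1$ is an example chosen here, not taken from the text.

```latex
f'' = \frac{1}{F'_y}\Bigl[ -\bigl(F''_{yx} + F''_{yy} f'\bigr) f' - \bigl(F''_{xx} + F''_{xy} f'\bigr) \Bigr]
    = -\frac{F''_{xx} + 2F''_{xy} f' + F''_{yy} (f')^2}{F'_y},
\qquad f' = -\frac{F'_x}{F'_y}.
\\
% Example: F(x, y) = x^2 + y^2 - 1, so F'_y = 2y, F''_{xx} = F''_{yy} = 2, F''_{xy} = 0, f' = -x/y.
f'' = -\frac{2 + 2x^2/y^2}{2y} = -\frac{x^2 + y^2}{y^3} = -\frac{1}{y^3}
\quad \text{on the circle } x^2 + y^2 = 1 .
```

This agrees with differentiating $y = \sqrt{1 - x^2}$ twice directly, which gives $-(1 - x^2)^{-3/2}$.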

In this way one could theoretically obtain an expression for the derivative of an
implicit function to any order; however, as can be seen even from formula (10.102),
these expressions are generally too cumbersome to be conveniently used. Let us
now see how these results can be made specific in the important special case when
$X = \mathbb{R}^m$, $Y = \mathbb{R}^n$, and $Z = \mathbb{R}^n$.

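In this case the mapping $z = F(x, y)$ has the coordinate representation

$$\begin{aligned} z^1 &= F^1(x^1, \ldots, x^m, y^1, \ldots, y^n), \\ &\;\;\vdots \\ z^n &= F^n(x^1, \ldots, x^m, y^1, \ldots, y^n). \end{aligned} \tag{10.103}$$

The partial derivatives $F'_x \in \mathcal{L}(\mathbb{R}^m; \mathbb{R}^n)$ and $F'_y \in \mathcal{L}(\mathbb{R}^n; \mathbb{R}^n)$ of the mapping are
defined by the matrices

$$F'_x = \begin{pmatrix} \dfrac{\partial F^1}{\partial x^1} & \cdots & \dfrac{\partial F^1}{\partial x^m} \\ \vdots & \ddots & \vdots \\ \dfrac{\partial F^n}{\partial x^1} & \cdots & \dfrac{\partial F^n}{\partial x^m} \end{pmatrix}, \qquad F'_y = \begin{pmatrix} \dfrac{\partial F^1}{\partial y^1} & \cdots & \dfrac{\partial F^1}{\partial y^n} \\ \vdots & \ddots & \vdots \\ \dfrac{\partial F^n}{\partial y^1} & \cdots & \dfrac{\partial F^n}{\partial y^n} \end{pmatrix},$$

computed at the corresponding point $(x, y)$.

As we know, the condition that $F'_x$ and $F'_y$ be continuous is equivalent to the
continuity of all the entries of these matrices.

The invertibility of the linear transformation $F'_y(x_0, y_0) \in \mathcal{L}(\mathbb{R}^n; \mathbb{R}^n)$ is equivalent
to the nonsingularity of the matrix that defines this transformation.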

Thus, in the present case the implicit function theorem asserts that if 

1) $F^i(x_0^1, \dots, x_0^m, y_0^1, \dots, y_0^n) = 0$, $i = 1, \dots, n$; 

2) $F^i(x^1, \dots, x^m, y^1, \dots, y^n)$, $i = 1, \dots, n$, are continuous functions at the 
point $(x_0^1, \dots, x_0^m, y_0^1, \dots, y_0^n) \in \mathbb{R}^m \times \mathbb{R}^n$; 

3) all the partial derivatives $\frac{\partial F^i}{\partial y^j}(x^1, \dots, x^m, y^1, \dots, y^n)$, $i = 1, \dots, n$, $j = 1, \dots, n$, 
are defined in a neighborhood of $(x_0^1, \dots, x_0^m, y_0^1, \dots, y_0^n)$ and are continuous 
at this point; 

4) the determinant 

\[
\begin{vmatrix}
\frac{\partial F^1}{\partial y^1} & \cdots & \frac{\partial F^1}{\partial y^n} \\
\vdots & \ddots & \vdots \\
\frac{\partial F^n}{\partial y^1} & \cdots & \frac{\partial F^n}{\partial y^n}
\end{vmatrix}
\]

of the matrix $F'_y$ is nonzero at the point $(x_0^1, \dots, x_0^m, y_0^1, \dots, y_0^n)$; 

then there exist a neighborhood $U$ of $x_0 = (x_0^1, \dots, x_0^m) \in \mathbb{R}^m$, a neighborhood $V$ of 
$y_0 = (y_0^1, \dots, y_0^n) \in \mathbb{R}^n$, and a mapping $f : U \to V$ having a coordinate representation 

\[
\begin{aligned}
y^1 &= f^1(x^1, \dots, x^m), \\
&\ \,\vdots \\
y^n &= f^n(x^1, \dots, x^m),
\end{aligned}
\tag{10.104}
\]

such that 

1') inside the neighborhood $U \times V$ of $(x_0^1, \dots, x_0^m, y_0^1, \dots, y_0^n) \in \mathbb{R}^m \times \mathbb{R}^n$ the 
system of equations 

\[
\begin{aligned}
F^1(x^1, \dots, x^m, y^1, \dots, y^n) &= 0, \\
&\ \,\vdots \\
F^n(x^1, \dots, x^m, y^1, \dots, y^n) &= 0
\end{aligned}
\]

is equivalent to the functional relation $f : U \to V$ expressed by (10.104); 

2') $y_0^i = f^i(x_0^1, \dots, x_0^m)$, $i = 1, \dots, n$; 

3') the mapping (10.104) is continuous at the point $x_0 = (x_0^1, \dots, x_0^m)$. 

If in addition it is known that the mapping (10.103) belongs to the class $C^{(k)}$, 
then, as follows from the proposition above, the mapping (10.104) will also belong 
to $C^{(k)}$, of course within its own domain of definition. 
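In this case formula (10.101) can be made specific, becoming the matrix equality 

\[
\begin{pmatrix}
\frac{\partial f^1}{\partial x^1} & \cdots & \frac{\partial f^1}{\partial x^m} \\
\vdots & \ddots & \vdots \\
\frac{\partial f^n}{\partial x^1} & \cdots & \frac{\partial f^n}{\partial x^m}
\end{pmatrix}
= -
\begin{pmatrix}
\frac{\partial F^1}{\partial y^1} & \cdots & \frac{\partial F^1}{\partial y^n} \\
\vdots & \ddots & \vdots \\
\frac{\partial F^n}{\partial y^1} & \cdots & \frac{\partial F^n}{\partial y^n}
\end{pmatrix}^{-1}
\begin{pmatrix}
\frac{\partial F^1}{\partial x^1} & \cdots & \frac{\partial F^1}{\partial x^m} \\
\vdots & \ddots & \vdots \\
\frac{\partial F^n}{\partial x^1} & \cdots & \frac{\partial F^n}{\partial x^m}
\end{pmatrix},
\]

in which the left-hand side is computed at $(x^1, \dots, x^m)$ and the right-hand side 
at the corresponding point $(x^1, \dots, x^m, y^1, \dots, y^n)$, where $y^i = f^i(x^1, \dots, x^m)$, 
$i = 1, \dots, n$. 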

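If $n = 1$, that is, when the equation 

\[ F(x^1, \dots, x^m, y) = 0 \]

is being solved for $y$, the matrix $F'_y$ consists of a single entry: the number 
$\frac{\partial F}{\partial y}(x^1, \dots, x^m, y)$. In this case $y = f(x^1, \dots, x^m)$, and 

\[
\left(\frac{\partial f}{\partial x^1}, \dots, \frac{\partial f}{\partial x^m}\right)
= -\left(\frac{\partial F}{\partial y}\right)^{-1}
\left(\frac{\partial F}{\partial x^1}, \dots, \frac{\partial F}{\partial x^m}\right).
\tag{10.105}
\]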


In this case formula (10.102) also simplifies slightly; more precisely, it can be 
written in the following more symmetric form: 

\[
f''(x)(h_1, h_2) = -\frac{(F''_{xx} + F''_{xy}f')h_1\, F'_y h_2 - (F''_{yx} + F''_{yy}f')h_2\, F'_x h_1}{(F'_y)^2}.
\tag{10.106}
\]


And if $n = 1$ and $m = 1$, then $y = f(x)$ is a real-valued function of one real 
argument, and formulas (10.105) and (10.106) simplify to the maximum extent, 
becoming the numerical equalities 

\[
\begin{aligned}
f'(x) &= -\frac{F'_x}{F'_y}(x, y), \\
f''(x) &= -\frac{(F''_{xx} + F''_{xy}f')F'_y - (F''_{xy} + F''_{yy}f')F'_x}{(F'_y)^2}(x, y)
\end{aligned}
\]

for the first two derivatives of the implicit function defined by the equation 
$F(x, y) = 0$. 
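
As a quick sanity check of the two scalar formulas, the following minimal sketch evaluates them for the circle $F(x, y) = x^2 + y^2 - 1$, where the implicit function $y = \sqrt{1 - x^2}$ is known explicitly. The helper name `implicit_derivatives` and the chosen point are illustrative assumptions, not part of the text.

```python
def implicit_derivatives(Fx, Fy, Fxx, Fxy, Fyy):
    """First and second derivatives of y = f(x) defined by F(x, y) = 0.

    The arguments are the partial derivatives of F evaluated at one point
    (x, y) on the curve; F'_y must be nonzero there.
    """
    if Fy == 0:
        raise ValueError("F'_y must be nonzero at the point")
    d1 = -Fx / Fy
    d2 = -((Fxx + Fxy * d1) * Fy - (Fxy + Fyy * d1) * Fx) / Fy**2
    return d1, d2


# F(x, y) = x^2 + y^2 - 1 at the point (0.6, 0.8) of the unit circle.
x, y = 0.6, 0.8
d1, d2 = implicit_derivatives(Fx=2 * x, Fy=2 * y, Fxx=2.0, Fxy=0.0, Fyy=2.0)

# Differentiating y = sqrt(1 - x^2) directly gives f' = -x/y and f'' = -1/y^3.
print(d1, -x / y)
print(d2, -1 / y**3)
```

Each printed pair should agree up to rounding, which is what the formulas predict for this particular curve.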


10.7.1 Problems and Exercises 

1. a) Assume that, along with the function $f : U \to Y$ given by the implicit function 
theorem, we have a function $\tilde f : \tilde U \to Y$ defined in some neighborhood $\tilde U$ of $x_0$ 
and satisfying $y_0 = \tilde f(x_0)$ and $F(x, \tilde f(x)) \equiv 0$ in $\tilde U$. Prove that if $\tilde f$ is continuous 
at $x_0$, then the functions $f$ and $\tilde f$ are equal on some neighborhood of $x_0$. 

b) Show that the assertion in a) is generally not true without the assumption that 
$\tilde f$ is continuous at $x_0$. 

2. Analyze once again the proof of the implicit function theorem and the extensions 
to it, and show the following. 

a) If $z = F(x, y)$ is a continuously differentiable complex-valued function of 
the complex variables $x$ and $y$, then the implicit function $y = f(x)$ defined by the 
equation $F(x, y) = 0$ is differentiable with respect to the complex variable $x$. 

b) Under the hypotheses of the theorem X is not required to be a normed space, 
and may be any topological space. 

3. a) Determine whether the form $f''(x)(h_1, h_2)$ defined by relation (10.102) is 
symmetric. 

b) Write the forms (10.101) and (10.102) for the case of numerical functions 
$F(x^1, x^2, y)$ and $F(x, y^1, y^2)$ in matrix form. 

c) Show that if $\mathbb{R} \ni t \mapsto A(t) \in \mathcal{L}(\mathbb{R}^n; \mathbb{R}^n)$ is a family of nonsingular matrices 
$A(t)$ depending on the parameter $t$ in an infinitely smooth manner, then 



where $A^{-1} = A^{-1}(t)$ denotes the inverse of the matrix $A = A(t)$. 

4. a) Show that Extension 1 to the theorem is an immediate corollary of the stability 
conditions for the fixed point of the family of contraction mappings studied in 
Sect. 9.7. 

b) Let $\{A_t : X \to X\}$ be a family of contraction mappings of a complete normed 
space into itself depending on the parameter $t$, which ranges over a domain $\Omega$ in 
a normed space $T$. Show that if $A_t(x) = \varphi(t, x)$ is a function of class $C^{(n)}(\Omega \times X, X)$, 
then the fixed point $x(t)$ of the mapping $A_t$ belongs to class $C^{(n)}(\Omega, X)$ as 
a function of $t$. 

5. a) Using the implicit function theorem, prove the following inverse function theorem. 

Let $g : G \to X$ be a mapping from a neighborhood $G$ of a point $y_0$ in a complete 
normed space $Y$ into a normed space $X$. If 

1° the mapping $x = g(y)$ is differentiable in $G$, 

2° $g'(y)$ is continuous at $y_0$, 

3° $g'(y_0)$ is an invertible transformation, 

then there exist a neighborhood $V \subset Y$ of $y_0$ and a neighborhood $U \subset X$ of $x_0 = g(y_0)$ 
such that $g : V \to U$ is bijective, and its inverse mapping $f : U \to V$ is continuous 
in $U$ and differentiable at $x_0$; moreover, 

\[ f'(x_0) = \bigl(g'(y_0)\bigr)^{-1}. \]

b) Show that if it is known, in addition to the hypotheses given in a), that the 
mapping $g$ belongs to the class $C^{(n)}(V, U)$, then the inverse mapping $f$ belongs to 
$C^{(n)}(U, V)$. 

c) Let $f : \mathbb{R}^n \to \mathbb{R}^n$ be a smooth mapping for which the matrix $f'(x)$ is nonsingular 
at every point $x \in \mathbb{R}^n$ and satisfies the inequality $\|(f')^{-1}(x)\| < C$ with a 
constant $C$ that is independent of $x$. Show that $f$ is a bijective mapping. 

d) Using your experience in solving c), try to give an estimate for the radius 
of a spherical neighborhood $U = B(x_0, r)$ centered at $x_0$ in which the mapping 
$f : U \to V$ studied in the inverse function theorem is necessarily defined. 

6. a) Show that if the linear mappings $A \in \mathcal{L}(X; Y)$ and $B \in \mathcal{L}(X; \mathbb{R})$ are such that 
$\ker A \subset \ker B$ (here $\ker$, as usual, denotes the kernel of a transformation), then there 
exists a linear mapping $\lambda \in \mathcal{L}(Y; \mathbb{R})$ such that $B = \lambda \cdot A$. 

b) Let $X$ and $Y$ be normed spaces and $f : X \to \mathbb{R}$ and $g : X \to Y$ smooth functions 
on $X$ with values in $\mathbb{R}$ and $Y$ respectively. Let $S$ be the smooth surface defined 
in $X$ by the equation $g(x) = y_0$. Show that if $x_0 \in S$ is an extremum of the function 
$f|_S$, then any vector $h$ tangent to $S$ at $x_0$ simultaneously satisfies two conditions: 
$f'(x_0)h = 0$ and $g'(x_0)h = 0$. 

c) Prove that if $x_0 \in S$ is an extremum of the function $f|_S$, then $f'(x_0) = \lambda \cdot g'(x_0)$, 
where $\lambda \in \mathcal{L}(Y; \mathbb{R})$. 

d) Show how the classical Lagrange necessary condition for an extremum with 
constraint of a function on a smooth surface in $\mathbb{R}^n$ follows from the preceding result. 

7. As is known, the equation $z^n + c_1 z^{n-1} + \cdots + c_n = 0$ with complex coefficients 
has in general $n$ distinct complex roots. Show that the roots of the equation are 
smooth functions of the coefficients, at least where all the roots are distinct. 

8. a) Following Hadamard, prove that a continuous locally invertible mapping 
$f : \mathbb{R}^n \to \mathbb{R}^n$ is globally invertible (i.e., it is bijective) if and only if $f(x) \to \infty$ as 
$x \to \infty$. Convince yourself that we can consider here any normed space instead 
of $\mathbb{R}^n$. How should we interpret (or reformulate) Hadamard's conditions if we now 
consider the image of $\mathbb{R}^n$ or a normed space under a homeomorphism? 

b) Let $F : X \times Y \to Z$ be a continuous mapping defined on the direct product of 
the normed spaces $X$ and $Y$. Show that the equation $F(x, y) = 0$ is solvable globally 
with respect to $y$ (in the sense that the local continuous solution $y = f(x)$ 
extends as such to the whole space $X$) exactly when the following two conditions 
are fulfilled: the equation has a continuous solution in a neighborhood of every 
point $(x_0, y_0)$ satisfying $F(x_0, y_0) = 0$; and in the pair $(x, y)$ satisfying the equation 
$F(x, y) = 0$, the second coordinate can tend to infinity (changing continuously) 
only if the first coordinate in its space also tends to infinity. 

c) Following John,⁶ show that if a continuous locally invertible mapping 
$f : B \to H$ from the unit ball $B$ to a normed space $H$ is such that locally (at every point 
of the ball), it changes the element of length no more than $k > 1$ times (expanding 
or contracting), then in the ball of radius $k^{-2}$, this mapping is injective. (Caution: 
an infinite-dimensional normed space can be isometrically embedded into itself as 
a proper subspace through a shift of coordinates, but this mapping is not invertible 
or locally invertible. It is invertible as a mapping only on its image.) 


6 F. John (1910-1994), German-born and later a famous American mathematician, student of 
R. Courant. 


Chapter 11 

Multiple Integrals 


11.1 The Riemann Integral over an n-Dimensional Interval 

11.1.1 Definition of the Integral 
a. Intervals in $\mathbb{R}^n$ and Their Measure 

Definition 1 The set $I = \{x \in \mathbb{R}^n \mid a^i \le x^i \le b^i,\ i = 1, \dots, n\}$ is called an interval 
or a coordinate parallelepiped in $\mathbb{R}^n$. 

If we wish to note that the interval is determined by the points $a = (a^1, \dots, a^n)$ 
and $b = (b^1, \dots, b^n)$, we often denote it $I_{a,b}$, or, by analogy with the one-dimensional 
case, we write it as $a \le x \le b$. 

Definition 2 To the interval $I = \{x \in \mathbb{R}^n \mid a^i \le x^i \le b^i,\ i = 1, \dots, n\}$ we assign the 
number $|I| := \prod_{i=1}^{n}(b^i - a^i)$, called the volume or measure of the interval. 

The volume (measure) of the interval $I$ is also denoted $v(I)$ and $\mu(I)$. 

Lemma 1 The measure of an interval in $\mathbb{R}^n$ has the following properties. 

a) It is homogeneous, that is, if $\lambda I_{a,b} := I_{\lambda a, \lambda b}$, where $\lambda \ge 0$, then 

\[ |\lambda I_{a,b}| = \lambda^n |I_{a,b}|. \]

b) It is additive, that is, if the intervals $I, I_1, \dots, I_k$ are such that $I = \bigcup_{i=1}^{k} I_i$ 
and no two of the intervals $I_1, \dots, I_k$ have common interior points, then 
$|I| = \sum_{i=1}^{k} |I_i|$. 

c) If the interval $I$ is covered by a finite system of intervals $I_1, \dots, I_k$, that is, 
$I \subset \bigcup_{i=1}^{k} I_i$, then $|I| \le \sum_{i=1}^{k} |I_i|$. 

All these assertions follow easily from Definitions 1 and 2. 
b. Partitions of an Interval and a Base in the Set of Partitions 

Suppose we are given an interval $I = \{x \in \mathbb{R}^n \mid a^i \le x^i \le b^i,\ i = 1, \dots, n\}$. 
Partitions of the coordinate intervals $[a^i, b^i]$, $i = 1, \dots, n$, induce a partition of the 
interval $I$ into finer intervals obtained as the direct products of the intervals of the 
partitions of the coordinate intervals. 

Definition 3 The representation of the interval $I$ (as the union $I = \bigcup_{j=1}^{k} I_j$ of finer 
intervals $I_j$) just described will be called a partition of the interval $I$, and will be 
denoted by $P$. 

Definition 4 The quantity $\lambda(P) := \max_{1 \le j \le k} d(I_j)$ (the maximum among the diameters 
of the intervals of the partition $P$) is called the mesh of the partition $P$. 

Definition 5 If in each interval $I_j$ of the partition $P$ we fix a point $\xi_j \in I_j$, we say 
that we have a partition with distinguished points. 

The set $\{\xi_1, \dots, \xi_k\}$, as before, will be denoted by the single letter $\xi$, and the 
partition with distinguished points by $(P, \xi)$. 

In the set $\mathcal{P} = \{(P, \xi)\}$ of partitions with distinguished points on an interval $I$ we 
introduce the base $\lambda(P) \to 0$ whose elements $B_d$ ($d > 0$), as in the one-dimensional 
case, are defined by $B_d := \{(P, \xi) \in \mathcal{P} \mid \lambda(P) < d\}$. 

The fact that $\mathcal{B} = \{B_d\}$ really is a base follows from the existence of partitions of 
mesh arbitrarily close to zero. 


c. Riemann Sums and the Integral 

Let $f : I \to \mathbb{R}$ be a real-valued¹ function on the interval $I$ and $P = \{I_1, \dots, I_k\}$ a 
partition of this interval with distinguished points $\xi = \{\xi_1, \dots, \xi_k\}$. 

Definition 6 The sum 

\[ \sigma(f, P, \xi) := \sum_{i=1}^{k} f(\xi_i)|I_i| \]

is called the Riemann sum of the function $f$ corresponding to the partition of the 
interval $I$ with distinguished points $(P, \xi)$. 

Definition 7 The quantity 

\[ \int_I f(x)\,dx := \lim_{\lambda(P) \to 0} \sigma(f, P, \xi), \]

provided this limit exists, is called the Riemann integral of the function $f$ over the 
interval $I$. 

¹ Please note that in the following definitions one could assume that the values of $f$ lie in any 
normed vector space. For example, it might be the space $\mathbb{C}$ of complex numbers or the spaces $\mathbb{R}^n$ 
and $\mathbb{C}^n$. 

We see that this definition, and in general the whole process of constructing the 
integral over the interval $I \subset \mathbb{R}^n$, is a verbatim repetition of the procedure of defining 
the Riemann integral over a closed interval of the real line, which is already familiar 
to us. To highlight the resemblance we have even retained the previous notation 
$f(x)\,dx$ for the differential form. Equivalent, but more expanded notations for the 
integral are the following: 

\[ \int_I f(x^1, \dots, x^n)\,dx^1 \cdots dx^n \quad \text{or} \quad \int \cdots \int_I f(x^1, \dots, x^n)\,dx^1 \cdots dx^n. \]

To emphasize that we are discussing an integral over a multidimensional domain 
$I$ we say that this is a multiple integral (double, triple, and so forth, depending on 
the dimension of $I$). 
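
To make the definition concrete, here is a minimal sketch of a Riemann sum over an interval in $\mathbb{R}^n$. It assumes a uniform partition (each coordinate interval is cut into the same number of equal pieces) and takes the centre of each cell as the distinguished point; both choices, as well as the function name `riemann_sum`, are illustrative assumptions, since the definition allows any partition and any distinguished points.

```python
from itertools import product


def riemann_sum(f, a, b, parts):
    """Riemann sum of f over the interval {a[i] <= x[i] <= b[i]} in R^n.

    Each coordinate interval [a[i], b[i]] is split into `parts` equal pieces;
    the centre of every cell serves as its distinguished point.
    """
    n = len(a)
    steps = [(b[i] - a[i]) / parts for i in range(n)]
    cell_volume = 1.0
    for h in steps:
        cell_volume *= h
    total = 0.0
    for idx in product(range(parts), repeat=n):
        xi = [a[i] + (idx[i] + 0.5) * steps[i] for i in range(n)]
        total += f(xi) * cell_volume
    return total


# f(x, y) = x^2 * y on [0, 1] x [0, 2]; the exact double integral is 2/3.
f = lambda p: p[0] ** 2 * p[1]
for parts in (2, 8, 32):
    print(parts, riemann_sum(f, [0.0, 0.0], [1.0, 2.0], parts))
```

As the mesh of the partition shrinks, the printed sums should approach 2/3, in line with Definition 7.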


d. A Necessary Condition for Integrability 

Definition 8 If the finite limit in Definition 7 exists for a function $f : I \to \mathbb{R}$, then 
$f$ is Riemann integrable over the interval $I$. 

We shall denote the set of all such functions by $\mathcal{R}(I)$. 

We now verify the following elementary necessary condition for integrability. 

Proposition 1 $f \in \mathcal{R}(I) \Rightarrow f$ is bounded on $I$. 

Proof Let $P$ be an arbitrary partition of the interval $I$. If the function $f$ is unbounded 
on $I$, then it must be unbounded on some interval $I_{i_0}$ of the partition $P$. 
If $(P, \xi')$ and $(P, \xi'')$ are partitions $P$ with distinguished points such that $\xi'$ and $\xi''$ 
differ only in the choice of the points $\xi'_{i_0}$ and $\xi''_{i_0}$, then 

\[ |\sigma(f, P, \xi') - \sigma(f, P, \xi'')| = |f(\xi'_{i_0}) - f(\xi''_{i_0})|\,|I_{i_0}|. \]

By changing one of the points $\xi'_{i_0}$ and $\xi''_{i_0}$, as a result of the unboundedness of $f$ 
in $I_{i_0}$ we could make the right-hand side of this equality arbitrarily large. By the 
Cauchy criterion, it follows from this that the Riemann sums of $f$ do not have a 
limit as $\lambda(P) \to 0$. □ 
11.1.2 The Lebesgue Criterion for Riemann Integrability 

When studying the Riemann integral in the one-dimensional case, we acquainted the 
reader (without proof) with the Lebesgue criterion for the existence of the Riemann 
integral. We shall now recall certain concepts and prove this criterion. 


a. Sets of Measure Zero in $\mathbb{R}^n$ 

Definition 9 A set $E \subset \mathbb{R}^n$ has (n-dimensional) measure zero or is a set of measure 
zero (in the Lebesgue sense) if for every $\varepsilon > 0$ there exists a covering of $E$ by an 
at most countable system $\{I_i\}$ of n-dimensional intervals for which the sum of the 
volumes $\sum_i |I_i|$ does not exceed $\varepsilon$. 

Lemma 2 a) A point and a finite set of points are sets of measure zero. 

b) The union of a finite or countable number of sets of measure zero is a set of 
measure zero. 

c) A subset of a set of measure zero is itself of measure zero. 

d) A nondegenerate² interval $I_{a,b} \subset \mathbb{R}^n$ is not a set of measure zero. 

The proof of Lemma 2 does not differ from the proof of its one-dimensional 
version considered in Sect. 6.1.3, paragraph d. Hence we shall not give the details. 

Example 1 The set of rational points in $\mathbb{R}^n$ (points all of whose coordinates are 
rational numbers) is countable and hence is a set of measure zero. 

Example 2 Let $f : I \to \mathbb{R}$ be a continuous real-valued function defined on an 
(n − 1)-dimensional interval $I \subset \mathbb{R}^{n-1}$. We shall show that its graph in $\mathbb{R}^n$ is a 
set of n-dimensional measure zero. 

Proof Since the function $f$ is uniformly continuous on $I$, for $\varepsilon > 0$ we find $\delta > 0$ 
such that $|f(x_1) - f(x_2)| < \varepsilon$ for any two points $x_1, x_2 \in I$ such that $|x_1 - x_2| < \delta$. 
If we now take a partition $P$ of the interval $I$ with mesh $\lambda(P) < \delta$, then on each 
interval $I_i$ of this partition the oscillation of the function is less than $\varepsilon$. Hence, 
if $x_i$ is an arbitrary fixed point of the interval $I_i$, the n-dimensional interval 
$\tilde I_i = I_i \times [f(x_i) - \varepsilon, f(x_i) + \varepsilon]$ obviously contains the portion of the graph of the function 
lying over the interval $I_i$, and the union $\bigcup_i \tilde I_i$ covers the whole graph of the function 
over $I$. But $\sum_i |\tilde I_i| = \sum_i |I_i| \cdot 2\varepsilon = 2\varepsilon|I|$ (here $|I_i|$ is the volume of $I_i$ in $\mathbb{R}^{n-1}$ and 
$|\tilde I_i|$ the volume of $\tilde I_i$ in $\mathbb{R}^n$). Thus, by decreasing $\varepsilon$, we can indeed make the total 
volume of the covering arbitrarily small. □ 
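
The covering argument can be tried numerically in the simplest case $n = 2$. The sketch below is a variant under stated assumptions: it takes an increasing function on a segment, so that the oscillation on each cell is just the difference of the endpoint values, and it uses a cell-dependent half-height instead of one uniform $\varepsilon$. The name `graph_cover_area` is hypothetical.

```python
def graph_cover_area(f, a, b, k):
    """Total area of k rectangles that cover the graph of an increasing f on [a, b].

    The segment [a, b] is cut into k equal cells. Over each cell the graph is
    enclosed in the rectangle [left, right] x [f(left) - eps, f(left) + eps],
    where eps is the oscillation of f on that cell.
    """
    h = (b - a) / k
    area = 0.0
    for i in range(k):
        left, right = a + i * h, a + (i + 1) * h
        eps = f(right) - f(left)  # oscillation of an increasing f on the cell
        area += h * 2 * eps       # width times height of the covering rectangle
    return area


# For f(x) = x^2 on [0, 1] the total area telescopes to 2 * h * (f(1) - f(0)) = 2 / k.
for k in (10, 100, 1000):
    print(k, graph_cover_area(lambda x: x * x, 0.0, 1.0, k))
```

The total area of the covering tends to zero as the partition is refined, which is the content of Example 2 for this particular graph.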


² That is, an interval $I_{a,b} = \{x \in \mathbb{R}^n \mid a^i \le x^i \le b^i,\ i = 1, \dots, n\}$ such that the strict inequality 
$a^i < b^i$ holds for each $i \in \{1, \dots, n\}$. 
Remark 1 Comparing assertion b) in Lemma 2 with Example 2, one can conclude 
that in general the graph of a continuous function $f : \mathbb{R}^{n-1} \to \mathbb{R}$ or a continuous 
function $f : M \to \mathbb{R}$, where $M \subset \mathbb{R}^{n-1}$, is a set of n-dimensional measure zero 
in $\mathbb{R}^n$. 

Lemma 3 a) The class of sets of measure zero remains the same whether the intervals 
covering the set $E$ in Definition 9, that is, $E \subset \bigcup_i I_i$, are interpreted as an 
ordinary system of intervals $\{I_i\}$, or in a stricter sense, requiring that each point of 
the set be an interior point of at least one of the intervals in the covering.³ 

b) A compact set $K$ in $\mathbb{R}^n$ is a set of measure zero if and only if for every $\varepsilon > 0$ 
there exists a finite covering of $K$ by intervals the sum of whose volumes is less 
than $\varepsilon$. 

Proof a) If $\{I_i\}$ is a covering of $E$ (that is, $E \subset \bigcup_i I_i$ and $\sum_i |I_i| < \varepsilon$), then, replacing 
each $I_i$ by a dilation of it from its center, which we denote $\tilde I_i$, we obtain 
a system of intervals $\{\tilde I_i\}$ such that $\sum_i |\tilde I_i| < \lambda^n \varepsilon$, where $\lambda$ is a dilation coefficient 
that is the same for all intervals. If $\lambda > 1$, it is obvious that the system $\{\tilde I_i\}$ will 
cover $E$ in such a way that every point of $E$ is interior to one of the intervals in the 
covering. 

b) This follows from a) and the possibility of extracting a finite covering from any 
open covering of a compact set $K$. (The system $\{\tilde I_i \setminus \partial \tilde I_i\}$ consisting of open intervals 
obtained from the system $\{\tilde I_i\}$ considered in a) may serve as such a covering.) □ 


b. A Generalization of Cantor’s Theorem 

We recall that the oscillation of a function $f : E \to \mathbb{R}$ on the set $E$ has been defined 
as $\omega(f; E) := \sup_{x_1, x_2 \in E} |f(x_1) - f(x_2)|$, and the oscillation at the point $x \in E$ as 
$\omega(f; x) := \lim_{\delta \to 0} \omega(f; U_E^{\delta}(x))$, where $U_E^{\delta}(x)$ is the $\delta$-neighborhood of $x$ in the 
set $E$. 

Lemma 4 If the relation $\omega(f; x) \le \omega_0$ holds at each point of a compact set $K$ 
for the function $f : K \to \mathbb{R}$, then for every $\varepsilon > 0$ there exists $\delta > 0$ such that 
$\omega(f; U_K^{\delta}(x)) < \omega_0 + \varepsilon$ for each point $x \in K$. 

When $\omega_0 = 0$, this assertion becomes Cantor's theorem on uniform continuity of 
a function that is continuous on a compact set. The proof of Lemma 4 is a verbatim 
repetition of the proof of Cantor's theorem (Sect. 6.2.2) and therefore we do not take 
the time to give it here. 


³ In other words, it makes no difference whether we mean closed or open intervals in Definition 9. 
c. Lebesgue's Criterion 

As before, we shall say that a property holds at almost all points of a set M or almost 
everywhere on M if the subset of M where this property does not necessarily hold 
has measure zero. 

Theorem 1 (Lebesgue's criterion) $f \in \mathcal{R}(I) \Leftrightarrow$ ($f$ is bounded on $I$) $\wedge$ ($f$ is continuous 
almost everywhere on $I$). 

Proof Necessity. If $f \in \mathcal{R}(I)$, then by Proposition 1 the function $f$ is bounded on $I$. 
Suppose $|f| \le M$ on $I$. 

We shall now verify that $f$ is continuous at almost all points of $I$. To do this, we 
shall show that if the set $E$ of its points of discontinuity does not have measure zero, 
then $f \notin \mathcal{R}(I)$. 

Indeed, representing $E$ in the form $E = \bigcup_{n=1}^{\infty} E_n$, where $E_n = \{x \in I \mid \omega(f; x) \ge 1/n\}$, 
we conclude from Lemma 2 that if $E$ does not have measure zero, 
then there exists an index $n_0$ such that $E_{n_0}$ is also not a set of measure zero. Let $P$ 
be an arbitrary partition of the interval $I$ into intervals $\{I_i\}$. We break the partition 
$P$ into two groups of intervals $A$ and $B$, where 

\[ A = \Bigl\{ I_i \in P \Bigm| I_i \cap E_{n_0} \ne \varnothing \ \wedge\ \omega(f; I_i) \ge \frac{1}{2n_0} \Bigr\}, \qquad B = P \setminus A. \]

The system of intervals $A$ forms a covering of the set $E_{n_0}$. In fact, each point 
of $E_{n_0}$ lies either in the interior of some interval $I_i \in P$, in which case obviously 
$I_i \in A$, or on the boundary of several intervals of the partition $P$. In the latter case, 
the oscillation of the function must be at least $\frac{1}{2n_0}$ on at least one of these intervals 
(because of the triangle inequality), and that interval belongs to the system $A$. 

We shall now show that by choosing the set $\xi$ of distinguished points in the 
intervals of the partition $P$ in different ways we can change the value of the Riemann 
sum significantly. 

To be specific, we choose the sets of points $\xi'$ and $\xi''$ such that in the intervals 
of the system $B$ the distinguished points are the same, while in the intervals $I_i$ of 
the system $A$, we choose the points $\xi'_i$ and $\xi''_i$ so that $f(\xi'_i) - f(\xi''_i) > \frac{1}{3n_0}$. We then 
have 

\[
|\sigma(f, P, \xi') - \sigma(f, P, \xi'')| = \Bigl| \sum_{I_i \in A} \bigl(f(\xi'_i) - f(\xi''_i)\bigr)|I_i| \Bigr| > \frac{1}{3n_0} \sum_{I_i \in A} |I_i| > c > 0.
\]



The existence of such a constant $c$ follows from the fact that the intervals of the 
system $A$ form a covering of the set $E_{n_0}$, which by hypothesis is not a set of measure 
zero. 

Since $P$ was an arbitrary partition of the interval $I$, we conclude from the Cauchy 
criterion that the Riemann sums $\sigma(f, P, \xi)$ cannot have a limit as $\lambda(P) \to 0$, that 
is, $f \notin \mathcal{R}(I)$. 
Sufficiency. Let $\varepsilon$ be an arbitrary positive number and $E_\varepsilon = \{x \in I \mid \omega(f; x) \ge \varepsilon\}$. 
By hypothesis, $E_\varepsilon$ is a set of measure zero. 

Moreover, $E_\varepsilon$ is obviously closed in $I$, so that $E_\varepsilon$ is compact. By Lemma 3 
there exists a finite system $I_1, \dots, I_k$ of intervals in $\mathbb{R}^n$ such that $E_\varepsilon \subset \bigcup_{i=1}^{k} I_i$ and 
$\sum_{i=1}^{k} |I_i| < \varepsilon$. Let us set $C_1 = \bigcup_{i=1}^{k} I_i$ and denote by $C_2$ and $C_3$ the unions of the 
intervals obtained from the intervals $I_i$ by dilation with center at the center of $I_i$ and 
scaling factors 2 and 3 respectively. It is clear that $E_\varepsilon$ lies strictly in the interior of $C_2$ 
and that the distance $d$ between the boundaries of the sets $C_2$ and $C_3$ is positive. 

We note that the sum of the volumes of any finite system of intervals lying in $C_3$, 
no two of which have any common interior points, is at most $3^n \varepsilon$, where $n$ is the 
dimension of the space $\mathbb{R}^n$. This follows from the definition of the set $C_3$ and properties 
of the measure of an interval (Lemma 1). 

We also note that any subset of the interval $I$ whose diameter is less than $d$ is 
either contained in $C_3$ or lies in the compact set $K = I \setminus (C_2 \setminus \partial C_2)$, where $\partial C_2$ is the 
boundary of $C_2$ (and hence $C_2 \setminus \partial C_2$ is the set of interior points of $C_2$). 

By construction $E_\varepsilon \subset I \setminus K$, so that at every point $x \in K$ we must have 
$\omega(f; x) < \varepsilon$. By Lemma 4 there exists $\delta > 0$ such that $|f(x_1) - f(x_2)| < 2\varepsilon$ for 
every pair of points $x_1, x_2 \in K$ whose distance from each other is at most $\delta$. 

These constructions make it possible now to carry out the proof of the sufficient 
condition for integrability as follows. We choose any two partitions $P'$ and $P''$ of 
the interval $I$ with meshes $\lambda(P')$ and $\lambda(P'')$ less than $\lambda = \min\{d, \delta\}$. Let $P$ be the 
partition obtained by intersecting all the intervals of the partitions $P'$ and $P''$, that 
is, in a natural notation, $P = \{I_{ij} = I'_i \cap I''_j\}$. Let us compare the Riemann sums 
$\sigma(f, P, \xi)$ and $\sigma(f, P', \xi')$. Taking into account the equality $|I'_i| = \sum_j |I_{ij}|$, we 
can write 

\[
\begin{aligned}
|\sigma(f, P', \xi') - \sigma(f, P, \xi)| &= \Bigl| \sum_{ij} \bigl(f(\xi'_i) - f(\xi_{ij})\bigr)|I_{ij}| \Bigr| \le \\
&\le \sum\nolimits_1 |f(\xi'_i) - f(\xi_{ij})|\,|I_{ij}| + \sum\nolimits_2 |f(\xi'_i) - f(\xi_{ij})|\,|I_{ij}|.
\end{aligned}
\]

Here the first sum $\sum_1$ contains the intervals of the partition $P$ lying in the intervals 
$I'_i$ of the partition $P'$ contained in the set $C_3$, and the remaining intervals of $P$ 
are included in the sum $\sum_2$, that is, they are all necessarily contained in $K$ (after 
all, $\lambda(P) < d$). 

Since $|f| \le M$ on $I$, replacing $|f(\xi'_i) - f(\xi_{ij})|$ in the first sum by $2M$, we conclude 
that the first sum does not exceed $2M \cdot 3^n \varepsilon$. 

Now, noting that $\xi'_i, \xi_{ij} \in I'_i \subset K$ in the second sum and $\lambda(P') < \delta$, we conclude 
that $|f(\xi'_i) - f(\xi_{ij})| < 2\varepsilon$, and consequently the second sum does not exceed $2\varepsilon|I|$. 

Thus $|\sigma(f, P', \xi') - \sigma(f, P, \xi)| < (2M \cdot 3^n + 2|I|)\varepsilon$, from which (in view of the 
symmetry between $P'$ and $P''$), using the triangle inequality, we find that 

\[ |\sigma(f, P', \xi') - \sigma(f, P'', \xi'')| < 4(3^n M + |I|)\varepsilon \]

for any two partitions $P'$ and $P''$ with sufficiently small mesh. By the Cauchy criterion 
we now conclude that $f \in \mathcal{R}(I)$. □ 

Remark 2 Since the Cauchy criterion for existence of a limit is valid in any complete 
metric space, the sufficiency part of the Lebesgue criterion (but not the necessity 
part), as the proof shows, holds for functions with values in any complete normed 
vector space. 


11.1.3 The Darboux Criterion 

Let us consider another useful criterion for Riemann integrability of a function, 
which is applicable only to real-valued functions. 


a. Lower and Upper Darboux Sums 

Let $f$ be a real-valued function on the interval $I$ and $P = \{I_i\}$ a partition of the 
interval $I$. We set 

\[ m_i = \inf_{x \in I_i} f(x), \qquad M_i = \sup_{x \in I_i} f(x). \]

Definition 10 The quantities 

\[ s(f, P) = \sum_i m_i |I_i| \quad \text{and} \quad S(f, P) = \sum_i M_i |I_i| \]

are called the lower and upper Darboux sums of the function $f$ over the interval $I$ 
corresponding to the partition $P$ of the interval. 

Lemma 5 The following relations hold between the Darboux sums of a function 
$f : I \to \mathbb{R}$: 

a) $s(f, P) = \inf_{\xi} \sigma(f, P, \xi) \le \sigma(f, P, \xi) \le \sup_{\xi} \sigma(f, P, \xi) = S(f, P)$; 

b) if the partition $P'$ of the interval $I$ is obtained by refining intervals of the 
partition $P$, then $s(f, P) \le s(f, P') \le S(f, P') \le S(f, P)$; 

c) the inequality $s(f, P_1) \le S(f, P_2)$ holds for any pair of partitions $P_1$ and $P_2$ 
of the interval $I$. 

Proof Relations a) and b) follow immediately from Definitions 6 and 10, taking 
account, of course, of the definition of the greatest lower bound and least upper 
bound of a set of numbers. 

To prove c) it suffices to consider the auxiliary partition $P$ obtained by intersecting 
the intervals of the partitions $P_1$ and $P_2$. The partition $P$ can be regarded as a 
refinement of each of the partitions $P_1$ and $P_2$, so that b) now implies 

\[ s(f, P_1) \le s(f, P) \le S(f, P) \le S(f, P_2). \]

□ 
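
The inequalities of Lemma 5 are easy to observe on a small example. The sketch below is restricted, by assumption, to an increasing function on a segment of the real line, where the infimum and supremum on each cell are attained at its endpoints; the function name `darboux_sums` and the sample partitions are illustrative.

```python
def darboux_sums(f, points):
    """Lower and upper Darboux sums of an increasing f on [points[0], points[-1]].

    `points` is an increasing list of partition points. For an increasing f
    the infimum on a cell is f(left) and the supremum is f(right).
    """
    lower = upper = 0.0
    for left, right in zip(points, points[1:]):
        lower += f(left) * (right - left)
        upper += f(right) * (right - left)
    return lower, upper


f = lambda x: x * x                    # increasing on [0, 1]
coarse = [0.0, 0.5, 1.0]
fine = [0.0, 0.25, 0.5, 0.75, 1.0]     # a refinement of `coarse`
other = [0.0, 0.3, 1.0]                # a partition unrelated to `coarse`

s1, S1 = darboux_sums(f, coarse)
s2, S2 = darboux_sums(f, fine)
s3, S3 = darboux_sums(f, other)

print(s1 <= s2 <= S2 <= S1)    # relation b): refinement squeezes the sums
print(s1 <= S3 and s3 <= S1)   # relation c): any lower sum <= any upper sum
```

For a general bounded function the infimum and supremum on each cell would have to be computed or estimated separately; the endpoint shortcut used here is valid only because of monotonicity.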
b. Lower and Upper Integrals 

Definition 11 The lower and upper Darboux integrals of the function $f : I \to \mathbb{R}$ 
over the interval $I$ are respectively 

\[ \underline{J} = \sup_P s(f, P), \qquad \overline{J} = \inf_P S(f, P), \]

where the supremum and infimum are taken over all partitions $P$ of the interval $I$. 

It follows from this definition and the properties of Darboux sums exhibited in 
Lemma 5 that the inequalities 

\[ s(f, P) \le \underline{J} \le \overline{J} \le S(f, P) \]

hold for any partition $P$ of the interval. 

Theorem 2 (Darboux) For any bounded function $f : I \to \mathbb{R}$, 

\[ \Bigl(\exists \lim_{\lambda(P) \to 0} s(f, P)\Bigr) \wedge \Bigl(\lim_{\lambda(P) \to 0} s(f, P) = \underline{J}\Bigr); \]

\[ \Bigl(\exists \lim_{\lambda(P) \to 0} S(f, P)\Bigr) \wedge \Bigl(\lim_{\lambda(P) \to 0} S(f, P) = \overline{J}\Bigr). \]


Proof If we compare these assertions with Definition 11, it becomes clear that in 
essence all we have to prove is that the limits exist. We shall verify this for the lower 
Darboux sums. 

Fix $\varepsilon > 0$ and a partition $P_\varepsilon$ of the interval $I$ for which $s(f; P_\varepsilon) > \underline{J} - \varepsilon$. Let 
$\Gamma_\varepsilon$ be the set of points of the interval $I$ lying on the boundary of the intervals of the 
partition $P_\varepsilon$. As follows from Example 2, $\Gamma_\varepsilon$ is a set of measure zero. Because of 
the simple structure of $\Gamma_\varepsilon$, it is even obvious that there exists a number $\lambda_\varepsilon$ such that 
the sum of the volumes of those intervals that intersect $\Gamma_\varepsilon$ is less than $\varepsilon$ for every 
partition $P$ such that $\lambda(P) < \lambda_\varepsilon$. 

Now taking any partition $P$ with mesh $\lambda(P) < \lambda_\varepsilon$, we form an auxiliary partition 
$P'$ obtained by intersecting the intervals of the partitions $P$ and $P_\varepsilon$. By the choice 
of the partition $P_\varepsilon$ and the properties of Darboux sums (Lemma 5), we find 

\[ \underline{J} - \varepsilon < s(f, P_\varepsilon) \le s(f, P') \le \underline{J}. \]

We now remark that the sums $s(f, P')$ and $s(f, P)$ both contain all the terms 
that correspond to intervals of the partition $P$ that do not meet $\Gamma_\varepsilon$. Therefore, if 
$|f(x)| \le M$ on $I$, then 

\[ |s(f, P') - s(f, P)| < 2M\varepsilon \]

and taking account of the preceding inequalities, we thereby find that for $\lambda(P) < \lambda_\varepsilon$ 
we have the relation 

\[ \underline{J} - s(f, P) < (2M + 1)\varepsilon. \]

Comparing the relation just obtained with Definition 11, we conclude that the limit 
$\lim_{\lambda(P) \to 0} s(f, P)$ does indeed exist and is equal to $\underline{J}$. 

Similar reasoning can be carried out for the upper sums. □ 


c. The Darboux Criterion for Integrability of a Real-Valued Function 

Theorem 3 (The Darboux criterion) A real-valued function $f : I \to \mathbb{R}$ defined on 
an interval $I \subset \mathbb{R}^n$ is integrable over that interval if and only if it is bounded on $I$ 
and its upper and lower Darboux integrals are equal. 

Thus, 

\[ f \in \mathcal{R}(I) \Longleftrightarrow (f \text{ is bounded on } I) \wedge (\underline{J} = \overline{J}). \]

Proof Necessity. If $f \in \mathcal{R}(I)$, then by Proposition 1 the function $f$ is bounded on $I$. 
It follows from Definition 7 of the integral, Definition 11 of the quantities $\underline{J}$ and $\overline{J}$, 
and part a) of Lemma 5 that in this case $\underline{J} = \overline{J}$. 

Sufficiency. Since $s(f, P) \le \sigma(f, P, \xi) \le S(f, P)$, when $\underline{J} = \overline{J}$ the extreme terms 
in these inequalities tend to the same limit by Theorem 2 as $\lambda(P) \to 0$. Therefore 
$\sigma(f, P, \xi)$ has the same limit as $\lambda(P) \to 0$. □ 

Remark 3 It is clear from the proof of the Darboux criterion that if a function is 
integrable, its lower and upper Darboux integrals are equal to each other and to the 
integral of the function. 


11.1.4 Problems and Exercises 

1. a) Show that a set of measure zero has no interior points. 

b) Show that not having interior points by no means guarantees that a set is of 
measure zero. 

c) Construct a set having measure zero whose closure is the entire space $\mathbb{R}^n$. 

d) A set $E \subset I$ is said to have content zero if for every $\varepsilon > 0$ it can be covered 
by a finite system of intervals $I_1, \dots, I_k$ such that $\sum_{i=1}^{k} |I_i| < \varepsilon$. Is every bounded 
set of measure zero a set of content zero? 

e) Show that if a set $E \subset \mathbb{R}^n$ is the direct product $\mathbb{R} \times e$ of the line $\mathbb{R}$ and a set 
$e \subset \mathbb{R}^{n-1}$ of (n − 1)-dimensional measure zero, then $E$ is a set of n-dimensional 
measure zero. 

2. a) Construct the analogue of the Dirichlet function in $\mathbb{R}^n$ and show that a 
bounded function $f : I \to \mathbb{R}$ equal to zero at almost every point of the interval $I$ 
may still fail to belong to $\mathcal{R}(I)$. 

b) Show that if $f \in \mathcal{R}(I)$ and $f(x) = 0$ at almost all points of the interval $I$, 
then $\int_I f(x)\,dx = 0$. 
3. There is a small difference between our earlier definition of the Riemann integral 
on a closed interval $I \subset \mathbb{R}$ and Definition 7 for the integral over an interval 
of arbitrary dimension. This difference involves the definition of a partition and the 
measure of an interval of the partition. Clarify this nuance for yourself and verify 
that 

\[ \int_a^b f(x)\,dx = \int_I f(x)\,dx, \quad \text{if } a < b, \]

and 

\[ \int_a^b f(x)\,dx = -\int_I f(x)\,dx, \quad \text{if } a > b, \]

where $I$ is the interval on the real line $\mathbb{R}$ with endpoints $a$ and $b$. 

4. a) Prove that a real-valued function $f : I \to \mathbb{R}$ defined on an interval $I \subset \mathbb{R}^n$ is 
integrable over that interval if and only if for every $\varepsilon > 0$ there exists a partition $P$ 
of $I$ such that $S(f; P) - s(f; P) < \varepsilon$. 

b) Using the result of a) and assuming that we are dealing with a real-valued 
function $f : I \to \mathbb{R}$, one can simplify slightly the proof of the sufficiency of the 
Lebesgue criterion. Try to carry out this simplification by yourself. 


11.2 The Integral over a Set 
11.2.1 Admissible Sets 

In what follows we shall be integrating functions not only over an interval, but also 
over other sets in R" that are not too complicated. 

Definition 1 A set $E \subset \mathbb{R}^n$ is admissible if it is bounded in $\mathbb{R}^n$ and its boundary is 
a set of measure zero (in the sense of Lebesgue). 

Example 1 A cube, a tetrahedron, and a ball in $\mathbb{R}^3$ (or $\mathbb{R}^n$) are admissible sets. 

Example 2 Suppose the functions $\varphi_i : I \to \mathbb{R}$, $i = 1, 2$, defined on an (n − 1)-dimensional 
interval $I \subset \mathbb{R}^{n-1}$ are such that $\varphi_1(x) < \varphi_2(x)$ at every point $x \in I$. If 
these functions are continuous, Example 2 of Sect. 11.1 makes it possible to assert 
that the domain in $\mathbb{R}^n$ bounded by the graphs of these functions and the cylindrical 
lateral surface lying over the boundary $\partial I$ of $I$ is an admissible set in $\mathbb{R}^n$. 

We recall that the boundary $\partial E$ of a set $E \subset \mathbb{R}^n$ consists of the points $x$ such that 
every neighborhood of $x$ contains both points of $E$ and points of the complement of 
$E$ in $\mathbb{R}^n$. Hence we have the following lemma. 


Lemma 1 For any sets $E, E_1, E_2 \subset \mathbb{R}^n$, the following assertions hold:

a) $\partial E$ is a closed subset of $\mathbb{R}^n$;

b) $\partial(E_1 \cup E_2) \subset \partial E_1 \cup \partial E_2$;

c) $\partial(E_1 \cap E_2) \subset \partial E_1 \cup \partial E_2$;

d) $\partial(E_1 \setminus E_2) \subset \partial E_1 \cup \partial E_2$.

This lemma and Definition 1 together imply the following lemma. 

Lemma 2 The union or intersection of a finite number of admissible sets is an admissible set; the difference of admissible sets is also an admissible set.

Remark 1 For an infinite collection of admissible sets Lemma 2 is generally not 
true, and the same is the case with assertions b) and c) of Lemma 1. 

Remark 2 The boundary of an admissible set is not only closed, but also bounded in $\mathbb{R}^n$, that is, it is a compact subset of $\mathbb{R}^n$. Hence by Lemma 3 of Sect. 11.1, it can even be covered by a finite set of intervals whose total content (volume) is arbitrarily close to zero.

We now consider the characteristic function

$$\chi_E(x) = \begin{cases} 1, & \text{if } x \in E, \\ 0, & \text{if } x \notin E, \end{cases}$$

of an admissible set $E$. Like the characteristic function of any set $E$, the function $\chi_E(x)$ has discontinuities at the boundary points of the set $E$ and at no other points. Hence if $E$ is an admissible set, the function $\chi_E(x)$ is continuous at almost all points of $\mathbb{R}^n$.


11.2.2 The Integral over a Set 

Let $f$ be a function defined on a set $E$. We shall agree, as before, to denote the function equal to $f(x)$ for $x \in E$ and to 0 outside $E$ by $f\chi_E(x)$ (even though $f$ may happen to be undefined outside of $E$).

Definition 2 The integral of $f$ over $E$ is given by

$$\int_E f(x)\,dx := \int_{I \supset E} f\chi_E(x)\,dx,$$

where $I$ is any interval containing $E$.

If the integral on the right-hand side of this equality does not exist, we say that $f$ is (Riemann) nonintegrable over $E$. Otherwise $f$ is (Riemann) integrable over $E$. The set of all functions that are Riemann integrable over $E$ will be denoted $\mathcal{R}(E)$.

Definition 2 of course requires some explanation, which is provided by the following lemma.

Lemma 3 If $I_1$ and $I_2$ are two intervals, both containing the set $E$, then the integrals

$$\int_{I_1} f\chi_E(x)\,dx \quad \text{and} \quad \int_{I_2} f\chi_E(x)\,dx$$

either both exist or both fail to exist, and in the first case their values are the same.

Proof Consider the interval $I = I_1 \cap I_2$. By hypothesis $I \supset E$. The points of discontinuity of $f\chi_E$ are either points of discontinuity of $f$ on $E$, or the result of discontinuities of $\chi_E$, in which case they lie on $\partial E$. In any case, all these points lie in $I = I_1 \cap I_2$. By Lebesgue’s criterion (Theorem 1 of Sect. 11.1) it follows that the integrals of $f\chi_E$ over the intervals $I$, $I_1$, and $I_2$ either all exist or all fail to exist. If they do exist, we may choose partitions of $I$, $I_1$, and $I_2$ to suit ourselves. Therefore we shall choose only those partitions of $I_1$ and $I_2$ obtained as extensions of partitions of $I = I_1 \cap I_2$. Since the function is zero outside $I$, the Riemann sums corresponding to these partitions of $I_1$ and $I_2$ reduce to Riemann sums for the corresponding partition of $I$. It then results from passage to the limit that the integrals over $I_1$ and $I_2$ are equal to the integral of the function in question over $I$. □

Lebesgue’s criterion (Theorem 1 of Sect. 11.1) for the existence of the integral 
over an interval and Definition 2 now imply the following theorem. 

Theorem 1 A function $f : E \to \mathbb{R}$ is integrable over an admissible set if and only if it is bounded and continuous at almost all points of $E$.

Proof Compared with $f$, the function $f\chi_E$ may have additional points of discontinuity only on the boundary $\partial E$ of $E$, which by hypothesis is a set of measure zero. □


11.2.3 The Measure (Volume) of an Admissible Set 

Definition 3 The (Jordan) measure or content of a bounded set $E \subset \mathbb{R}^n$ is

$$\mu(E) := \int_E 1 \cdot dx,$$

provided this Riemann integral exists.

Since

$$\int_E 1 \cdot dx = \int_{I \supset E} \chi_E(x)\,dx$$

and the discontinuities of $\chi_E$ form the set $\partial E$, we find by Lebesgue’s criterion that the measure just introduced is defined only for admissible sets.

Thus admissible sets, and only admissible sets, are measurable in the sense of 
Definition 3. 

Let us now ascertain the geometric meaning of $\mu(E)$. If $E$ is an admissible set then

$$\mu(E) = \int_{I \supset E} \chi_E(x)\,dx = \overline{\int_{I \supset E}} \chi_E(x)\,dx = \underline{\int_{I \supset E}} \chi_E(x)\,dx,$$

where the last two integrals are the upper and lower Darboux integrals respectively. By the Darboux criterion for existence of the integral (Theorem 3 of Sect. 11.1) the measure $\mu(E)$ of a set is defined if and only if these lower and upper integrals are equal. By the theorem of Darboux (Theorem 2 of Sect. 11.1) they are the limits of the upper and lower Darboux sums of the function $\chi_E$ corresponding to partitions $P$ of $I$. But by definition of $\chi_E$ the lower Darboux sum is the sum of the volumes of the intervals of the partition $P$ that are entirely contained in $E$ (the volume of a polyhedron inscribed in $E$), while the upper sum is the sum of the volumes of the intervals of $P$ that intersect $E$ (the volume of a circumscribed polyhedron). Hence $\mu(E)$ is the common limit as $\lambda(P) \to 0$ of the volumes of polyhedra inscribed in and circumscribed about $E$, in agreement with the accepted idea of the volume of simple solids $E \subset \mathbb{R}^n$.
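The inscribed and circumscribed polyhedra can be tried out numerically. The following is only an illustrative sketch, not part of the original text: the function name `jordan_bounds`, the choice of the closed unit disk as $E$, and the uniform partition of the square $[-1, 1]^2$ are all assumptions made for the example. It returns the lower and upper Darboux sums of $\chi_E$, which should bracket $\pi$ and approach each other as the mesh shrinks.

```python
import math

def jordan_bounds(n):
    """Lower and upper Darboux sums of the indicator of the closed unit disk
    for the uniform partition of [-1, 1]^2 into n * n closed square cells."""
    h = 2.0 / n
    inner = 0  # cells entirely contained in the disk (inscribed polyhedron)
    outer = 0  # cells that intersect the disk (circumscribed polyhedron)
    for i in range(n):
        x0, x1 = -1 + i * h, -1 + (i + 1) * h
        for j in range(n):
            y0, y1 = -1 + j * h, -1 + (j + 1) * h
            # squared distance from the origin to the farthest corner of the cell
            far2 = max(x0 * x0, x1 * x1) + max(y0 * y0, y1 * y1)
            # squared distance to the nearest point of the cell (clamp 0 into the cell)
            nx = min(max(0.0, x0), x1)
            ny = min(max(0.0, y0), y1)
            near2 = nx * nx + ny * ny
            if far2 <= 1.0:
                inner += 1
            if near2 <= 1.0:
                outer += 1
    return inner * h * h, outer * h * h

if __name__ == "__main__":
    for n in (50, 100, 400):
        lo, hi = jordan_bounds(n)
        print(n, lo, hi, hi - lo)
```

The gap between the two sums is the total area of the cells that meet the boundary circle, which is why it tends to zero exactly when the boundary has content zero.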

For $n = 1$ content is usually called length, and for $n = 2$ it is called area.

Remark 3 Let us now explain why the measure $\mu(E)$ introduced in Definition 3 is sometimes called Jordan measure.

Definition 4 A set $E \subset \mathbb{R}^n$ is a set of measure zero in the sense of Jordan or a set of content zero if for every $\varepsilon > 0$ it can be covered by a finite system of intervals $I_1, \ldots, I_k$ such that $\sum_{i=1}^{k} |I_i| < \varepsilon$.

Compared with measure zero in the sense of Lebesgue, a requirement that the 
covering be finite appears here, shrinking the class of sets of Lebesgue measure 
zero. For example, the set of rational points is a set of measure zero in the sense of 
Lebesgue, but not in the sense of Jordan. 
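The Lebesgue half of this example rests on a standard covering: the $k$-th point of a countable set gets an interval of length $\varepsilon / 2^{k+1}$, so the total length stays below $\varepsilon$ however many points are covered. The sketch below is illustrative only; the helper names and the restriction to rationals in $[0, 1]$ with bounded denominator are assumptions made so that the computation is finite and exact. It does not demonstrate the Jordan half of the claim.

```python
from fractions import Fraction

def rationals_in_unit_interval(max_den):
    """Distinct rationals p/q in [0, 1] with q <= max_den, in a fixed order."""
    seen, ordered = set(), []
    for q in range(1, max_den + 1):
        for p in range(q + 1):
            r = Fraction(p, q)
            if r not in seen:
                seen.add(r)
                ordered.append(r)
    return ordered

def shrinking_cover(points, eps):
    """Interval number k is centered at points[k] and has length eps / 2**(k + 1)."""
    cover = []
    for k, r in enumerate(points):
        half = eps / 2 ** (k + 2)
        cover.append((r - half, r + half))
    return cover

def total_length(cover):
    """Sum of the lengths of the intervals (exact, since Fractions are used)."""
    return sum(b - a for a, b in cover)

if __name__ == "__main__":
    eps = Fraction(1, 100)
    pts = rationals_in_unit_interval(12)
    print(len(pts), float(total_length(shrinking_cover(pts, eps))))
```

Because the lengths form a geometric series, the bound does not depend on how many points are enumerated, which is what lets the construction pass to the full countable set.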

In order for the least upper bound of the contents of polyhedra inscribed in a bounded set $E$ to be the same as the greatest lower bound of the contents of polyhedra circumscribed about $E$ (and to serve as the measure $\mu(E)$ or content of $E$), it is obviously necessary and sufficient that the boundary $\partial E$ of $E$ have measure 0 in the sense of Jordan. That is the motivation for the following definition.

Definition 5 A set $E$ is Jordan-measurable if it is bounded and its boundary has Jordan measure zero.

As Remark 2 shows, the class of Jordan-measurable subsets is precisely the class of admissible sets introduced in Definition 1. That is the reason the measure $\mu(E)$ defined earlier can be called (and is called) the Jordan measure of the (Jordan-measurable) set $E$.


11.2.4 Problems and Exercises 

1. a) Show that if a set $E \subset \mathbb{R}^n$ is such that $\mu(E) = 0$, then the relation $\mu(\overline{E}) = 0$ also holds for the closure $\overline{E}$ of the set.

b) Give an example of a bounded set $E$ of Lebesgue measure zero whose closure $\overline{E}$ is not a set of Lebesgue measure zero.

c) Determine whether assertion b) of Lemma 3 in Sect. 11.1 should be understood as asserting that the concepts of Jordan measure zero and Lebesgue measure zero are the same for compact sets.

d) Prove that if the projection of a bounded set $E \subset \mathbb{R}^n$ onto a hyperplane $\mathbb{R}^{n-1}$ has $(n-1)$-dimensional measure zero, then the set $E$ itself has $n$-dimensional measure zero.

e) Show that a Jordan-measurable set whose interior is empty has measure 0.

2. a) Is it possible for the integral of a function $f$ over a bounded set $E$, as introduced in Definition 2, to exist if $E$ is not an admissible (Jordan-measurable) set?

b) Is a constant function $f : E \to \mathbb{R}$ integrable over a bounded but Jordan-nonmeasurable set $E$?

c) Is it true that if a function $f$ is integrable over $E$, then the restriction $f|_A$ of this function to any subset $A \subset E$ is integrable over $A$?

d) Give necessary and sufficient conditions on a function $f : E \to \mathbb{R}$ defined on a bounded (but not necessarily Jordan-measurable) set $E$ under which the Riemann integral of $f$ over $E$ exists.

3. a) Let $E$ be a set of Lebesgue measure 0 and $f : E \to \mathbb{R}$ a bounded continuous function on $E$. Is $f$ always integrable on $E$?

b) Answer question a) assuming that $E$ is a set of Jordan measure zero.

c) What is the value of the integral of the function $f$ in a) if it exists?

4. The Brunn-Minkowski inequality. Given two nonempty sets $A, B \subset \mathbb{R}^n$, we form their (vector) sum in the sense of Minkowski $A + B := \{a + b \mid a \in A,\ b \in B\}$. Let $V(E)$ denote the content of a set $E \subset \mathbb{R}^n$.

a) Verify that if $A$ and $B$ are standard $n$-dimensional intervals (parallelepipeds), then

$$V^{1/n}(A + B) \ge V^{1/n}(A) + V^{1/n}(B).$$

b) Now prove the preceding inequality (the Brunn-Minkowski inequality) for arbitrary measurable compact sets $A$ and $B$.

c) Show that equality holds in the Brunn-Minkowski inequality only in the following three cases: when $V(A + B) = 0$, when $A$ and $B$ are singleton (one-point) sets, and when $A$ and $B$ are similar convex sets.


11.3 General Properties of the Integral
11.3.1 The Integral as a Linear Functional


Proposition 1 a) The set $\mathcal{R}(E)$ of functions that are Riemann-integrable over a bounded set $E \subset \mathbb{R}^n$ is a vector space with respect to the standard operations of addition of functions and multiplication by constants.

b) The integral is a linear functional

$$\int_E : \mathcal{R}(E) \to \mathbb{R}$$

on the set $\mathcal{R}(E)$.

Proof Noting that the union of two sets of measure zero is also a set of measure 
zero, we see that assertion a) follows immediately from the definition of the integral 
and the Lebesgue criterion for existence of the integral of a function over an interval. 

Taking account of the linearity of Riemann sums, we obtain the linearity of the 
integral by passage to the limit. □ 


Remark 1 If we recall that the limit of the Riemann sums as $\lambda(P) \to 0$ must be the same independently of the set of distinguished points $\xi$, we can conclude that

$$\bigl(f \in \mathcal{R}(E)\bigr) \wedge \bigl(f(x) = 0 \text{ almost everywhere on } E\bigr) \Longrightarrow \Bigl(\int_E f(x)\,dx = 0\Bigr).$$

Therefore, if two integrable functions are equal at almost all points of $E$, then their integrals over $E$ are also equal. Hence if we pass to the quotient space of $\mathcal{R}(E)$ obtained by identifying functions that are equal at almost all points of $E$, we obtain a vector space $\widetilde{\mathcal{R}}(E)$ on which the integral is also a linear functional.


11.3.2 Additivity of the Integral 

Although we shall always be dealing with admissible sets $E \subset \mathbb{R}^n$, this assumption was dispensable in Sect. 11.3.1 (and we dispensed with it). From now on we shall be talking only of admissible sets.

Proposition 2 Let $E_1$ and $E_2$ be admissible sets in $\mathbb{R}^n$ and $f$ a function defined on $E_1 \cup E_2$.

a) The following relations hold:

$$\Bigl(\exists \int_{E_1 \cup E_2} f(x)\,dx\Bigr) \Longleftrightarrow \Bigl(\exists \int_{E_1} f(x)\,dx\Bigr) \wedge \Bigl(\exists \int_{E_2} f(x)\,dx\Bigr) \Longrightarrow \exists \int_{E_1 \cap E_2} f(x)\,dx.$$

b) If in addition it is known that $\mu(E_1 \cap E_2) = 0$, the following equality holds when the integrals exist:

$$\int_{E_1 \cup E_2} f(x)\,dx = \int_{E_1} f(x)\,dx + \int_{E_2} f(x)\,dx.$$

Proof Assertion a) follows from Lebesgue’s criterion for existence of the Riemann integral over an admissible set (Theorem 1 of Sect. 11.2). Here it is only necessary to recall that the union and intersection of admissible sets are also admissible sets (Lemma 2 of Sect. 11.2).

To prove b) we begin by remarking that

$$\chi_{E_1 \cup E_2}(x) = \chi_{E_1}(x) + \chi_{E_2}(x) - \chi_{E_1 \cap E_2}(x).$$

Therefore,

$$\begin{aligned}
\int_{E_1 \cup E_2} f(x)\,dx &= \int_{I \supset E_1 \cup E_2} f\chi_{E_1 \cup E_2}(x)\,dx = \\
&= \int_I f\chi_{E_1}(x)\,dx + \int_I f\chi_{E_2}(x)\,dx - \int_I f\chi_{E_1 \cap E_2}(x)\,dx = \\
&= \int_{E_1} f(x)\,dx + \int_{E_2} f(x)\,dx.
\end{aligned}$$

The essential point is that the integral

$$\int_I f\chi_{E_1 \cap E_2}(x)\,dx = \int_{E_1 \cap E_2} f(x)\,dx,$$

as we know from part a), exists; and since $\mu(E_1 \cap E_2) = 0$, it equals zero (see Remark 1). □


11.3.3 Estimates for the Integral 


a. A General Estimate 


We begin with a general estimate of the integral that is also valid for functions with 
values in any complete normed space. 


Proposition 3 If $f \in \mathcal{R}(E)$, then $|f| \in \mathcal{R}(E)$, and the inequality

$$\Bigl|\int_E f(x)\,dx\Bigr| \le \int_E |f|(x)\,dx$$

holds.

Proof The relation $|f| \in \mathcal{R}(E)$ follows from the definition of the integral over a set and the Lebesgue criterion for integrability of a function over an interval.

The inequality now follows from the corresponding inequality for Riemann sums and passage to the limit. □


b. The Integral of a Nonnegative Function 

The following propositions apply only to real-valued functions. 

Proposition 4 The following implication holds for a function $f : E \to \mathbb{R}$:

$$\bigl(f \in \mathcal{R}(E)\bigr) \wedge \bigl(\forall x \in E\ (f(x) \ge 0)\bigr) \Longrightarrow \int_E f(x)\,dx \ge 0.$$

Proof Indeed, if $f(x) \ge 0$ on $E$, then $f\chi_E(x) \ge 0$ in $\mathbb{R}^n$. Then, by definition,

$$\int_E f(x)\,dx = \int_{I \supset E} f\chi_E(x)\,dx.$$

This last integral exists by hypothesis. But it is the limit of nonnegative Riemann sums and hence nonnegative. □

From Proposition 4 just proved, we obtain successively the following corollaries. 

Corollary 1

$$\bigl(f, g \in \mathcal{R}(E)\bigr) \wedge (f \le g \text{ on } E) \Longrightarrow \Bigl(\int_E f(x)\,dx \le \int_E g(x)\,dx\Bigr).$$

Corollary 2 If $f \in \mathcal{R}(E)$ and the inequalities $m \le f(x) \le M$ hold at every point of the admissible set $E$, then

$$m\mu(E) \le \int_E f(x)\,dx \le M\mu(E).$$

Corollary 3 If $f \in \mathcal{R}(E)$, $m = \inf_{x \in E} f(x)$, and $M = \sup_{x \in E} f(x)$, then there is a number $\theta \in [m, M]$ such that

$$\int_E f(x)\,dx = \theta\mu(E).$$

Corollary 4 If $E$ is a connected admissible set and the function $f \in \mathcal{R}(E)$ is continuous, then there exists a point $\xi \in E$ such that

$$\int_E f(x)\,dx = f(\xi)\mu(E).$$

Corollary 5 If in addition to the hypotheses of Corollary 2 the function $g \in \mathcal{R}(E)$ is nonnegative on $E$, then

$$m \int_E g(x)\,dx \le \int_E fg(x)\,dx \le M \int_E g(x)\,dx.$$

Corollary 4 is a generalization of the one-dimensional result and is usually called 
by the same name, that is, the mean-value theorem for the integral. 

Proof Corollary 5 follows from the inequalities $mg(x) \le f(x)g(x) \le Mg(x)$ taking account of the linearity of the integral and Corollary 1. It can also be proved directly by passing from integrals over $E$ to the corresponding integrals over an interval, verifying the inequalities for the Riemann sums, and then passing to the limit. Since all these arguments were carried out in detail in the one-dimensional case, we shall not give the details. We note merely that the integrability of the product $f \cdot g$ of the functions $f$ and $g$ obviously follows from Lebesgue’s criterion. □

We shall now illustrate these relations in practice, using them to verify the fol- 
lowing very useful lemma. 

Lemma a) If the integral of a nonnegative function $f : I \to \mathbb{R}$ over the interval $I$ equals zero, then $f(x) = 0$ at almost all points of the interval $I$.

b) Assertion a) remains valid if the interval $I$ in it is replaced by any admissible (Jordan-measurable) set $E$.

Proof By Lebesgue’s criterion the function $f \in \mathcal{R}(I)$ is continuous at almost all points of the interval $I$. For that reason the proof of a) will be achieved if we show that $f(a) = 0$ at each point of continuity $a \in I$ of the function $f$.

Assume that $f(a) > 0$. Then $f(x) \ge c > 0$ in some neighborhood $U_I(a)$ of $a$ (the neighborhood may be assumed to be an interval). Then, by the properties of the integral just proved,

$$\int_I f(x)\,dx = \int_{U_I(a)} f(x)\,dx + \int_{I \setminus U_I(a)} f(x)\,dx \ge \int_{U_I(a)} f(x)\,dx \ge c\mu\bigl(U_I(a)\bigr) > 0.$$

This contradiction verifies assertion a). If we apply this assertion to the function $f\chi_E$ and take account of the relation $\mu(\partial E) = 0$, we obtain assertion b). □

Remark 2 It follows from the lemma just proved that if $E$ is a Jordan-measurable set in $\mathbb{R}^n$ and $\widetilde{\mathcal{R}}(E)$ is the vector space considered in Remark 1, consisting of equivalence classes of functions that are integrable over $E$ and differ only on sets of Lebesgue measure zero, then the quantity $\|f\| = \int_E |f|(x)\,dx$ is a norm on $\widetilde{\mathcal{R}}(E)$.


Proof Indeed, the equality $\int_E |f|(x)\,dx = 0$ now implies that $f$ is in the same equivalence class as the identically zero function. □


11.3.4 Problems and Exercises


1. Let $E$ be a Jordan-measurable set of nonzero measure, $f : E \to \mathbb{R}$ a continuous nonnegative integrable function on $E$, and $M = \sup_{x \in E} f(x)$. Prove that

$$\lim_{n \to \infty} \Bigl(\int_E f^n(x)\,dx\Bigr)^{1/n} = M.$$

2. Prove that if $f, g \in \mathcal{R}(E)$, then the following are true.

a) Hölder’s inequality

$$\Bigl|\int_E (f \cdot g)(x)\,dx\Bigr| \le \Bigl(\int_E |f|^p(x)\,dx\Bigr)^{1/p} \Bigl(\int_E |g|^q(x)\,dx\Bigr)^{1/q},$$

where $p > 1$, $q > 1$, and $\frac{1}{p} + \frac{1}{q} = 1$;

b) Minkowski’s inequality

$$\Bigl(\int_E |f + g|^p\,dx\Bigr)^{1/p} \le \Bigl(\int_E |f|^p(x)\,dx\Bigr)^{1/p} + \Bigl(\int_E |g|^p(x)\,dx\Bigr)^{1/p},$$

if $p \ge 1$.

Show that

c) the preceding inequality reverses direction if $0 < p < 1$;

d) equality holds in Minkowski’s inequality if and only if there exists $\lambda \ge 0$ such that one of the equalities $f = \lambda g$ or $g = \lambda f$ holds except on a set of measure zero in $E$;

e) the quantity $\|f\|_p = \bigl(\frac{1}{\mu(E)} \int_E |f|^p(x)\,dx\bigr)^{1/p}$, where $\mu(E) > 0$, is a monotone function of $p \in \mathbb{R}$ and is a norm on the space $\widetilde{\mathcal{R}}(E)$ for $p \ge 1$.

Find the conditions under which equality holds in Hölder’s inequality.

3. Let $E$ be a Jordan-measurable set in $\mathbb{R}^n$ with $\mu(E) > 0$. Verify that if $\varphi \in C(E, \mathbb{R})$ and $f : \mathbb{R} \to \mathbb{R}$ is a convex function, then

$$f\Bigl(\frac{1}{\mu(E)} \int_E \varphi(x)\,dx\Bigr) \le \frac{1}{\mu(E)} \int_E (f \circ \varphi)(x)\,dx.$$

4. a) Show that if $E$ is a Jordan-measurable set in $\mathbb{R}^n$ and the function $f : E \to \mathbb{R}$ is integrable over $E$ and continuous at an interior point $a \in E$, then

$$\lim_{\delta \to +0} \frac{1}{\mu\bigl(U_E^\delta(a)\bigr)} \int_{U_E^\delta(a)} f(x)\,dx = f(a),$$

where, as usual, $U_E^\delta(a)$ is the $\delta$-neighborhood of the point in $E$.

b) Verify that the preceding relation remains valid if the condition that $a$ is an interior point of $E$ is replaced by the condition $\mu\bigl(U_E^\delta(a)\bigr) > 0$ for every $\delta > 0$.


11.4 Reduction of a Multiple Integral to an Iterated Integral
11.4.1 Fubini’s Theorem

Up to now, we have discussed only the definition of the integral, the conditions under which it exists, and its general properties. In the present section we shall prove Fubini’s theorem,⁴ which, together with the formula for change of variable, is a tool for computing multiple integrals.

Theorem⁵ Let $X \times Y$ be an interval in $\mathbb{R}^{m+n}$, which is the direct product of intervals $X \subset \mathbb{R}^m$ and $Y \subset \mathbb{R}^n$. If the function $f : X \times Y \to \mathbb{R}$ is integrable over $X \times Y$, then all three of the integrals

$$\int_{X \times Y} f(x, y)\,dx\,dy, \qquad \int_X dx \int_Y f(x, y)\,dy, \qquad \int_Y dy \int_X f(x, y)\,dx$$

exist and are equal.

Before taking up the proof of this theorem, let us decode the meaning of the symbolic expressions that occur in the statement of it. The integral $\int_{X \times Y} f(x, y)\,dx\,dy$ is the integral of the function $f$ over the set $X \times Y$, which we are familiar with, written in terms of the variables $x \in X$ and $y \in Y$. The iterated integral $\int_X dx \int_Y f(x, y)\,dy$ should be understood as follows: For each fixed $x \in X$ the integral $F(x) = \int_Y f(x, y)\,dy$ is computed, and the resulting function $F : X \to \mathbb{R}$ is then to be integrated over $X$. If, in the process, the integral $\int_Y f(x, y)\,dy$ does not exist for some $x \in X$, then $F(x)$ is set equal to any value between the lower and upper Darboux integrals $\underline{\mathcal{J}}(x) = \underline{\int_Y} f(x, y)\,dy$ and $\overline{\mathcal{J}}(x) = \overline{\int_Y} f(x, y)\,dy$, including the lower and upper integrals $\underline{\mathcal{J}}(x)$ and $\overline{\mathcal{J}}(x)$ themselves. It will be shown that in that case $F \in \mathcal{R}(X)$. The iterated integral $\int_Y dy \int_X f(x, y)\,dx$ has a similar meaning.

It will become clear in the course of the proof that the set of values of $x \in X$ at which $\underline{\mathcal{J}}(x) \ne \overline{\mathcal{J}}(x)$ is a set of $m$-dimensional measure zero in $X$.

Similarly, the set of $y \in Y$ at which the integral $\int_X f(x, y)\,dx$ may fail to exist will turn out to be a set of $n$-dimensional measure zero in $Y$.

We remark finally that, in contrast to the integral over an $(m + n)$-dimensional interval, which we previously agreed to call a multiple integral, the successively computed integrals of the function $f(x, y)$ over $Y$ and then over $X$ or over $X$ and then over $Y$ are customarily called iterated integrals of the function.

⁴ G. Fubini (1879-1943) - Italian mathematician. His main work was in the area of the theory of functions and geometry.

⁵ This theorem was proved long before the theorem known in analysis as Fubini’s theorem, of which it is a special case. However, it has become the custom to refer to theorems making it possible to reduce the computation of multiple integrals to iterated integrals in lower dimensions as theorems of Fubini type, or, for brevity, Fubini’s Theorem.

If $X$ and $Y$ are closed intervals on the line, the theorem stated here theoretically reduces the computation of a double integral over the interval $X \times Y$ to the successive computation of two one-dimensional integrals. It is clear that by applying this theorem several times, one can reduce the computation of an integral over a $k$-dimensional interval to the successive computation of $k$ one-dimensional integrals.

The essence of the theorem we have stated is very simple and consists of the following. Consider a Riemann sum $\sum_{i,j} f(x_i, y_j)|X_i| \cdot |Y_j|$ corresponding to a partition of the interval $X \times Y$ into intervals $X_i \times Y_j$. Since the integral over the interval $X \times Y$ exists, the distinguished points can be chosen as we wish, and we choose them as the “direct product” of choices $x_i \in X_i \subset X$ and $y_j \in Y_j \subset Y$. We can then write

$$\sum_{i,j} f(x_i, y_j)|X_i| \cdot |Y_j| = \sum_i |X_i| \sum_j f(x_i, y_j)|Y_j| = \sum_j |Y_j| \sum_i f(x_i, y_j)|X_i|,$$

and this is the prelimit form of the theorem.
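The prelimit identity is purely algebraic and can be checked on any tagged product partition. The sketch below is illustrative only: the helper names `riemann_sums` and `midpoint_cells`, the midpoint tags, and the test function are assumptions made for the example, not notation from the text.

```python
def midpoint_cells(a, b, n):
    """Uniform partition of [a, b] into n cells, each tagged at its midpoint.
    Returns a list of (tag point, cell length) pairs."""
    h = (b - a) / n
    return [(a + (k + 0.5) * h, h) for k in range(n)]

def riemann_sums(f, x_cells, y_cells):
    """The double Riemann sum over the product partition and its two
    regroupings as iterated sums (x outside, then y outside)."""
    double = sum(f(x, y) * dx * dy for x, dx in x_cells for y, dy in y_cells)
    x_outer = sum(dx * sum(f(x, y) * dy for y, dy in y_cells) for x, dx in x_cells)
    y_outer = sum(dy * sum(f(x, y) * dx for x, dx in x_cells) for y, dy in y_cells)
    return double, x_outer, y_outer

if __name__ == "__main__":
    # f(x, y) = x * y on [0, 1] x [0, 2]; the exact integral is 1
    print(riemann_sums(lambda x, y: x * y,
                       midpoint_cells(0.0, 1.0, 50),
                       midpoint_cells(0.0, 2.0, 80)))
```

The three numbers agree up to floating-point rounding for every partition; the content of the theorem lies in what happens to each grouping as the mesh tends to zero.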

We now give the formal proof. 

Proof Every partition $P$ of the interval $X \times Y$ is induced by corresponding partitions $P_X$ and $P_Y$ of the intervals $X$ and $Y$. Here every interval of the partition $P$ is the direct product $X_i \times Y_j$ of certain intervals $X_i$ and $Y_j$ of the partitions $P_X$ and $P_Y$ respectively. By properties of the volume of an interval we have $|X_i \times Y_j| = |X_i| \cdot |Y_j|$, where each of these volumes is computed in the space $\mathbb{R}^{m+n}$, $\mathbb{R}^m$, or $\mathbb{R}^n$ in which the interval in question is situated.

Using the properties of the greatest lower bound and least upper bound and the definition of the lower and upper Darboux sums and integrals, we now carry out the following estimates:

$$\begin{aligned}
s(f, P) &= \sum_{i,j} \inf_{\substack{x \in X_i \\ y \in Y_j}} f(x, y)\,|X_i \times Y_j| \le \sum_i \inf_{x \in X_i} \Bigl(\sum_j \inf_{y \in Y_j} f(x, y)|Y_j|\Bigr)|X_i| \le \\
&\le \sum_i \inf_{x \in X_i} \Bigl(\underline{\int_Y} f(x, y)\,dy\Bigr)|X_i| \le \sum_i \inf_{x \in X_i} F(x)|X_i| \le \\
&\le \sum_i \sup_{x \in X_i} F(x)|X_i| \le \sum_i \sup_{x \in X_i} \Bigl(\overline{\int_Y} f(x, y)\,dy\Bigr)|X_i| \le \\
&\le \sum_i \sup_{x \in X_i} \Bigl(\sum_j \sup_{y \in Y_j} f(x, y)|Y_j|\Bigr)|X_i| \le \sum_{i,j} \sup_{\substack{x \in X_i \\ y \in Y_j}} f(x, y)\,|X_i \times Y_j| = S(f, P).
\end{aligned}$$

Since $f \in \mathcal{R}(X \times Y)$, both of the extreme terms in these inequalities tend to the value of the integral of the function over the interval $X \times Y$ as $\lambda(P) \to 0$. This fact enables us to conclude that $F \in \mathcal{R}(X)$ and that the following equality holds:

$$\int_{X \times Y} f(x, y)\,dx\,dy = \int_X F(x)\,dx.$$

We have carried out the proof for the case when the iterated integration is carried 
out first over Y , then over X. It is clear that similar reasoning can be used in the case 
when the integration over X is done first. □ 


11.4.2 Some Corollaries 


Corollary 1 If $f \in \mathcal{R}(X \times Y)$, then for almost all $x \in X$ (in the sense of Lebesgue) the integral $\int_Y f(x, y)\,dy$ exists, and for almost all $y \in Y$ the integral $\int_X f(x, y)\,dx$ exists.

Proof By the theorem just proved,

$$\int_X \Bigl(\overline{\int_Y} f(x, y)\,dy - \underline{\int_Y} f(x, y)\,dy\Bigr)\,dx = 0.$$

But the difference of the upper and lower integrals in parentheses is nonnegative. We can therefore conclude by the lemma of Sect. 11.3 that this difference equals zero at almost all points $x \in X$.

Then by the Darboux criterion (Theorem 3 of Sect. 11.1) the integral $\int_Y f(x, y)\,dy$ exists for almost all values of $x \in X$.

The second half of the corollary is proved similarly. □ 

Corollary 2 If the interval $I \subset \mathbb{R}^n$ is the direct product of the closed intervals $I_i = [a^i, b^i]$, $i = 1, \ldots, n$, then

$$\int_I f(x)\,dx = \int_{a^n}^{b^n} dx^n \int_{a^{n-1}}^{b^{n-1}} dx^{n-1} \cdots \int_{a^1}^{b^1} f(x^1, x^2, \ldots, x^n)\,dx^1.$$

Proof This formula obviously results from repeated application of the theorem just proved. All the inner integrals on the right-hand side are to be understood as in the theorem. For example, one can insert the upper or lower integral sign throughout. □



Example 1 Let $f(x, y, z) = z \sin(x + y)$. We shall find the integral of the restriction of this function to the interval $I \subset \mathbb{R}^3$ defined by the relations $0 \le x \le \pi$, $|y| \le \pi/2$, $0 \le z \le 1$.




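By Corollary 2

$$\begin{aligned}
\iiint_I f(x, y, z)\,dx\,dy\,dz &= \int_0^1 dz \int_{-\pi/2}^{\pi/2} dy \int_0^\pi z \sin(x + y)\,dx = \\
&= \int_0^1 dz \int_{-\pi/2}^{\pi/2} \bigl(-z \cos(x + y)\big|_{x=0}^{\pi}\bigr)\,dy = \\
&= \int_0^1 dz \int_{-\pi/2}^{\pi/2} 2z \cos y\,dy = \\
&= \int_0^1 \bigl(2z \sin y\big|_{y=-\pi/2}^{\pi/2}\bigr)\,dz = \int_0^1 4z\,dz = 2.
\end{aligned}$$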
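The value 2 obtained in Example 1 can be cross-checked by evaluating the same iterated integral numerically. This is a rough sketch only: the composite midpoint rule, the helper names, and the number of nodes are assumptions chosen for illustration, and the result agrees with 2 only up to the quadrature error.

```python
import math

def midpoint(g, a, b, n):
    """Composite midpoint rule for the integral of g over [a, b] with n cells."""
    h = (b - a) / n
    return h * sum(g(a + (k + 0.5) * h) for k in range(n))

def example_integral(n=60):
    """Iterated integral of z * sin(x + y): first over x in [0, pi],
    then over y in [-pi/2, pi/2], then over z in [0, 1]."""
    def inner_x(y, z):
        return midpoint(lambda x: z * math.sin(x + y), 0.0, math.pi, n)

    def inner_y(z):
        return midpoint(lambda y: inner_x(y, z), -math.pi / 2, math.pi / 2, n)

    return midpoint(inner_y, 0.0, 1.0, n)

if __name__ == "__main__":
    print(example_integral())
```

The nesting of the three one-dimensional quadratures mirrors the order of integration used in the example.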

The theorem can also be used to compute integrals over very general sets. 


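Corollary 3 Let $D$ be a bounded set in $\mathbb{R}^{n-1}$ and $E = \{(x, y) \in \mathbb{R}^n \mid (x \in D) \wedge (\varphi_1(x) \le y \le \varphi_2(x))\}$. If $f \in \mathcal{R}(E)$, then

$$\int_E f(x, y)\,dx\,dy = \int_D dx \int_{\varphi_1(x)}^{\varphi_2(x)} f(x, y)\,dy. \tag{11.1}$$

Proof Let $E_x = \{y \in \mathbb{R} \mid \varphi_1(x) \le y \le \varphi_2(x)\}$ if $x \in D$ and $E_x = \varnothing$ if $x \notin D$. We remark that $\chi_E(x, y) = \chi_D(x) \cdot \chi_{E_x}(y)$. Recalling the definition of the integral over a set and using Fubini’s theorem, we obtain

$$\begin{aligned}
\int_E f(x, y)\,dx\,dy &= \int_{I \supset E} f\chi_E(x, y)\,dx\,dy = \int_{I_x \supset D} dx \int_{I_y \supset E_x} f\chi_E(x, y)\,dy = \\
&= \int_{I_x} \Bigl(\int_{I_y} f(x, y)\chi_{E_x}(y)\,dy\Bigr)\chi_D(x)\,dx = \\
&= \int_{I_x} \Bigl(\int_{\varphi_1(x)}^{\varphi_2(x)} f(x, y)\,dy\Bigr)\chi_D(x)\,dx = \int_D \Bigl(\int_{\varphi_1(x)}^{\varphi_2(x)} f(x, y)\,dy\Bigr)\,dx.
\end{aligned}$$

The inner integral here may also fail to exist on a set of points in $D$ of Lebesgue measure zero, and if so it is assigned the same meaning as in the theorem of Fubini proved above. □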
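Formula (11.1) translates directly into a nested quadrature in which the inner limits depend on the outer variable. The following sketch is illustrative only; the function names, the midpoint rule, and the test region $D = [0, 1]$, $\varphi_1(x) = x^2$, $\varphi_2(x) = x$ with integrand $xy$ are assumptions chosen so that the exact value, $1/24$, is easy to confirm by hand.

```python
def midpoint(g, a, b, n):
    """Composite midpoint rule for the integral of g over [a, b] with n cells."""
    h = (b - a) / n
    return h * sum(g(a + (k + 0.5) * h) for k in range(n))

def integral_between_graphs(f, phi1, phi2, a, b, n=400):
    """Approximates the integral of f over {a <= x <= b, phi1(x) <= y <= phi2(x)}
    as an iterated integral: inner over y between the graphs, outer over x."""
    def inner(x):
        return midpoint(lambda y: f(x, y), phi1(x), phi2(x), n)
    return midpoint(inner, a, b, n)

if __name__ == "__main__":
    value = integral_between_graphs(lambda x, y: x * y,
                                    lambda x: x * x,
                                    lambda x: x,
                                    0.0, 1.0)
    print(value, 1 / 24)
```

For this integrand the inner integral equals $x(x^2 - x^4)/2$, and integrating that over $[0, 1]$ gives $\frac{1}{2}\bigl(\frac{1}{4} - \frac{1}{6}\bigr) = \frac{1}{24}$.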


Remark If the set $D$ in the hypotheses of Corollary 3 is Jordan-measurable and the functions $\varphi_i : D \to \mathbb{R}$, $i = 1, 2$, are continuous and bounded, then the set $E \subset \mathbb{R}^n$ is Jordan measurable.

Proof The boundary $\partial E$ of $E$ consists of the two graphs of the continuous functions $\varphi_i : D \to \mathbb{R}$, $i = 1, 2$ (which by Example 2 of Sect. 11.1 are sets of measure zero), and the set $Z$, which is a portion of the product of the boundary $\partial D$ of $D \subset \mathbb{R}^{n-1}$ and a sufficiently large one-dimensional closed interval of length $l$. By hypothesis $\partial D$ can be covered by a system of $(n-1)$-dimensional intervals of total $(n-1)$-dimensional volume less than $\varepsilon / l$. The direct product of these intervals and the given one-dimensional interval of length $l$ gives a covering of $Z$ by intervals whose total volume is less than $\varepsilon$. □


Because of this remark one can say that the function $f : E \to 1 \in \mathbb{R}$ is integrable on a measurable set $E$ having this structure (as it is on any measurable set $E$). Relying on Corollary 3 and the definition of the measure of a measurable set, one can now derive the following corollary.


Corollary 4 If under the hypotheses of Corollary 3 the set $D$ is Jordan-measurable and the functions $\varphi_i : D \to \mathbb{R}$, $i = 1, 2$, are continuous, then the set $E$ is measurable and its volume can be computed according to the formula

$$\mu(E) = \int_D \bigl(\varphi_2(x) - \varphi_1(x)\bigr)\,dx. \tag{11.2}$$


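Example 2 For the disk $E = \{(x, y) \in \mathbb{R}^2 \mid x^2 + y^2 \le r^2\}$ we obtain by this formula

$$\begin{aligned}
\mu(E) &= \int_{-r}^{r} \Bigl(\sqrt{r^2 - y^2} - \bigl(-\sqrt{r^2 - y^2}\bigr)\Bigr)\,dy = 2 \int_{-r}^{r} \sqrt{r^2 - y^2}\,dy = \\
&= 4 \int_0^r \sqrt{r^2 - y^2}\,dy = 4 \int_0^{\pi/2} r \cos\varphi\,d(r \sin\varphi) = \\
&= 4r^2 \int_0^{\pi/2} \cos^2\varphi\,d\varphi = \pi r^2.
\end{aligned}$$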
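As a sanity check on Example 2, formula (11.2) can be evaluated numerically for the disk, where the difference of the two bounding functions is $2\sqrt{r^2 - y^2}$. The sketch below is illustrative only; the function name `disk_area`, the midpoint rule, and the node count are assumptions for the example.

```python
import math

def disk_area(r, n=20000):
    """Midpoint-rule approximation of the integral of 2 * sqrt(r^2 - y^2)
    over [-r, r], i.e. formula (11.2) applied to the disk of radius r."""
    h = 2.0 * r / n
    total = 0.0
    for k in range(n):
        y = -r + (k + 0.5) * h  # midpoints lie strictly inside (-r, r)
        total += 2.0 * math.sqrt(r * r - y * y)
    return total * h

if __name__ == "__main__":
    for r in (1.0, 2.0):
        print(r, disk_area(r), math.pi * r * r)
```

Convergence is somewhat slower than for smooth integrands because the square root has unbounded derivative at $y = \pm r$, but the agreement with $\pi r^2$ is already close at moderate resolution.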


Corollary 5 Let $E$ be a measurable set contained in the interval $I \subset \mathbb{R}^n$. Represent $I$ as the direct product $I = I_x \times I_y$ of the $(n-1)$-dimensional interval $I_x$ and the closed interval $I_y$. Then for almost all values $y_0 \in I_y$ the section $E_{y_0} = \{(x, y) \in E \mid y = y_0\}$ of the set $E$ by the $(n-1)$-dimensional hyperplane $y = y_0$ is a measurable subset of it, and

$$\mu(E) = \int_{I_y} \mu(E_y)\,dy, \tag{11.3}$$

where $\mu(E_y)$ is the $(n-1)$-dimensional measure of the set $E_y$ if it is measurable and equal to any number between the numbers $\underline{\int_{E_y}} 1 \cdot dx$ and $\overline{\int_{E_y}} 1 \cdot dx$ if $E_y$ happens to be a nonmeasurable set.


Proof Corollary 5 follows immediately from the theorem and Corollary 1, if we set $f = \chi_E$ in both of them and take account of the relation $\chi_E(x, y) = \chi_{E_y}(x)$. □

A particular consequence of this result is the following.

Corollary 6 (Cavalieri’s⁶ principle) Let $A$ and $B$ be two solids in $\mathbb{R}^3$ having volume (that is, Jordan-measurable). Let $A_c = \{(x, y, z) \in A \mid z = c\}$ and $B_c = \{(x, y, z) \in B \mid z = c\}$ be the sections of the solids $A$ and $B$ by the plane $z = c$. If for every $c \in \mathbb{R}$ the sets $A_c$ and $B_c$ are measurable and have the same area, then the solids $A$ and $B$ have the same volumes.


It is clear that Cavalieri’s principle can be stated for spaces $\mathbb{R}^n$ of any dimension.


Example 3 Using formula (11.3), let us compute the volume $V_n$ of the ball $B = \{x \in \mathbb{R}^n \mid |x| \le r\}$ of radius $r$ in the Euclidean space $\mathbb{R}^n$.

It is obvious that $V_1 = 2r$. In Example 2 we found that $V_2 = \pi r^2$. We shall show that $V_n = c_n r^n$, where $c_n$ is a constant (which we shall compute below). Let us choose some diameter $[-r, r]$ of the ball and for each point $x \in [-r, r]$ consider the section $B_x$ of the ball $B$ by a hyperplane orthogonal to the diameter. Since $B_x$ is a ball of dimension $n - 1$, whose radius, by the Pythagorean theorem, equals $\sqrt{r^2 - x^2}$, proceeding by induction and using (11.3), we can write

$$V_n = \int_{-r}^{r} c_{n-1} (r^2 - x^2)^{\frac{n-1}{2}}\,dx = \Bigl(c_{n-1} \int_{-\pi/2}^{\pi/2} \cos^n\varphi\,d\varphi\Bigr) r^n.$$

(In passing to the last equality, as one can see, we made the change of variable $x = r \sin\varphi$.)

Thus we have shown that $V_n = c_n r^n$, and

$$c_n = c_{n-1} \int_{-\pi/2}^{\pi/2} \cos^n\varphi\,d\varphi. \tag{11.4}$$


We now find the constant $c_n$ explicitly. We remark that for $m \ge 2$

$$\begin{aligned}
I_m &= \int_{-\pi/2}^{\pi/2} \cos^m\varphi\,d\varphi = \int_{-\pi/2}^{\pi/2} \cos^{m-2}\varphi\,(1 - \sin^2\varphi)\,d\varphi = \\
&= I_{m-2} + \frac{1}{m-1} \int_{-\pi/2}^{\pi/2} \sin\varphi\,d\cos^{m-1}\varphi = I_{m-2} - \frac{1}{m-1} I_m,
\end{aligned}$$

that is, the following recurrence relation holds:

$$I_m = \frac{m-1}{m} I_{m-2}. \tag{11.5}$$


⁶ B. Cavalieri (1598-1647) - Italian mathematician, the creator of the so-called method of indivisibles for determining areas and volumes.

In particular, $I_2 = \pi/2$. It is clear immediately from the definition of $I_m$ that $I_1 = 2$. Taking account of these values of $I_1$ and $I_2$ we find by the recurrence formula (11.5) that

$$I_{2k+1} = \frac{(2k)!!}{(2k+1)!!} \cdot 2, \qquad I_{2k} = \frac{(2k-1)!!}{(2k)!!}\,\pi. \tag{11.6}$$

Returning to formula (11.4), we now obtain

$$c_{2k+1} = c_{2k} \frac{(2k)!!}{(2k+1)!!} \cdot 2 = c_{2k-1} \frac{(2k)!!}{(2k+1)!!} \cdot 2 \cdot \frac{(2k-1)!!}{(2k)!!}\,\pi = \cdots = c_1 \frac{(2\pi)^k}{(2k+1)!!},$$

$$c_{2k} = c_{2k-1} \frac{(2k-1)!!}{(2k)!!}\,\pi = c_{2k-2} \frac{(2k-1)!!}{(2k)!!}\,\pi \cdot \frac{(2k-2)!!}{(2k-1)!!} \cdot 2 = \cdots = c_2 \frac{(2\pi)^{k-1}}{(2k)!!} \cdot 2.$$

But, as we have seen above, $c_1 = 2$ and $c_2 = \pi$, and hence the final formulas for the required volume $V_n$ are as follows:

$$V_{2k+1} = 2\,\frac{(2\pi)^k}{(2k+1)!!}\,r^{2k+1}, \qquad V_{2k} = \frac{(2\pi)^k}{(2k)!!}\,r^{2k}, \tag{11.7}$$

where $k \in \mathbb{N}$, and the first of these formulas is also valid for $k = 0$.
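The recurrence (11.5), the relation (11.4), and the closed formulas (11.7) can be checked against one another numerically. The sketch below is illustrative only: all function names are assumptions, and the comparison with $\pi^{n/2} / \Gamma(n/2 + 1)$ uses a well-known equivalent expression for the volume of the unit ball that is not derived in the text.

```python
import math

def wallis(m):
    """I_m = integral of cos^m over [-pi/2, pi/2], computed from
    I_0 = pi, I_1 = 2 and the recurrence I_m = (m - 1) / m * I_{m-2}."""
    value = math.pi if m % 2 == 0 else 2.0
    for j in range(2 + m % 2, m + 1, 2):
        value *= (j - 1) / j
    return value

def unit_ball_volume(n):
    """c_n obtained from c_1 = 2 and c_n = c_{n-1} * I_n, as in (11.4)."""
    c = 2.0
    for m in range(2, n + 1):
        c *= wallis(m)
    return c

def double_factorial(m):
    """m!! = m * (m - 2) * (m - 4) * ... down to 1 or 2."""
    result = 1
    while m > 1:
        result *= m
        m -= 2
    return result

def closed_form(n):
    """c_n read off from formulas (11.7) with r = 1."""
    k, odd = divmod(n, 2)
    if odd:
        return 2.0 * (2.0 * math.pi) ** k / double_factorial(2 * k + 1)
    return (2.0 * math.pi) ** k / double_factorial(2 * k)

if __name__ == "__main__":
    for n in range(1, 9):
        gamma_form = math.pi ** (n / 2) / math.gamma(n / 2 + 1)
        print(n, unit_ball_volume(n), closed_form(n), gamma_form)
```

For $n = 3$ all three expressions give $4\pi/3$, the familiar volume of the unit ball in space.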


11.4.3 Problems and Exercises 

1. a) Construct a subset of the square $I \subset \mathbb{R}^2$ such that on the one hand its intersection with any vertical line and any horizontal line consists of at most one point, while on the other hand its closure equals $I$.

b) Construct a function $f : I \to \mathbb{R}$ for which both of the iterated integrals that occur in Fubini’s theorem exist and are equal, yet $f \notin \mathcal{R}(I)$.

c) Show by example that if the values of the function $F(x)$ that occurs in Fubini’s theorem, which in the theorem were subjected to the conditions $\underline{\mathcal{J}}(x) \le F(x) \le \overline{\mathcal{J}}(x)$ at all points where $\underline{\mathcal{J}}(x) < \overline{\mathcal{J}}(x)$, are simply set equal to zero at those points, the resulting function may turn out to be nonintegrable. (Consider, for example, the function $f(x, y)$ on $\mathbb{R}^2$ equal to 1 if the point $(x, y)$ is not rational and to $1 - 1/q$ at the point $(p/q, m/n)$, both fractions being in lowest terms.)

2. a) In connection with formula (11.3), show that even if all the sections of a bounded set $E$ by a family of parallel hyperplanes are measurable, the set $E$ may yet be nonmeasurable.




b) Suppose that in addition to the hypotheses of part a) it is known that the function $\mu(E_y)$ in formula (11.3) is integrable over the closed interval $I_y$. Can we assert that in this case the set $E$ is measurable?

3. Using Fubini's theorem and the positivity of the integral of a positive function, give a simple proof of the equality $\frac{\partial^2 f}{\partial x\,\partial y} = \frac{\partial^2 f}{\partial y\,\partial x}$ for the mixed partial derivatives, assuming that they are continuous functions.

4. Let $f : I_{a,b} \to \mathbb{R}$ be a continuous function defined on an interval $I_{a,b} = \{x \in \mathbb{R}^n \mid a^i \le x^i \le b^i,\ i = 1, \dots, n\}$, and let $F : I_{a,b} \to \mathbb{R}$ be defined by the equality

$$F(x) = \int_{I_{a,x}} f(t)\,dt,$$

where $I_{a,x} \subset I_{a,b}$. Find the partial derivatives of this function with respect to the variables $x^1, \dots, x^n$.

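5. Let $f(x, y)$ be a continuous function defined on the rectangle $I = [a, b] \times [c, d] \subset \mathbb{R}^2$, which has a continuous partial derivative $\frac{\partial f}{\partial y}$ in $I$.

a) Let $F(y) = \int_a^b f(x, y)\,dx$. Starting from the equality $F(y) = \int_a^b \left( \int_c^y \frac{\partial f}{\partial y}(x, t)\,dt + f(x, c) \right) dx$, verify the Leibniz rule, according to which $F'(y) = \int_a^b \frac{\partial f}{\partial y}(x, y)\,dx$.

b) Let $G(x, y) = \int_a^x f(t, y)\,dt$. Find $\frac{\partial G}{\partial x}$ and $\frac{\partial G}{\partial y}$.

c) Let $H(y) = \int_a^{h(y)} f(x, y)\,dx$, where $h \in C^{(1)}[a, b]$. Find $H'(y)$.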

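6. Consider the sequence of integrals

$$F_0(x) = \int_0^x f(y)\,dy, \qquad F_n(x) = \int_0^x \frac{(x - y)^n}{n!}\, f(y)\,dy, \quad n \in \mathbb{N},$$

where $f \in C(\mathbb{R}, \mathbb{R})$.

a) Verify that $F_n'(x) = F_{n-1}(x)$, $F_n^{(k)}(0) = 0$ if $k \le n$, and $F_n^{(n+1)}(x) = f(x)$.

b) Show that

$$\int_0^x dx_0 \int_0^{x_0} dx_1 \cdots \int_0^{x_{n-1}} f(x_n)\,dx_n = \frac{1}{n!} \int_0^x (x - y)^n f(y)\,dy,$$

where the left-hand side is an $(n+1)$-fold iterated integral.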


7. a) Let $f : E \to \mathbb{R}$ be a function that is continuous on the set $E = \{(x, y) \in \mathbb{R}^2 \mid 0 \le x \le 1 \wedge 0 \le y \le x\}$. Prove that

$$\int_0^1 dx \int_0^x f(x, y)\,dy = \int_0^1 dy \int_y^1 f(x, y)\,dx.$$

b) Use the example of the iterated integral $\int_0^{2\pi} dx \int_0^{\sin x} 1 \cdot dy$ to explain why not every iterated integral comes from a double integral via Fubini's theorem.

11.5 Change of Variable in a Multiple Integral

11.5.1 Statement of the Problem and Heuristic Derivation 
of the Change of Variable Formula 


In our earlier study of the integral in the one-dimensional case, we obtained an 
important formula for change of variable in such an integral. Our problem now is to 
find a formula for change of variables in the general case. Let us make the question 
more precise. 

Let $D_x$ be a set in $\mathbb{R}^n$, $f$ a function that is integrable over $D_x$, and $\varphi : D_t \to D_x$ a mapping $t \mapsto \varphi(t)$ of a set $D_t \subset \mathbb{R}^n$ onto $D_x$. We seek a rule according to which, knowing $f$ and $\varphi$, we can find a function $\psi$ in $D_t$ such that the equality

$$\int_{D_x} f(x)\,dx = \int_{D_t} \psi(t)\,dt$$

holds, making it possible to reduce the computation of the integral over $D_x$ to the computation of an integral over $D_t$.

We begin by assuming that $D_t$ is an interval $I \subset \mathbb{R}^n$ and $\varphi : I \to D_x$ a diffeomorphism of this interval onto $D_x$. To every partition $P$ of the interval $I$ into intervals $I_1, I_2, \dots, I_k$ there corresponds a partition of $D_x$ into the sets $\varphi(I_i)$, $i = 1, \dots, k$. If all these sets are measurable and intersect pairwise only in sets of measure zero, then by the additivity of the integral we find

$$\int_{D_x} f(x)\,dx = \sum_{i=1}^{k} \int_{\varphi(I_i)} f(x)\,dx. \tag{11.8}$$


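If $f$ is continuous on $D_x$, then by the mean-value theorem

$$\int_{\varphi(I_i)} f(x)\,dx = f(\xi_i)\,\mu(\varphi(I_i)),$$

where $\xi_i \in \varphi(I_i)$. Since $f(\xi_i) = f(\varphi(\tau_i))$, where $\tau_i = \varphi^{-1}(\xi_i)$, we need only connect $\mu(\varphi(I_i))$ with $\mu(I_i)$.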

If $\varphi$ were a linear transformation, then $\varphi(I_i)$ would be a parallelepiped whose volume, as is known from analytic geometry, would be $|\det \varphi'|\,\mu(I_i)$. But a diffeomorphism is locally a nearly linear transformation, and so, if the dimensions of the intervals $I_i$ are sufficiently small, we may assume $\mu(\varphi(I_i)) \approx |\det \varphi'(\tau_i)|\,|I_i|$ with small relative error (it can be shown that for some choice of the point $\tau_i \in I_i$ actual equality will result). Thus

$$\sum_{i=1}^{k} \int_{\varphi(I_i)} f(x)\,dx \approx \sum_{i=1}^{k} f(\varphi(\tau_i))\,|\det \varphi'(\tau_i)|\,|I_i|. \tag{11.9}$$

But the right-hand side of this approximate equality contains a Riemann sum for the integral of the function $f(\varphi(t))\,|\det \varphi'(t)|$ over the interval $I$ corresponding to the partition $P$ of this interval with distinguished points $\tau$. In the limit as $\lambda(P) \to 0$ we obtain from (11.8) and (11.9) the relation

$$\int_{D_x} f(x)\,dx = \int_{D_t} f(\varphi(t))\,|\det \varphi'(t)|\,dt.$$
This is the desired formula together with an explanation of it. The route just 
followed in obtaining it can be traversed with complete rigor (and it is worthwhile 
to do so). However, in order to become acquainted with some new and useful general 
mathematical methods and facts and avoid purely technical work, we shall depart 
from this route slightly in the proof below. 
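The key heuristic step, that the measure of the image of a small interval is close to the Jacobian times the measure of the interval, can be observed numerically. The sketch below is an illustration only: the map `phi` is a hypothetical example chosen because its Jacobian is easy to write down, and the area of the image is estimated by the shoelace formula applied to a polygon through sampled boundary points.

```python
def phi(u, v):
    # Hypothetical smooth map used for illustration; phi' = [[1, v], [u, 1]].
    return (u + 0.5 * v * v, v + 0.5 * u * u)

def jacobian_det(u, v):
    return 1.0 - u * v

def image_area(u0, v0, h, samples=200):
    """Area of phi([u0, u0+h] x [v0, v0+h]) from its sampled boundary."""
    pts = []
    steps = [h * k / samples for k in range(samples)]
    pts += [phi(u0 + s, v0) for s in steps]          # bottom edge
    pts += [phi(u0 + h, v0 + s) for s in steps]      # right edge
    pts += [phi(u0 + h - s, v0 + h) for s in steps]  # top edge
    pts += [phi(u0, v0 + h - s) for s in steps]      # left edge
    twice_area = 0.0
    for (x1, y1), (x2, y2) in zip(pts, pts[1:] + pts[:1]):
        twice_area += x1 * y2 - x2 * y1              # shoelace formula
    return abs(twice_area) / 2

def relative_error(u0, v0, h):
    """Relative error of |det phi'(corner)| * h^2 as an estimate of the area."""
    exact = image_area(u0, v0, h)
    return abs(jacobian_det(u0, v0) * h * h - exact) / exact

for h in (0.1, 0.05, 0.025):
    print(h, relative_error(0.2, 0.2, h))
```

With the distinguished point at a corner the relative error shrinks roughly in proportion to $h$. For this particular map the Jacobian is bilinear in $(u, v)$, so taking the centre of the square as the distinguished point reproduces the area up to discretization error, which illustrates the parenthetical remark about a suitable choice of $\tau_i$.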

We now proceed to precise statements. We recall the following definition. 

Definition 1 The support of a function $f : D \to \mathbb{R}$ defined in a domain $D \subset \mathbb{R}^n$ is the closure in $D$ of the set of points $x \in D$ at which $f(x) \ne 0$.

In this section we shall study the situation when the integrand $f : D_x \to \mathbb{R}$ equals zero on the boundary of the domain $D_x$, more precisely, when the support of the function $f$ (denoted $\operatorname{supp} f$) is a compact set⁷ $K$ contained in $D_x$. The integrals of $f$ over $D_x$ and over $K$, if they exist, are equal, since the function equals zero in $D_x$ outside of $K$. From the point of view of mappings the condition $\operatorname{supp} f = K \subset D_x$ is equivalent to the statement that the change of variable $x = \varphi(t)$ is valid not only in the set $K$ over which one is essentially integrating, but also in some neighborhood $D_x$ of that set.

We now state what we intend to prove. 

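Theorem 1 If $\varphi : D_t \to D_x$ is a diffeomorphism of a bounded open set $D_t \subset \mathbb{R}^n$ onto a set $D_x = \varphi(D_t) \subset \mathbb{R}^n$ of the same type, $f \in \mathcal{R}(D_x)$, and $\operatorname{supp} f$ is a compact subset of $D_x$, then $f \circ \varphi\,|\det \varphi'| \in \mathcal{R}(D_t)$, and the following formula holds:

$$\int_{D_x = \varphi(D_t)} f(x)\,dx = \int_{D_t} f \circ \varphi(t)\,|\det \varphi'(t)|\,dt. \tag{11.10}$$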


11.5.2 Measurable Sets and Smooth Mappings 

Lemma 1 Let $\varphi : D_t \to D_x$ be a diffeomorphism of an open set $D_t \subset \mathbb{R}^n$ onto a set $D_x \subset \mathbb{R}^n$ of the same type. Then the following assertions hold.

a) If $E_t \subset D_t$ is a set of (Lebesgue) measure zero, its image $\varphi(E_t) \subset D_x$ is also a set of measure zero.

b) If a set $E_t$ contained in $D_t$ along with its closure $\overline{E}_t$ has Jordan measure zero, its image $\varphi(E_t) = E_x$ is contained in $D_x$ along with its closure and also has measure zero.

c) If a (Jordan) measurable set $E_t$ is contained in the domain $D_t$ along with its closure $\overline{E}_t$, its image $E_x = \varphi(E_t)$ is Jordan measurable and $\overline{E}_x \subset D_x$.

7 Such functions are naturally called functions of compact support in the domain.

Proof We begin by remarking that every open subset $D$ in $\mathbb{R}^n$ can be represented as the union of a countable number of closed intervals (no two of which have any interior points in common). To do this, for example, one can partition the coordinate axes into closed intervals of length $\Delta$ and consider the corresponding partition of $\mathbb{R}^n$ into cubes with sides of length $\Delta$. Fixing $\Delta = 1$, take the cubes of the partition contained in $D$. Denote their union by $F_1$. Then taking $\Delta = 1/2$, adjoin to $F_1$ the cubes of the new partition that are contained in $D \setminus F_1$. In that way we obtain a new set $F_2$, and so forth. Continuing this process, we obtain a sequence $F_1 \subset \cdots \subset F_n \subset \cdots$ of sets, each of which consists of a finite or countable number of intervals having no interior points in common, and as one can see from the construction, $\bigcup F_n = D$.
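The exhaustion of an open set by closed cubes with disjoint interiors can be tried out on a concrete set. The following sketch is an assumption-laden illustration rather than part of the proof: it takes $D$ to be the open disk of a given radius in $\mathbb{R}^2$, stops after a finite number of halvings (`max_level`), and uses hypothetical function names. A cube that already lies in $D$ is kept and never subdivided, which is exactly the rule "adjoin the cubes of the finer partition contained in $D \setminus F_1$".

```python
import math

def dyadic_cubes_in_disk(radius=1.0, max_level=6):
    """Closed squares (x, y, side) with disjoint interiors inside the open disk."""
    cubes = []
    r2 = radius * radius

    def inside(x, y, side):
        # The disk is convex, so a closed square lies in it iff its corners do.
        return all(cx * cx + cy * cy < r2
                   for cx in (x, x + side) for cy in (y, y + side))

    def meets(x, y, side):
        # Point of the square nearest to the origin (clamp 0 to the square).
        nx = min(max(0.0, x), x + side)
        ny = min(max(0.0, y), y + side)
        return nx * nx + ny * ny < r2

    def visit(x, y, side, level):
        if inside(x, y, side):
            cubes.append((x, y, side))
        elif level < max_level and meets(x, y, side):
            half = side / 2
            for dx in (0.0, half):
                for dy in (0.0, half):
                    visit(x + dx, y + dy, half, level + 1)

    n = math.ceil(radius)
    for i in range(-n, n):
        for j in range(-n, n):
            visit(float(i), float(j), 1.0, 0)   # cubes of side Delta = 1
    return cubes

def total_area(cubes):
    return sum(side * side for _, _, side in cubes)

for level in (2, 4, 6, 8):
    print(level, total_area(dyadic_cubes_in_disk(1.0, level)), math.pi)
```

The total area of the chosen cubes increases with the level and approaches $\pi$ from below, in line with $\bigcup F_n = D$.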

Since the union of an at most countable collection of sets of measure zero is a set of measure zero, it suffices to verify assertion a) for a set $E_t$ lying in a closed interval $I \subset D_t$. We shall now do this.

Since $\varphi \in C^{(1)}(I)$ (that is, $\varphi' \in C(I)$), there exists a constant $M$ such that $\|\varphi'(t)\| \le M$ on $I$. By the finite-increment theorem the relation $|x_2 - x_1| \le M\,|t_2 - t_1|$ must hold for every pair of points $t_1, t_2 \in I$ with images $x_1 = \varphi(t_1)$, $x_2 = \varphi(t_2)$.

Now let $\{I_i\}$ be a covering of $E_t$ by intervals such that $\sum_i |I_i| < \varepsilon$. Without loss of generality we may assume that $I_i = I_i \cap I \subset I$.

The collection $\{\varphi(I_i)\}$ of sets $\varphi(I_i)$ obviously forms a covering of $E_x = \varphi(E_t)$. If $t_i$ is the center of the interval $I_i$, then by the estimate just given for the possible change in distances under the mapping $\varphi$, the entire set $\varphi(I_i)$ can be covered by the interval $\tilde{I}_i$ with center $x_i = \varphi(t_i)$ whose linear dimensions are $M$ times those of the interval $I_i$. Since $|\tilde{I}_i| = M^n |I_i|$, and $\varphi(E_t) \subset \bigcup_i \tilde{I}_i$, we have obtained a covering of $\varphi(E_t) = E_x$ by intervals whose total volume is less than $M^n \varepsilon$. Assertion a) is now established.

Assertion b) follows from a) if we take into account the fact that $\overline{E}_t$ (and hence by what has been proved, $\overline{E}_x = \varphi(\overline{E}_t)$ also) is a set of Lebesgue measure zero and that $\overline{E}_t$ (and hence also $\overline{E}_x$) is a compact set. Indeed, by Lemma 3 of Sect. 11.1 every compact set that is of Lebesgue measure zero also has Jordan measure zero.

Finally, assertion c) is an immediate consequence of b), if we recall the definition of a measurable set and the fact that interior points of $E_t$ map to interior points of its image $E_x = \varphi(E_t)$ under a diffeomorphism, so that $\partial E_x = \varphi(\partial E_t)$. □

Corollary Under the hypotheses of the theorem the integral on the right-hand side of formula (11.10) exists.

Proof Since $|\det \varphi'(t)| \ne 0$ in $D_t$, it follows that $\operatorname{supp}(f \circ \varphi \cdot |\det \varphi'|) = \operatorname{supp}(f \circ \varphi) = \varphi^{-1}(\operatorname{supp} f)$ is a compact subset in $D_t$. Hence the points at which the function $f \circ \varphi \cdot |\det \varphi'|\,\chi_{D_t}$ in $\mathbb{R}^n$ is discontinuous have nothing to do with the function $\chi_{D_t}$, but are the pre-images of points of discontinuity of $f$ in $D_x$. But $f \in \mathcal{R}(D_x)$, and therefore the set $E_x$ of points of discontinuity of $f$ in $D_x$ is a set of Lebesgue measure zero. But then by assertion a) of the lemma the set $E_t = \varphi^{-1}(E_x)$ has measure zero. By Lebesgue's criterion, we can now conclude that $f \circ \varphi \cdot |\det \varphi'|\,\chi_{D_t}$ is integrable on any interval $I_t \supset D_t$. □


11.5.3 The One-Dimensional Case

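Lemma 2 a) If $\varphi : I_t \to I_x$ is a diffeomorphism of a closed interval $I_t \subset \mathbb{R}^1$ onto a closed interval $I_x \subset \mathbb{R}^1$ and $f \in \mathcal{R}(I_x)$, then $f \circ \varphi \cdot |\varphi'| \in \mathcal{R}(I_t)$ and

$$\int_{I_x} f(x)\,dx = \int_{I_t} (f \circ \varphi \cdot |\varphi'|)(t)\,dt. \tag{11.11}$$

b) Formula (11.10) holds in $\mathbb{R}^1$.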

Proof Although we essentially already know assertion a) of this lemma, we shall use the Lebesgue criterion for the existence of an integral, which is now at our disposal, to give a short proof here that is independent of the proof given in Part 1.

Since $f \in \mathcal{R}(I_x)$ and $\varphi : I_t \to I_x$ is a diffeomorphism, the function $f \circ \varphi\,|\varphi'|$ is bounded on $I_t$. Only the pre-images of points of discontinuity of $f$ on $I_x$ can be discontinuities of the function $f \circ \varphi\,|\varphi'|$. By Lebesgue's criterion, the points of discontinuity of $f$ form a set of measure zero. The image of this set under the diffeomorphism $\varphi^{-1} : I_x \to I_t$, as we saw in the proof of Lemma 1, has measure zero. Therefore $f \circ \varphi\,|\varphi'| \in \mathcal{R}(I_t)$.

Now let $P_x$ be a partition of the closed interval $I_x$. Through the mapping $\varphi^{-1}$ it induces a partition $P_t$ of the closed interval $I_t$, and it follows from the uniform continuity of the mappings $\varphi$ and $\varphi^{-1}$ that $\lambda(P_x) \to 0 \Leftrightarrow \lambda(P_t) \to 0$. We now write the Riemann sums for the partitions $P_x$ and $P_t$ with distinguished points $\xi_i = \varphi(\tau_i)$:

$$\sum_i f(\xi_i)\,|x_i - x_{i-1}| = \sum_i f \circ \varphi(\tau_i)\,|\varphi(t_i) - \varphi(t_{i-1})| = \sum_i f \circ \varphi(\tau_i)\,|\varphi'(\tau_i)|\,|t_i - t_{i-1}|,$$

and the points $\xi_i$ can be assumed chosen just so that $\xi_i = \varphi(\tau_i)$, where $\tau_i$ is the point obtained by applying the mean-value theorem to the difference $\varphi(t_i) - \varphi(t_{i-1})$.

Since both integrals in (11.11) exist, the choice of the distinguished points in the Riemann sums can be made to suit our convenience without affecting the limit. Hence from the equalities just written for the Riemann sums, we find (11.11) for the integrals in the limit as $\lambda(P_x) \to 0$ ($\lambda(P_t) \to 0$).

Assertion b) of Lemma 2 follows from Eq. (11.11). We first note that in the one-dimensional case $|\det \varphi'| = |\varphi'|$. Next, the compact set $\operatorname{supp} f$ can easily be covered by a finite system of closed intervals contained in $D_x$, no two of which have common interior points. The integral of $f$ over $D_x$ then reduces to the sum of the integrals of $f$ over the intervals of this system, and the integral of $f \circ \varphi\,|\varphi'|$ over $D_t$ reduces to the sum of the integrals over the intervals that are the pre-images of the intervals in this system. Applying Eq. (11.11) to each pair of intervals that correspond under the mapping $\varphi$ and then adding, we obtain (11.10). □
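Formula (11.11) is easy to test numerically in one dimension. The sketch below is only an illustration under stated assumptions: the integrand $f = \cos$ and the decreasing diffeomorphism $\varphi(t) = e^{-t}$ of $[0, 1]$ onto $[e^{-1}, 1]$ are arbitrary choices, and the integrals are approximated by a plain midpoint rule.

```python
import math

def midpoint_integral(g, a, b, n=20000):
    """Midpoint-rule approximation of the integral of g over [a, b]."""
    h = (b - a) / n
    return h * sum(g(a + (i + 0.5) * h) for i in range(n))

f = math.cos

def phi(t):
    return math.exp(-t)        # decreasing diffeomorphism of [0, 1] onto [1/e, 1]

def dphi(t):
    return -math.exp(-t)       # phi'(t) < 0 everywhere

# Integral of f over the interval I_x = [1/e, 1].
lhs = midpoint_integral(f, math.exp(-1), 1.0)
# Integral of (f o phi)|phi'| over I_t = [0, 1].
rhs = midpoint_integral(lambda t: f(phi(t)) * abs(dphi(t)), 0.0, 1.0)
# The same integrand with phi' instead of |phi'|: the sign flips.
signed = midpoint_integral(lambda t: f(phi(t)) * dphi(t), 0.0, 1.0)

print(lhs, rhs, signed)
```

The first two numbers agree, while the third is their negative: with $\varphi'$ in place of $|\varphi'|$ one obtains the oriented integral from $\varphi(0) = 1$ to $\varphi(1) = e^{-1}$.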

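Remark 1 The formula for change of variable that we proved previously had the form

$$\int_{\varphi(\alpha)}^{\varphi(\beta)} f(x)\,dx = \int_{\alpha}^{\beta} \bigl((f \circ \varphi) \cdot \varphi'\bigr)(t)\,dt, \tag{11.12}$$

where $\varphi$ was any smooth mapping of the closed interval $[\alpha, \beta]$ onto the interval with endpoints $\varphi(\alpha)$ and $\varphi(\beta)$. Formula (11.12) contains the derivative $\varphi'$ itself rather than its absolute value $|\varphi'|$. The reason is that on the left-hand side it is possible that $\varphi(\beta) < \varphi(\alpha)$.

However, if we observe that for the closed interval $I$ with endpoints $a$ and $b$ the relations

$$\int_I f(x)\,dx = \begin{cases} \displaystyle\int_a^b f(x)\,dx, & \text{if } a \le b, \\[2ex] \displaystyle -\int_a^b f(x)\,dx, & \text{if } a > b, \end{cases}$$

hold, it becomes clear that when $\varphi$ is a diffeomorphism formulas (11.11) and (11.12) differ only in appearance; in essence they are the same.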

Remark 2 It is interesting to note (and we shall certainly make use of this observation) that if $\varphi : I_t \to I_x$ is a diffeomorphism of closed intervals, then the formulas

$$\overline{\int_{I_x}} f(x)\,dx = \overline{\int_{I_t}} (f \circ \varphi\,|\varphi'|)(t)\,dt, \qquad \underline{\int_{I_x}} f(x)\,dx = \underline{\int_{I_t}} (f \circ \varphi\,|\varphi'|)(t)\,dt$$

for the upper and lower integrals of real-valued functions are always valid.

Given that fact, we may take as established that in the one-dimensional case formula (11.10) remains valid for any bounded function $f$ if the integrals in it are understood as upper or lower Darboux integrals.

Proof We shall assume temporarily that $f$ is a nonnegative function bounded by a constant $M$.

Again, as in the proof of assertion a) of Lemma 2, one may take partitions $P_x$ and $P_t$ of the intervals $I_x$ and $I_t$ respectively that correspond to each other under the mapping $\varphi$ and write the following estimates, in which $\varepsilon$ is the maximum oscillation of $\varphi'$ on intervals of the partition $P_t$:

$$\begin{aligned}
\sum_i \sup_{x \in \Delta x_i} f(x)\,|x_i - x_{i-1}|
&\le \sum_i \sup_{t \in \Delta t_i} f(\varphi(t)) \sup_{t \in \Delta t_i} |\varphi'(t)|\,|t_i - t_{i-1}| \le \\
&\le \sum_i \sup_{t \in \Delta t_i} \Bigl( f(\varphi(t)) \cdot \sup_{t \in \Delta t_i} |\varphi'(t)| \Bigr) |\Delta t_i| \le \\
&\le \sum_i \sup_{t \in \Delta t_i} \bigl( f(\varphi(t)) (|\varphi'(t)| + \varepsilon) \bigr) |\Delta t_i| \le \\
&\le \sum_i \sup_{t \in \Delta t_i} \bigl( f(\varphi(t))\,|\varphi'(t)| \bigr) |\Delta t_i| + \varepsilon \sum_i \sup_{t \in \Delta t_i} f(\varphi(t))\,|\Delta t_i| \le \\
&\le \sum_i \sup_{t \in \Delta t_i} \bigl( f(\varphi(t))\,|\varphi'(t)| \bigr) |\Delta t_i| + \varepsilon M |I_t|.
\end{aligned}$$

Taking account of the uniform continuity of $\varphi'$ we obtain from this the relation

$$\overline{\int_{I_x}} f(x)\,dx \le \overline{\int_{I_t}} (f \circ \varphi\,|\varphi'|)(t)\,dt$$

as $\lambda(P_t) \to 0$. Applying what has just been proved to the mapping $\varphi^{-1}$ and the function $f \circ \varphi\,|\varphi'|$, we obtain the opposite inequality, and thereby establish the first equality in Remark 2 for a nonnegative function. But since any function can be written as $f = \max\{f, 0\} - \max\{-f, 0\}$ (a difference of two nonnegative functions) the equality can be considered to be established in general. The second equality is verified similarly. □

From the equalities just proved one can of course obtain once again assertion a) of Lemma 2 for real-valued functions $f$.


11.5.4 The Case of an Elementary Diffeomorphism in $\mathbb{R}^n$

Let $\varphi : D_t \to D_x$ be a diffeomorphism of a domain $D_t \subset \mathbb{R}^n$ onto a domain $D_x \subset \mathbb{R}^n$ with $(t^1, \dots, t^n)$ and $(x^1, \dots, x^n)$ the coordinates of points $t \in \mathbb{R}^n$ and $x \in \mathbb{R}^n$ respectively. We recall the following definition.

Definition 2 The diffeomorphism $\varphi : D_t \to D_x$ is elementary if its coordinate representation has the form

$$\begin{aligned}
x^1 &= \varphi^1(t^1, \dots, t^n) = t^1, \\
&\;\;\vdots \\
x^{k-1} &= \varphi^{k-1}(t^1, \dots, t^n) = t^{k-1}, \\
x^k &= \varphi^k(t^1, \dots, t^n) = \varphi^k(t^1, \dots, t^k, \dots, t^n), \\
x^{k+1} &= \varphi^{k+1}(t^1, \dots, t^n) = t^{k+1}, \\
&\;\;\vdots \\
x^n &= \varphi^n(t^1, \dots, t^n) = t^n.
\end{aligned}$$

Thus only one coordinate is changed under an elementary diffeomorphism (the $k$th coordinate in this case).


Lemma 3 Formula (11.10) holds for an elementary diffeomorphism. 


Proof Up to a relabeling of coordinates we may assume that we are considering a diffeomorphism $\varphi$ that changes only the $n$th coordinate. For convenience we introduce the following notation:

$$(x^1, \dots, x^{n-1}, x^n) =: (\tilde{x}, x^n); \qquad (t^1, \dots, t^{n-1}, t^n) =: (\tilde{t}, t^n);$$

$$D_{x^n}(\tilde{x}_0) := \{ (\tilde{x}, x^n) \in D_x \mid \tilde{x} = \tilde{x}_0 \};$$

$$D_{t^n}(\tilde{t}_0) := \{ (\tilde{t}, t^n) \in D_t \mid \tilde{t} = \tilde{t}_0 \}.$$

Thus $D_{x^n}(\tilde{x})$ and $D_{t^n}(\tilde{t})$ are simply the one-dimensional sections of the sets $D_x$ and $D_t$ respectively by lines parallel to the $n$th coordinate axis. Let $I_x$ be an interval in $\mathbb{R}^n$ containing $D_x$. We represent $I_x$ as the direct product $I_x = I_{\tilde{x}} \times I_{x^n}$ of an $(n-1)$-dimensional interval $I_{\tilde{x}}$ and a closed interval $I_{x^n}$ of the $n$th coordinate axis. We give a similar representation $I_t = I_{\tilde{t}} \times I_{t^n}$ for a fixed interval $I_t$ in $\mathbb{R}^n$ containing $D_t$.

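Using the definition of the integral over a set, Fubini's theorem, and Remark 2, we can write

$$\begin{aligned}
\int_{D_x} f(x)\,dx &= \int_{I_x} f \cdot \chi_{D_x}(x)\,dx = \int_{I_{\tilde{x}}} d\tilde{x} \int_{I_{x^n}} f \cdot \chi_{D_x}(\tilde{x}, x^n)\,dx^n = \\
&= \int_{I_{\tilde{x}}} d\tilde{x} \int_{D_{x^n}(\tilde{x})} f(\tilde{x}, x^n)\,dx^n = \\
&= \int_{I_{\tilde{t}}} d\tilde{t} \int_{D_{t^n}(\tilde{t})} f\bigl(\tilde{t}, \varphi^n(\tilde{t}, t^n)\bigr) \left| \frac{\partial \varphi^n}{\partial t^n} \right| (\tilde{t}, t^n)\,dt^n = \\
&= \int_{I_{\tilde{t}}} d\tilde{t} \int_{I_{t^n}} \bigl( f \circ \varphi\,|\det \varphi'|\,\chi_{D_t} \bigr)(\tilde{t}, t^n)\,dt^n = \\
&= \int_{I_t} \bigl( f \circ \varphi\,|\det \varphi'|\,\chi_{D_t} \bigr)(t)\,dt = \int_{D_t} \bigl( f \circ \varphi\,|\det \varphi'| \bigr)(t)\,dt.
\end{aligned}$$

In this computation we have used the fact that $\det \varphi' = \frac{\partial \varphi^n}{\partial t^n}$ for the diffeomorphism under consideration. □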


11.5.5 Composite Mappings and the Formula for Change 
of Variable 


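Lemma 4 If $D_\tau \xrightarrow{\ \psi\ } D_t \xrightarrow{\ \varphi\ } D_x$ are two diffeomorphisms for each of which formula (11.10) for change of variable in the integral holds, then it holds also for the composition $\varphi \circ \psi : D_\tau \to D_x$ of these mappings.

Proof It suffices to recall that $(\varphi \circ \psi)' = \varphi' \circ \psi'$ and that $\det (\varphi \circ \psi)'(\tau) = \det \varphi'(t)\,\det \psi'(\tau)$, where $t = \psi(\tau)$. We then have

$$\begin{aligned}
\int_{D_x} f(x)\,dx &= \int_{D_t} \bigl( f \circ \varphi\,|\det \varphi'| \bigr)(t)\,dt = \\
&= \int_{D_\tau} \bigl( (f \circ \varphi \circ \psi)\,|\det \varphi' \circ \psi|\,|\det \psi'| \bigr)(\tau)\,d\tau = \\
&= \int_{D_\tau} \bigl( f \circ (\varphi \circ \psi)\,|\det (\varphi \circ \psi)'| \bigr)(\tau)\,d\tau. \qquad \square
\end{aligned}$$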
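The identity $\det(\varphi \circ \psi)'(\tau) = \det \varphi'(t)\,\det \psi'(\tau)$ with $t = \psi(\tau)$, on which the proof of Lemma 4 rests, can be checked with finite differences. In this sketch both maps are hypothetical examples (polar coordinates for $\psi$ and a polynomial perturbation of the identity for $\varphi$), and the Jacobian matrices are approximated by central differences, so the agreement is only up to discretization and rounding error.

```python
import math

def jacobian(F, p, h=1e-6):
    """Central-difference Jacobian matrix of F: R^2 -> R^2 at the point p."""
    columns = []
    for j in range(2):
        plus, minus = list(p), list(p)
        plus[j] += h
        minus[j] -= h
        fp, fm = F(*plus), F(*minus)
        columns.append([(fp[i] - fm[i]) / (2 * h) for i in range(2)])
    # columns[j][i] holds dF_i/dp_j; return the matrix with rows indexed by i.
    return [[columns[j][i] for j in range(2)] for i in range(2)]

def det2(m):
    return m[0][0] * m[1][1] - m[0][1] * m[1][0]

def psi(r, theta):                     # hypothetical inner map (polar coordinates)
    return (r * math.cos(theta), r * math.sin(theta))

def phi(x, y):                         # hypothetical outer map
    return (x + 0.5 * y * y, y + 0.5 * x * x)

def composite(r, theta):
    return phi(*psi(r, theta))

tau = (0.5, 0.7)
t = psi(*tau)
lhs = det2(jacobian(composite, tau))
rhs = det2(jacobian(phi, t)) * det2(jacobian(psi, tau))
print(lhs, rhs)
```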


11.5.6 Additivity of the Integral and Completion of the Proof 
of the Formula for Change of Variable in an Integral 

Lemmas 3 and 4 suggest that we might use the local decomposition of any diffeomorphism as a composition of elementary diffeomorphisms (see Proposition 2 from Sect. 8.6.4 of Part 1) and thereby obtain the formula (11.10) in the general case.

There are various ways of reducing the integral over a set to integrals over small neighborhoods of its points. For example, one may use the additivity of the integral. That is the procedure we shall use. On the basis of Lemmas 1, 3, and 4 we now carry out the proof of Theorem 1 on change of variable in a multiple integral.

Proof For each point $t$ of the compact set $K_t = \operatorname{supp}((f \circ \varphi)\,|\det \varphi'|) \subset D_t$ we construct a $\delta(t)$-neighborhood $U(t)$ of it in which the diffeomorphism $\varphi$ decomposes into a composition of elementary diffeomorphisms. From the $\frac{\delta(t)}{2}$-neighborhoods $\tilde{U}(t) \subset U(t)$ of the points $t \in K_t$ we choose a finite covering $\tilde{U}(t_1), \dots, \tilde{U}(t_k)$ of the compact set $K_t$. Let $\delta = \frac{1}{2} \min\{\delta(t_1), \dots, \delta(t_k)\}$. Then the closure of any set whose diameter is smaller than $\delta$ and which intersects $K_t$ must be contained in at least one of the neighborhoods $U(t_1), \dots, U(t_k)$.

Now let $I$ be an interval containing the set $D_t$ and $P$ a partition of the interval $I$ such that $\lambda(P) < \min\{\delta, d\}$, where $\delta$ was found above and $d$ is the distance from $K_t$ to the boundary of $D_t$. Let $\{I_i\}$ be the intervals of the partition $P$ that have a nonempty intersection with $K_t$. It is clear that each such interval $I_i$ satisfies $I_i \subset D_t$ and

$$\int_{D_t} \bigl( f \circ \varphi\,|\det \varphi'| \bigr)(t)\,dt = \int_{I} \bigl( (f \circ \varphi\,|\det \varphi'|)\,\chi_{D_t} \bigr)(t)\,dt = \sum_i \int_{I_i} \bigl( f \circ \varphi\,|\det \varphi'| \bigr)(t)\,dt. \tag{11.13}$$


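By Lemma 1 the image $E_i = \varphi(I_i)$ of each interval $I_i$ is a measurable set. Then the set $E = \bigcup_i E_i$ is also measurable and $\operatorname{supp} f \subset E = \overline{E} \subset D_x$. Using the additivity of the integral, we deduce from this that

$$\begin{aligned}
\int_{D_x} f(x)\,dx &= \int_{I_x \supset D_x} f \chi_{D_x}(x)\,dx = \int_{I_x \setminus E} f \chi_{D_x}(x)\,dx + \int_{E} f \chi_{D_x}(x)\,dx = \\
&= \int_{E} f \chi_{D_x}(x)\,dx = \int_{E} f(x)\,dx = \sum_i \int_{E_i} f(x)\,dx.
\end{aligned} \tag{11.14}$$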


By construction every interval $I_i$ of this collection is contained in some neighborhood $U(t_j)$ inside which the diffeomorphism $\varphi$ decomposes into a composition of elementary diffeomorphisms. Hence on the basis of Lemmas 3 and 4 we can write

$$\int_{E_i} f(x)\,dx = \int_{I_i} \bigl( f \circ \varphi\,|\det \varphi'| \bigr)(t)\,dt. \tag{11.15}$$

Comparing relations (11.13), (11.14), and (11.15), we obtain formula (11.10). □


11.5.7 Corollaries and Generalizations of the Formula for Change 
of Variable in a Multiple Integral 

a. Change of Variable Under Mappings of Measurable Sets 

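Proposition 1 Let $\varphi : D_t \to D_x$ be a diffeomorphism of a bounded open set $D_t \subset \mathbb{R}^n$ onto a set $D_x \subset \mathbb{R}^n$ of the same type; let $E_t$ and $E_x$ be subsets of $D_t$ and $D_x$ respectively and such that $\overline{E}_t \subset D_t$, $\overline{E}_x \subset D_x$, and $E_x = \varphi(E_t)$. If $f \in \mathcal{R}(E_x)$, then $f \circ \varphi\,|\det \varphi'| \in \mathcal{R}(E_t)$, and the following equality holds:

$$\int_{E_x} f(x)\,dx = \int_{E_t} \bigl( f \circ \varphi\,|\det \varphi'| \bigr)(t)\,dt. \tag{11.16}$$

Proof Indeed,

$$\begin{aligned}
\int_{E_x} f(x)\,dx &= \int_{D_x} (f \chi_{E_x})(x)\,dx = \int_{D_t} \bigl( ((f \chi_{E_x}) \circ \varphi)\,|\det \varphi'| \bigr)(t)\,dt = \\
&= \int_{D_t} \bigl( (f \circ \varphi)\,|\det \varphi'|\,\chi_{E_t} \bigr)(t)\,dt = \int_{E_t} \bigl( (f \circ \varphi)\,|\det \varphi'| \bigr)(t)\,dt.
\end{aligned}$$

In this computation we have used the definition of the integral over a set, formula (11.10), and the fact that $\chi_{E_t} = \chi_{E_x} \circ \varphi$. □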


b. Invariance of the Integral 


We recall that the integral of a function $f : E \to \mathbb{R}$ over a set $E$ reduces to computing the integral of the function $f \chi_E$ over an interval $I \supset E$. But the interval $I$ itself was by definition connected with a Cartesian coordinate system in $\mathbb{R}^n$. We can now prove that all Cartesian systems lead to the same integral.

Proposition 2 The value of the integral of a function $f$ over a set $E \subset \mathbb{R}^n$ is independent of the choice of Cartesian coordinate system in $\mathbb{R}^n$.


Proof In fact the transition from one Cartesian coordinate system in $\mathbb{R}^n$ to another Cartesian system has a Jacobian constantly equal to 1 in absolute value. By Proposition 1 this implies the equality

$$\int_{E_x} f(x)\,dx = \int_{E_t} (f \circ \varphi)(t)\,dt.$$

But this means that the integral is invariantly defined: if $p$ is a point of $E$ having coordinates $x = (x^1, \dots, x^n)$ in the first system and $t = (t^1, \dots, t^n)$ in the second, and $x = \varphi(t)$ is the transition function from one system to the other, then

$$f(p) = f_x(x^1, \dots, x^n) = f_t(t^1, \dots, t^n),$$

where $f_t = f_x \circ \varphi$. Hence we have shown that

$$\int_{E_x} f_x(x)\,dx = \int_{E_t} f_t(t)\,dt,$$

where $E_x$ and $E_t$ denote the set $E$ in the $x$ and $t$ coordinates respectively. □


We can conclude from Proposition 2 and Definition 3 of Sect. 11.2 for the (Jordan) measure of a set $E \subset \mathbb{R}^n$ that this measure is independent of the Cartesian coordinate system in $\mathbb{R}^n$, or, what is the same, that Jordan measure is invariant under the group of rigid Euclidean motions in $\mathbb{R}^n$.

c. Negligible Sets


The changes of variable or formulas for transforming coordinates used in practice 
sometimes have various singularities (for example, one-to-oneness may fail in some 
places, or the Jacobian may vanish, or differentiability may fail). As a rule, these 
singularities occur on a set of measure zero and so, to meet the demands of practice, 
the following theorem is very useful. 


Theorem 2 Let $\varphi : D_t \to D_x$ be a mapping of a (Jordan) measurable set $D_t \subset \mathbb{R}^n$ onto a set $D_x \subset \mathbb{R}^n$ of the same type. Suppose that there are subsets $S_t$ and $S_x$ of $D_t$ and $D_x$ respectively having (Lebesgue) measure zero and such that $D_t \setminus S_t$ and $D_x \setminus S_x$ are open sets and $\varphi$ maps the former diffeomorphically onto the latter and with a bounded Jacobian. Then for any function $f \in \mathcal{R}(D_x)$ the function $(f \circ \varphi)\,|\det \varphi'|$ also belongs to $\mathcal{R}(D_t \setminus S_t)$ and

$$\int_{D_x} f(x)\,dx = \int_{D_t \setminus S_t} \bigl( (f \circ \varphi)\,|\det \varphi'| \bigr)(t)\,dt. \tag{11.17}$$

If in addition, the quantity $|\det \varphi'|$ is defined and bounded in $D_t$, then

$$\int_{D_x} f(x)\,dx = \int_{D_t} \bigl( (f \circ \varphi)\,|\det \varphi'| \bigr)(t)\,dt. \tag{11.18}$$

Proof By Lebesgue's criterion the function $f$ can have discontinuities in $D_x$ and hence also in $D_x \setminus S_x$ only on a set of measure zero. By Lemma 1, the image of this set of discontinuities under the mapping $\varphi^{-1} : D_x \setminus S_x \to D_t \setminus S_t$ is a set of measure zero in $D_t \setminus S_t$. Thus the relation $(f \circ \varphi)\,|\det \varphi'| \in \mathcal{R}(D_t \setminus S_t)$ will follow immediately from Lebesgue's criterion for integrability if we establish that the set $D_t \setminus S_t$ is measurable. The fact that this is indeed a Jordan measurable set will be a by-product of the reasoning below.

By hypothesis $D_x \setminus S_x$ is an open set, so that $(D_x \setminus S_x) \cap \partial S_x = \varnothing$. Hence $\partial S_x \subset \partial D_x \cup S_x$ and consequently $\partial D_x \cup \overline{S}_x = \partial D_x \cup S_x$, where $\overline{S}_x = S_x \cup \partial S_x$ is the closure of $S_x$ in $\mathbb{R}^n$. As a result, $\partial D_x \cup S_x$ is a closed bounded set, that is, it is compact in $\mathbb{R}^n$, and, being the union of two sets of measure zero, is itself of Lebesgue measure zero. From Lemma 3 of Sect. 11.1 we know that then the set $\partial D_x \cup S_x$ (and along with it, $S_x$) has Jordan measure zero, that is, for every $\varepsilon > 0$ there exists a finite covering $I_1, \dots, I_k$ of this set by intervals such that $\sum_{i=1}^{k} |I_i| < \varepsilon$. Hence it follows, in particular, that the set $D_x \setminus S_x$ (and similarly the set $D_t \setminus S_t$) is Jordan measurable: indeed, $\partial (D_x \setminus S_x) \subset \partial D_x \cup \partial S_x \subset \partial D_x \cup S_x$.

The covering $I_1, \dots, I_k$ can obviously also be chosen so that every point $x \in \partial D_x \setminus S_x$ is an interior point of at least one of the intervals of the covering. Let $U_x = \bigcup_{i=1}^{k} I_i$. The set $U_x$ is measurable, as is $V_x = D_x \setminus U_x$. By construction the set $V_x$ is such that $\overline{V}_x \subset D_x \setminus S_x$ and for every measurable set $E_x \subset D_x$ containing the compact set $\overline{V}_x$ we have the estimate

$$\left| \int_{D_x} f(x)\,dx - \int_{E_x} f(x)\,dx \right| = \left| \int_{D_x \setminus E_x} f(x)\,dx \right| \le M \mu(D_x \setminus E_x) < M \cdot \varepsilon, \tag{11.19}$$

where $M = \sup_{x \in D_x} |f(x)|$.

The pre-image $\overline{V}_t = \varphi^{-1}(\overline{V}_x)$ of the compact set $\overline{V}_x$ is a compact subset of $D_t \setminus S_t$. Reasoning as above, we can construct a measurable compact set $W_t$ subject to the conditions $\overline{V}_t \subset W_t \subset D_t \setminus S_t$ and having the property that the estimate

$$\left| \int_{D_t \setminus S_t} \bigl( (f \circ \varphi)\,|\det \varphi'| \bigr)(t)\,dt - \int_{E_t} \bigl( (f \circ \varphi)\,|\det \varphi'| \bigr)(t)\,dt \right| < \varepsilon \tag{11.20}$$

holds for every measurable set $E_t$ such that $W_t \subset E_t \subset D_t \setminus S_t$.

Now let $E_x = \varphi(E_t)$. Formula (11.16) holds for the sets $E_x \subset D_x \setminus S_x$ and $E_t \subset D_t \setminus S_t$ by Lemma 1. Comparing relations (11.16), (11.19), and (11.20) and taking account of the arbitrariness of the quantity $\varepsilon > 0$, we obtain (11.17).

We now prove the last assertion of Theorem 2. If the function $(f \circ \varphi)\,|\det \varphi'|$ is defined on the entire set $D_t$, then, since $D_t \setminus S_t$ is open in $\mathbb{R}^n$, the entire set of discontinuities of this function in $D_t$ consists of the set $A$ of points of discontinuity of $(f \circ \varphi)\,|\det \varphi'|\,\big|_{D_t \setminus S_t}$ (the restriction of the original function to $D_t \setminus S_t$) and perhaps a subset $B$ of $S_t \cup \partial D_t$.

As we have seen, the set $A$ is a set of Lebesgue measure zero (since the integral on the right-hand side of (11.17) exists), and since $S_t \cup \partial D_t$ has measure zero, the same can be said of $B$. Hence it suffices to know that the function $(f \circ \varphi)\,|\det \varphi'|$ is bounded on $D_t$; it will then follow from the Lebesgue criterion that it is integrable over $D_t$. But $|f \circ \varphi|(t) \le M$ on $D_t$, so that the function $(f \circ \varphi)\,|\det \varphi'|$ is bounded on $S_t$, given that the function $|\det \varphi'|$ is bounded on $S_t$ by hypothesis. As for the set $D_t \setminus S_t$, the function $(f \circ \varphi)\,|\det \varphi'|$ is integrable over it and hence bounded. Thus, the function $(f \circ \varphi)\,|\det \varphi'|$ is integrable over $D_t$. But the sets $D_t$ and $D_t \setminus S_t$ differ only by the measurable set $S_t$, whose measure, as has been shown, is zero. Therefore, by the additivity of the integral and the fact that the integral over $S_t$ is zero, we can conclude that the right-hand sides of (11.17) and (11.18) are indeed equal in this case. □


Example The mapping of the rectangle $I = \{(r, \varphi) \in \mathbb{R}^2 \mid 0 \le r \le R \wedge 0 \le \varphi \le 2\pi\}$ onto the disk $K = \{(x, y) \in \mathbb{R}^2 \mid x^2 + y^2 \le R^2\}$ given by the formulas

$$x = r \cos \varphi, \qquad y = r \sin \varphi, \tag{11.21}$$

is not a diffeomorphism: the entire side of the rectangle $I$ on which $r = 0$ maps to the single point $(0, 0)$ under this mapping; the images of the points $(r, 0)$ and $(r, 2\pi)$ are the same. However, if we consider, for example, the sets $I \setminus \partial I$ and $K \setminus E$, where $E$ is the union of the boundary $\partial K$ of the disk $K$ and the radius ending at $(R, 0)$, then the restriction of the mapping (11.21) to the domain $I \setminus \partial I$ turns out to be a diffeomorphism of it onto the domain $K \setminus E$. Hence by Theorem 2, for any function $f \in \mathcal{R}(K)$ we can write

$$\iint_K f(x, y)\,dx\,dy = \iint_I f(r \cos \varphi, r \sin \varphi)\, r\,dr\,d\varphi$$

and, applying Fubini's theorem,

$$\iint_K f(x, y)\,dx\,dy = \int_0^{2\pi} d\varphi \int_0^R f(r \cos \varphi, r \sin \varphi)\, r\,dr.$$

Relations (11.21) are the well-known formulas for transition from polar coordinates to Cartesian coordinates in the plane.

What has been said can naturally be developed and extended to the polar (spherical) coordinates in $\mathbb{R}^n$ that we studied in Part 1, where we also exhibited the Jacobian of the transition from polar coordinates to Cartesian coordinates in a space $\mathbb{R}^n$ of any dimension.
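The iterated integral in polar coordinates obtained in the example can be approximated directly. The sketch below assumes a midpoint rule on a uniform grid in $(r, \varphi)$; the function name and the test integrand $f(x, y) = x^2 + y^2$, whose integral over the disk of radius $R$ is $\pi R^4 / 2$, are illustrative choices only.

```python
import math

def polar_integral(f, R, nr=400, nphi=400):
    """Midpoint-rule approximation of the integral of f over the disk of
    radius R, computed as an integral of f(r cos a, r sin a) * r over
    the rectangle 0 <= r <= R, 0 <= a <= 2*pi."""
    dr = R / nr
    da = 2 * math.pi / nphi
    total = 0.0
    for i in range(nr):
        r = (i + 0.5) * dr
        for j in range(nphi):
            a = (j + 0.5) * da
            total += f(r * math.cos(a), r * math.sin(a)) * r  # Jacobian factor r
    return total * dr * da

area = polar_integral(lambda x, y: 1.0, 2.0)               # close to 4*pi
moment = polar_integral(lambda x, y: x * x + y * y, 1.0)   # close to pi/2
print(area, 4 * math.pi)
print(moment, math.pi / 2)
```

Dropping the factor `r` from the sum would give the area of the rectangle $I$ rather than that of the disk, which is a quick way to see why the Jacobian cannot be omitted.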


11.5.8 Problems and Exercises 

1. a) Show that Lemma 1 is valid for any smooth mapping $\varphi : D_t \to D_x$ (also see Problem 8 below in this connection).

b) Prove that if $D$ is an open set in $\mathbb{R}^m$ and $\varphi \in C^{(1)}(D, \mathbb{R}^n)$, then $\varphi(D)$ is a set of measure zero in $\mathbb{R}^n$ when $m < n$.

2. a) Verify that the measure of a measurable set $E$ and the measure of its image $\varphi(E)$ under a diffeomorphism $\varphi$ are connected by the relation $\mu(\varphi(E)) = \theta \mu(E)$, where $\theta \in \bigl[\inf_{t \in E} |\det \varphi'(t)|, \sup_{t \in E} |\det \varphi'(t)|\bigr]$.

b) In particular, if $E$ is a connected set, there is a point $\tau \in E$ such that $\mu(\varphi(E)) = |\det \varphi'(\tau)|\,\mu(E)$.

3. a) Show that if formula (11.10) holds for the function $f \equiv 1$, then it holds in general.

b) Carry out the proof of Theorem 1 again, but for the special case $f \equiv 1$, simplifying it for this special situation.

4. Without using Remark 2, carry out the proof of Lemma 3, assuming Lemma 2 is known and that two integrable functions that differ only on a set of measure zero have the same integral.

5. Instead of the additivity of the integral and the accompanying analysis of the measurability of sets, one can use another device for localization when reducing formula (11.10) to its local version (that is to the verification of the formula for a small neighborhood of the points of the domain being mapped). This device is based on the linearity of the integral.

a) If the smooth functions $e_1, \dots, e_k$ are such that $0 \le e_i \le 1$, $i = 1, \dots, k$, and $\sum_{i=1}^{k} e_i(x) \equiv 1$ on $D_x$, then $\int_{D_x} \bigl( \sum_{i=1}^{k} e_i f \bigr)(x)\,dx = \int_{D_x} f(x)\,dx$ for every function $f \in \mathcal{R}(D_x)$.

b) If $\operatorname{supp} e_i$ is contained in the set $U \subset D_x$, then $\int_{D_x} (e_i f)(x)\,dx = \int_{U} (e_i f)(x)\,dx$.

c) Taking account of Lemmas 3 and 4 and the linearity of the integral, one can derive formula (11.10) from a) and b), if for every open covering $\{U_\alpha\}$ of the compact set $K = \operatorname{supp} f \subset D_x$ we construct a set of smooth functions $e_1, \dots, e_k$ in $D_x$ such that $0 \le e_i \le 1$, $i = 1, \dots, k$, $\sum_{i=1}^{k} e_i \equiv 1$ on $K$, and for every function $e_i \in \{e_i\}$ there is a set $U_{\alpha_i} \in \{U_\alpha\}$ such that $\operatorname{supp} e_i \subset U_{\alpha_i}$.

In that case the set of functions $\{e_i\}$ is said to be a partition of unity on the compact set $K$ subordinate to the covering $\{U_\alpha\}$.

6. This problem contains a scheme for constructing the partition of unity discussed in Problem 5.

a) Construct a function $f \in C^{(\infty)}(\mathbb{R}, \mathbb{R})$ such that $f|_{[-1,1]} \equiv 1$ and $\operatorname{supp} f \subset [-1 - \delta, 1 + \delta]$, where $\delta > 0$.

b) Construct a function $f \in C^{(\infty)}(\mathbb{R}^n, \mathbb{R})$ with the properties indicated in a) for the unit cube in $\mathbb{R}^n$ and its $\delta$-dilation.

c) Show that for every open covering of the compact set $K \subset \mathbb{R}^n$ there exists a smooth partition of unity on $K$ subordinate to this covering.

d) Extending c), construct a $C^{(\infty)}$-partition of unity in $\mathbb{R}^n$ subordinate to a locally finite open covering of the entire space. (A covering is locally finite if every point of the set that is covered, in this case $\mathbb{R}^n$, has a neighborhood that intersects only a finite number of the sets in the covering. For a partition of unity containing an infinite number of functions $\{e_i\}$ we impose the requirement that every point of the set on which this partition is constructed belongs to the support of at most finitely many of the functions $\{e_i\}$. Under this hypothesis no questions arise as to the meaning of the equality $\sum_i e_i \equiv 1$; more precisely, there are no questions as to the meaning of the sum on the left-hand side.)

7. One can obtain a proof of Theorem 1 that is slightly different from the one given 
above and relies on the possibility of decomposing only a linear mapping into a 
composition of elementary mappings. Such a proof is closer to the heuristic considerations
in Sect. 11.5.1 and is obtained by proving the following assertions.

a) Verify that under elementary linear mappings $L:\mathbb{R}^n\to\mathbb{R}^n$ of the form

$$(x^1,\dots,x^k,\dots,x^n)\mapsto(x^1,\dots,x^{k-1},\lambda x^k,x^{k+1},\dots,x^n),\qquad\lambda\ne 0,$$

and

$$(x^1,\dots,x^k,\dots,x^n)\mapsto(x^1,\dots,x^{k-1},x^k+x^j,\dots,x^n)$$

the relation $\mu(L(E))=|\det L'|\,\mu(E)$ holds for every measurable set $E\subset\mathbb{R}^n$; then
show that this relation holds for every linear transformation $L:\mathbb{R}^n\to\mathbb{R}^n$. (Use
Fubini's theorem and the possibility of decomposing a linear mapping into a composition
of the elementary mappings just exhibited.)

b) Show that if $\varphi:D_t\to D_x$ is a diffeomorphism, then $\mu(\varphi(K))\le
\int_K|\det\varphi'(t)|\,dt$ for every measurable compact set $K\subset D_t$ and its image $\varphi(K)$. (If
$a\in D_t$, then $\exists\,(\varphi'(a))^{-1}$ and in the representation $\varphi(t)=(\varphi'(a)\circ(\varphi'(a))^{-1}\circ\varphi)(t)$
the mapping $\varphi'(a)$ is linear while the transformation $(\varphi'(a))^{-1}\circ\varphi$ is nearly an
isometry on a neighborhood of $a$.)

c) Show that if the function $f$ in Theorem 1 is nonnegative, then $\int_{D_x}f(x)\,dx\le
\int_{D_t}((f\circ\varphi)|\det\varphi'|)(t)\,dt$.

d) Applying the preceding inequality to the function $(f\circ\varphi)|\det\varphi'|$ and the mapping
$\varphi^{-1}:D_x\to D_t$, show that formula (11.10) holds for a nonnegative function.

e) By representing the function $f$ in Theorem 1 as the difference of integrable
nonnegative functions, prove that formula (11.10) holds.

8. Sard's lemma. Let $D$ be an open set in $\mathbb{R}^n$, let $\varphi\in C^{(1)}(D,\mathbb{R}^n)$, and let $S$ be the
set of critical points of the mapping $\varphi$. Then $\varphi(S)$ is a set of (Lebesgue) measure
zero.

We recall that a critical point of a smooth mapping $\varphi$ of a domain $D\subset\mathbb{R}^m$ into
$\mathbb{R}^n$ is a point $x\in D$ at which $\operatorname{rank}\varphi'(x)<\min\{m,n\}$. In the case $m=n$, this is
equivalent to the condition $\det\varphi'(x)=0$.

a) Verify Sard's lemma for a linear transformation.

b) Let $I$ be an interval in the domain $D$ and $\varphi\in C^{(1)}(I,\mathbb{R}^n)$. Show that there
exists a function $\alpha(h)$, $\alpha:\mathbb{R}^n\to\mathbb{R}$, such that $\alpha(h)\to 0$ as $h\to 0$ and $|\varphi(x+h)-
\varphi(x)-\varphi'(x)h|\le\alpha(h)|h|$ for every $x,x+h\in I$.

c) Using b), estimate the deviation of the image $\varphi(I)$ of the interval $I$ under the
mapping $\varphi$ from the same image under the linear mapping $L(x)=\varphi(a)+\varphi'(a)(x-a)$,
where $a\in I$.

d) Based on a), b), and c), show that if $S$ is the set of critical points of the
mapping $\varphi$ in the interval $I$, then $\varphi(S)$ is a set of measure zero.

e) Now finish the proof of Sard's lemma.

f) Using Sard's lemma, show that in Theorem 1 it suffices to require that the
mapping $\varphi$ be a one-to-one mapping of class $C^{(1)}(D_t,D_x)$.

We remark that the version of Sard's lemma given here is a simple special case of
a theorem of Sard and Morse, according to which the assertion of the lemma holds
even if $D\subset\mathbb{R}^m$ and $\varphi\in C^{(k)}(D,\mathbb{R}^n)$, where $k=\max\{m-n+1,1\}$. The quantity $k$
here, as an example of Whitney shows, cannot be decreased for any pair of numbers
$m$ and $n$.

In geometry Sard's lemma is known as the assertion that if $\varphi:D\to\mathbb{R}^n$ is
a smooth mapping of an open set $D\subset\mathbb{R}^m$ into $\mathbb{R}^n$, then for almost all points
$x\in\varphi(D)$, the complete pre-image $\varphi^{-1}(x)=M_x$ in $D$ is a surface (manifold) of
codimension $n$ in $\mathbb{R}^m$ (that is, $m-\dim M_x=n$ for almost all $x\in\varphi(D)$).

9. Suppose we consider an arbitrary mapping $\varphi\in C^{(1)}(D_t,D_x)$ such that
$\det\varphi'(t)\ne 0$ in $D_t$, instead of the diffeomorphism $\varphi$ of Theorem 1. Let $n(x)=
\operatorname{card}\{t\in\operatorname{supp}(f\circ\varphi)\mid\varphi(t)=x\}$, that is, $n(x)$ is the number of points of the support
of the function $f\circ\varphi$ that map to the point $x\in D_x$ under $\varphi:D_t\to D_x$. The
following formula holds:

$$\int_{D_x}(f\cdot n)(x)\,dx=\int_{D_t}((f\circ\varphi)|\det\varphi'|)(t)\,dt.$$

a) What is the geometric meaning of this formula for $f\equiv 1$?

b) Prove this formula for the special mapping of the annulus $D_t=\{t\in\mathbb{R}^2_t\mid
1<|t|<2\}$ onto the annulus $D_x=\{x\in\mathbb{R}^2_x\mid 1<|x|<2\}$ given in polar coordinates
$(r,\varphi)$ and $(\rho,\theta)$ in the planes $\mathbb{R}^2_x$ and $\mathbb{R}^2_t$ respectively by the formulas $r=\rho$,
$\varphi=2\theta$.

c) Now try to prove the formula in general. 


11.6 Improper Multiple Integrals 
11.6.1 Basic Definitions 

Definition 1 An exhaustion of a set $E\subset\mathbb{R}^m$ is a sequence of measurable sets $\{E_n\}$
such that $E_n\subset E_{n+1}\subset E$ for any $n\in\mathbb{N}$ and $\bigcup_{n=1}^{\infty}E_n=E$.

Lemma If $\{E_n\}$ is an exhaustion of a measurable set $E$, then:

a) $\lim_{n\to\infty}\mu(E_n)=\mu(E)$;

b) for every function $f\in\mathcal{R}(E)$ the function $f|_{E_n}$ also belongs to $\mathcal{R}(E_n)$, and

$$\lim_{n\to\infty}\int_{E_n}f(x)\,dx=\int_E f(x)\,dx.$$

Proof Since $E_n\subset E_{n+1}\subset E$, it follows that $\mu(E_n)\le\mu(E_{n+1})\le\mu(E)$ and
$\lim_{n\to\infty}\mu(E_n)\le\mu(E)$. To prove a) we shall show that the inequality
$\lim_{n\to\infty}\mu(E_n)\ge\mu(E)$ also holds.

The boundary $\partial E$ of $E$ has content zero, and hence can be covered by a finite
number of open intervals of total content less than any preassigned number $\varepsilon>0$.
Let $\Delta$ be the union of all these open intervals. Then the set $E\cup\Delta=:\widetilde{E}$ is open in
$\mathbb{R}^m$ and by construction $\widetilde{E}$ contains the closure $\overline{E}$ of $E$ and $\mu(\widetilde{E})\le\mu(E)+\mu(\Delta)<
\mu(E)+\varepsilon$.

For every set $E_n$ of the exhaustion $\{E_n\}$ the construction just described can
be repeated with the value $\varepsilon_n=\varepsilon/2^n$. We then obtain a sequence of open sets
$\widetilde{E}_n=E_n\cup\Delta_n$ such that $E_n\subset\widetilde{E}_n$, $\mu(\widetilde{E}_n)\le\mu(E_n)+\mu(\Delta_n)<\mu(E_n)+\varepsilon_n$, and
$\bigcup_{n=1}^{\infty}\widetilde{E}_n\supset\bigcup_{n=1}^{\infty}E_n\supset E$.

The system of open sets $\Delta,\widetilde{E}_1,\widetilde{E}_2,\dots$ forms an open covering of the compact
set $\overline{E}$.

Let $\Delta,\widetilde{E}_1,\widetilde{E}_2,\dots,\widetilde{E}_k$ be a finite covering of $\overline{E}$ extracted from this covering.
Since $E_1\subset E_2\subset\dots\subset E_k$, the sets $\Delta,\Delta_1,\dots,\Delta_k,E_k$ also form a covering of $\overline{E}$
and hence

$$\mu(E)\le\mu(\overline{E})\le\mu(E_k)+\mu(\Delta)+\mu(\Delta_1)+\dots+\mu(\Delta_k)<\mu(E_k)+2\varepsilon.$$

It follows from this that $\mu(E)\le\lim_{n\to\infty}\mu(E_n)$.

b) The relation $f|_{E_n}\in\mathcal{R}(E_n)$ is well known to us and follows from Lebesgue's
criterion for the existence of the integral over a measurable set. By hypothesis $f\in
\mathcal{R}(E)$, and so there exists a constant $M$ such that $|f(x)|\le M$ on $E$. From the
additivity of the integral and the general estimate for the integral we obtain

$$\left|\int_E f(x)\,dx-\int_{E_n}f(x)\,dx\right|=\left|\int_{E\setminus E_n}f(x)\,dx\right|\le M\mu(E\setminus E_n).$$


From this, together with what was proved in a), we conclude that b) does indeed 
hold. □ 


Definition 2 Let $\{E_n\}$ be an exhaustion of the set $E$ and suppose the function $f:
E\to\mathbb{R}$ is integrable on the sets $E_n\in\{E_n\}$. If the limit

$$\int_E f(x)\,dx:=\lim_{n\to\infty}\int_{E_n}f(x)\,dx$$

exists and has a value independent of the choice of the sets in the exhaustion of $E$,
this limit is called the improper integral of $f$ over $E$.


The integral sign on the left in this last equality is usually written for any function
defined on $E$, but we say that the integral exists or converges if the limit in Definition 2
exists. If there is no common limit for all exhaustions of $E$, we say that the
integral of $f$ over $E$ does not exist, or that the integral diverges.

The purpose of Definition 2 is to extend the concept of integral to the case of an 
unbounded integrand or an unbounded domain of integration. 

The symbol introduced to denote an improper integral is the same as the symbol 
for an ordinary integral, and that fact makes the following remark necessary. 

Remark 1 If $E$ is a measurable set and $f\in\mathcal{R}(E)$, then the integral of $f$ over $E$ in
the sense of Definition 2 exists and has the same value as the proper integral of $f$
over $E$.


Proof This is precisely the content of assertion b) in the lemma above. □ 

The set of all exhaustions of any reasonably rich set is immense, and we do not 
use all exhaustions. The verification that an improper integral converges is often 
simplified by the following proposition. 

Proposition 1 If a function $f:E\to\mathbb{R}$ is nonnegative and the limit in Definition 2
exists for even one exhaustion $\{E_n\}$ of the set $E$, then the improper integral of $f$
over $E$ converges.

Proof Let $\{E'_k\}$ be a second exhaustion of $E$ into elements on which $f$ is integrable.
The sets $E^k_n:=E'_k\cap E_n$, $n=1,2,\dots$, form an exhaustion of the set $E'_k$, and so it
follows from part b) of the lemma that

$$\int_{E'_k}f(x)\,dx=\lim_{n\to\infty}\int_{E^k_n}f(x)\,dx\le\lim_{n\to\infty}\int_{E_n}f(x)\,dx=A.$$

Since $f\ge 0$ and $E'_k\subset E'_{k+1}\subset E$, it follows that

$$\exists\lim_{k\to\infty}\int_{E'_k}f(x)\,dx=B\le A.$$

But there is symmetry between the exhaustions $\{E_n\}$ and $\{E'_k\}$, so that $A\le B$ also,
and hence $A=B$. □

Example 1 Let us find the improper integral $\iint_{\mathbb{R}^2}e^{-(x^2+y^2)}\,dx\,dy$.

We shall exhaust the plane $\mathbb{R}^2$ by the sequence of disks $E_n=\{(x,y)\in\mathbb{R}^2\mid
x^2+y^2<n^2\}$. After passing to polar coordinates we find easily that

$$\iint_{E_n}e^{-(x^2+y^2)}\,dx\,dy=\int_0^{2\pi}d\varphi\int_0^n e^{-r^2}r\,dr=\pi\bigl(1-e^{-n^2}\bigr)\to\pi$$

as $n\to\infty$.

By Proposition 1 we can now conclude that this integral converges and equals $\pi$.
One can derive a useful corollary from this result if we now consider the exhaustion
of the plane by the squares $E'_n=\{(x,y)\in\mathbb{R}^2\mid|x|\le n\wedge|y|\le n\}$. By Fubini's
theorem

$$\iint_{E'_n}e^{-(x^2+y^2)}\,dx\,dy=\int_{-n}^{n}dy\int_{-n}^{n}e^{-(x^2+y^2)}\,dx=\left(\int_{-n}^{n}e^{-t^2}\,dt\right)^2.$$

By Proposition 1 this last quantity must tend to $\pi$ as $n\to\infty$. Thus, following
Euler and Poisson, we find that

$$\int_{-\infty}^{+\infty}e^{-x^2}\,dx=\sqrt{\pi}.$$
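The two exhaustions in Example 1 can be compared numerically. The following Python sketch is only an illustration and not part of the original argument: the function names are hypothetical, the disk integral uses the closed form $\pi(1-e^{-n^2})$ found above, and the one-dimensional integral over $[-n,n]$ is approximated by a composite Simpson rule chosen here for simplicity.

```python
import math

def integral_over_disk(n):
    """Integral of exp(-(x^2 + y^2)) over the disk x^2 + y^2 < n^2,
    using the closed form pi * (1 - exp(-n^2)) obtained in polar coordinates."""
    return math.pi * (1.0 - math.exp(-n * n))

def simpson(f, a, b, m=2000):
    """Composite Simpson rule with an even number m of subintervals."""
    h = (b - a) / m
    total = f(a) + f(b)
    for i in range(1, m):
        total += (4 if i % 2 else 2) * f(a + i * h)
    return total * h / 3.0

def integral_over_square(n):
    """Integral over the square |x| <= n, |y| <= n; by Fubini's theorem it is
    the square of the one-dimensional integral of exp(-t^2) over [-n, n]."""
    one_dim = simpson(lambda t: math.exp(-t * t), -n, n)
    return one_dim ** 2

if __name__ == "__main__":
    for n in (1, 2, 4, 6):
        print(n, integral_over_disk(n), integral_over_square(n), math.pi)
```

Both columns should approach $\pi$ as $n$ grows, in agreement with Proposition 1: for a nonnegative integrand the limit does not depend on the exhaustion.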


Some additional properties of Definition 2 of an improper integral, which are not 
completely obvious at first glance, will be given below in Remark 3. 


11.6.2 The Comparison Test for Convergence of an Improper 
Integral 

Proposition 2 Let $f$ and $g$ be functions defined on the set $E$ and integrable over
exactly the same measurable subsets of it, and suppose $|f(x)|\le g(x)$ on $E$. If
the improper integral $\int_E g(x)\,dx$ converges, then the integrals $\int_E|f|(x)\,dx$ and
$\int_E f(x)\,dx$ also converge.

Proof Let $\{E_n\}$ be an exhaustion of $E$ on whose elements both $g$ and $f$ are integrable.
It follows from the Lebesgue criterion that the function $|f|$ is integrable on
the sets $E_n$, $n\in\mathbb{N}$, and so we can write

$$\int_{E_{n+k}}|f|(x)\,dx-\int_{E_n}|f|(x)\,dx=\int_{E_{n+k}\setminus E_n}|f|(x)\,dx\le\int_{E_{n+k}\setminus E_n}g(x)\,dx=\int_{E_{n+k}}g(x)\,dx-\int_{E_n}g(x)\,dx,$$

where $k$ and $n$ are any natural numbers. When we take account of Proposition 1 and
the Cauchy criterion for the existence of a limit of a sequence, we conclude that the
integral $\int_E|f|(x)\,dx$ converges.

Now consider the functions $f_+:=\frac12(|f|+f)$ and $f_-:=\frac12(|f|-f)$. Obviously
$0\le f_+\le|f|$ and $0\le f_-\le|f|$. By what has just been proved, the improper integrals
of $f_+$ and $f_-$ over $E$ both converge. But $f=f_+-f_-$, and hence the improper
integral of $f$ over the same set converges as well (and is equal to the difference of
the integrals of $f_+$ and $f_-$). □

In order to make effective use of Proposition 2 in studying the convergence of 
improper integrals, it is useful to have a store of standard functions for comparison. 
In this connection we consider the following example. 

Example 2 In the deleted $n$-dimensional ball of radius 1, $B\subset\mathbb{R}^n$ with its center
at 0 removed, consider the function $1/r^{\alpha}$, where $r=d(0,x)$ is the distance from
the point $x\in B\setminus 0$ to the point 0. Let us determine the values of $\alpha\in\mathbb{R}$ for which
the integral of $r^{-\alpha}$ over the domain $B\setminus 0$ converges. To do this we construct an
exhaustion of the domain by the annular regions $B(\varepsilon)=\{x\in B\mid\varepsilon<d(0,x)<1\}$.

Passing to polar coordinates with center at 0, by Fubini's theorem, we obtain

$$\int_{B(\varepsilon)}\frac{dx}{r^{\alpha}(x)}=\int_S f(\varphi)\,d\varphi\int_{\varepsilon}^{1}\frac{r^{n-1}\,dr}{r^{\alpha}}=c\int_{\varepsilon}^{1}\frac{dr}{r^{\alpha-n+1}},$$

where $d\varphi=d\varphi_1\cdots d\varphi_{n-1}$ and $f(\varphi)$ is a certain product of sines of the angles
$\varphi_1,\dots,\varphi_{n-2}$ that appears in the Jacobian of the transition to polar coordinates in $\mathbb{R}^n$,
while $c$ is the magnitude of the integral over $S$, which depends only on $n$, not on $r$
and $\varepsilon$.

As $\varepsilon\to+0$ the value just obtained for the integral over $B(\varepsilon)$ will have a finite
limit if $\alpha<n$. In all other cases this last integral tends to infinity as $\varepsilon\to+0$.

Thus we have shown that the function $\frac{1}{d^{\alpha}(0,x)}$, where $d$ is the distance to the
point 0, can be integrated in a deleted neighborhood of 0 only when $\alpha<n$, where $n$
is the dimension of the space.

Similarly one can show that outside the ball $B$, that is, in a neighborhood of
infinity, this same function is integrable in the improper sense only for $\alpha>n$.
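The threshold $\alpha<n$ in Example 2 comes entirely from the radial factor $\int_{\varepsilon}^{1}r^{n-1-\alpha}\,dr$. A minimal Python sketch of that factor follows, with the hypothetical helper name `radial_integral` and with the angular constant $c$ omitted, since it does not affect convergence.

```python
import math

def radial_integral(eps, alpha, n):
    """Closed form of the integral of r**(n - 1 - alpha) over [eps, 1],
    i.e. the radial factor of the integral of r**(-alpha) over B(eps)."""
    p = n - alpha              # exponent of r after integration
    if abs(p) < 1e-12:         # borderline case alpha = n: logarithmic growth
        return -math.log(eps)
    return (1.0 - eps ** p) / p

if __name__ == "__main__":
    n = 3
    for alpha in (2.0, 3.0, 4.0):
        values = [radial_integral(10.0 ** (-k), alpha, n) for k in (2, 4, 6)]
        print(alpha, values)
```

For $n=3$ the values stabilize when $\alpha=2$, grow like $\ln(1/\varepsilon)$ when $\alpha=3$, and grow like $1/\varepsilon$ when $\alpha=4$.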

Example 3 Let $I=\{x\in\mathbb{R}^n\mid 0\le x^i\le 1,\ i=1,\dots,n\}$ be the $n$-dimensional cube
and $I_k$ the $k$-dimensional face of it defined by the conditions $x^{k+1}=\dots=x^n=0$.
On the set $I\setminus I_k$ we consider the function $\frac{1}{d^{\alpha}(x)}$, where $d(x)$ is the distance from
$x\in I\setminus I_k$ to the face $I_k$. Let us determine the values of $\alpha\in\mathbb{R}$ for which the integral
of this function over $I\setminus I_k$ converges.

We remark that if $x=(x^1,\dots,x^k,x^{k+1},\dots,x^n)$ then

$$d(x)=\sqrt{(x^{k+1})^2+\dots+(x^n)^2}.$$

Let $I(\varepsilon)$ be the cube $I$ from which the $\varepsilon$-neighborhood of the face $I_k$ has been
removed. By Fubini's theorem

$$\int_{I(\varepsilon)}\frac{dx}{d^{\alpha}(x)}=\int_{I_k}dx^1\cdots dx^k\int_{I_{n-k}(\varepsilon)}\frac{dx^{k+1}\cdots dx^n}{\bigl((x^{k+1})^2+\dots+(x^n)^2\bigr)^{\alpha/2}}=\int_{I_{n-k}(\varepsilon)}\frac{du}{|u|^{\alpha}},$$

where $u=(x^{k+1},\dots,x^n)$ and $I_{n-k}(\varepsilon)$ is the face $I_{n-k}\subset\mathbb{R}^{n-k}$ from which the $\varepsilon$-neighborhood
of 0 has been removed.

But it is clear on the basis of the experience acquired in Example 2 that the last
integral converges only for $\alpha<n-k$. Hence the improper integral under consideration
converges only for $\alpha<n-k$, where $k$ is the dimension of the face near which
the function may increase without bound.

Remark 2 In the proof of Proposition 2 we verified that the convergence of the integral
of $|f|$ implies the convergence of the integral of $f$. It turns out that the converse
is also true for an improper integral in the sense of Definition 2, which was
not the case previously when we studied improper integrals on the line. In the
latter case, we distinguished absolute and nonabsolute (conditional) convergence 
of an improper integral. To understand right away the essence of the new phe- 
nomenon that has arisen in connection with Definition 2, consider the following 
example. 

Example 4 Let the function $f:\mathbb{R}_+\to\mathbb{R}$ be defined on the set $\mathbb{R}_+$ of nonnegative
numbers by the following conditions: $f(x)=\frac{(-1)^{n-1}}{n}$, if $n-1\le x<n$, $n\in\mathbb{N}$.

Since the series $\sum_{n=1}^{\infty}\frac{(-1)^{n-1}}{n}$ converges, the integral $\int_0^A f(x)\,dx$ has a limit as
$A\to\infty$ equal to the sum of this series.

However, this series does not converge absolutely, and one can make it divergent
to $+\infty$, for example, by rearranging its terms. The partial sums of the new series
can be interpreted as the integrals of the function $f$ over the union $E_n$ of the closed
intervals on the real line corresponding to the terms of the series. The sets $E_n$, taken
all together, however, form an exhaustion of the domain $\mathbb{R}_+$ on which $f$ is defined.

Thus the improper integral $\int_0^{\infty}f(x)\,dx$ of the function $f:\mathbb{R}_+\to\mathbb{R}$ exists in its
earlier sense, but not in the sense of Definition 2.
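The dependence on the order of summation used in Example 4 can be observed numerically. The sketch below is an illustration under stated assumptions, not the rearrangement of the text: it takes a fixed number of positive terms $1/(2j-1)$ followed by one negative term $-1/(2j)$, which by a classical fact changes the sum from $\ln 2$ to $\ln 2+\frac12\ln p$ (where $p$ is the number of positive terms per block) but keeps it finite. A rearrangement diverging to $+\infty$, as in the example, would need blocks of growing length. The function name is hypothetical.

```python
import math

def rearranged_partial_sum(pos_per_neg, blocks):
    """Partial sum of a rearrangement of 1 - 1/2 + 1/3 - 1/4 + ...:
    each block consists of `pos_per_neg` unused positive terms 1/(2j-1)
    followed by one unused negative term -1/(2j)."""
    total = 0.0
    next_odd = 1    # denominator of the next unused positive term
    next_even = 2   # denominator of the next unused negative term
    for _block in range(blocks):
        for _term in range(pos_per_neg):
            total += 1.0 / next_odd
            next_odd += 2
        total -= 1.0 / next_even
        next_even += 2
    return total

if __name__ == "__main__":
    for p in (1, 2, 4):
        print(p, rearranged_partial_sum(p, 100000), math.log(2) + 0.5 * math.log(p))
```

Each block corresponds to a finite union of unit intervals on which $f$ is integrated, so different block patterns correspond to different exhaustions of $\mathbb{R}_+$ with different limits.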

We see that the condition in Definition 2 that the limit be independent of the
choice of the exhaustion is equivalent to the independence of the sum of a series
on the order of summation. The latter, as we know, is exactly equivalent to absolute
convergence.

In practice one nearly always has to consider only special exhaustions of the following
type. Let a function $f:D\to\mathbb{R}$ defined in the domain $D$ be unbounded in
a neighborhood of some set $E\subset\partial D$. We then remove from $D$ the points lying in
the $\varepsilon$-neighborhood of $E$ and obtain a domain $D(\varepsilon)\subset D$. As $\varepsilon\to 0$ these domains
generate an exhaustion of $D$. If the domain is unbounded, we can obtain an exhaustion
of it by taking the $D$-complements of neighborhoods of infinity. These are the
special exhaustions we mentioned earlier and studied in the one-dimensional case,
and it is these special exhaustions that lead directly to the generalization of the notion
of Cauchy principal value of an improper integral to the case of a space of any
dimension, which we discussed earlier when studying improper integrals on the line.


11.6.3 Change of Variable in an Improper Integral 

In conclusion we obtain the formula for change of variable in improper integrals, 
thereby making a valuable, although very simple, supplement to Theorems 1 and 2 
of Sect. 11.5. 

Theorem 1 Let $\varphi:D_t\to D_x$ be a diffeomorphism of the open set $D_t\subset\mathbb{R}^n$ onto the
set $D_x\subset\mathbb{R}^n$ of the same type, and let $f:D_x\to\mathbb{R}$ be integrable on all measurable
compact subsets of $D_x$. If the improper integral $\int_{D_x}f(x)\,dx$ converges, then the
integral $\int_{D_t}((f\circ\varphi)|\det\varphi'|)(t)\,dt$ also converges and has the same value.

Proof The open set $D_t\subset\mathbb{R}^n$ can be exhausted by a sequence of compact sets $E^k_t$,
$k\in\mathbb{N}$, contained in $D_t$, each of which is the union of a finite number of intervals
in $\mathbb{R}^n$ (in this connection, see the beginning of the proof of Lemma 1 in Sect. 11.5).
Since $\varphi:D_t\to D_x$ is a diffeomorphism, the exhaustion $\{E^k_x\}$ of $D_x$, where $E^k_x=
\varphi(E^k_t)$, corresponds to the exhaustion $\{E^k_t\}$ of $D_t$. Here the sets $E^k_x=\varphi(E^k_t)$ are
measurable compact sets in $D_x$ (measurability follows from Lemma 1 of Sect. 11.5).
By Proposition 1 of Sect. 11.5 we can write

$$\int_{E^k_x}f(x)\,dx=\int_{E^k_t}((f\circ\varphi)|\det\varphi'|)(t)\,dt.$$

The left-hand side of this equality has a limit by hypothesis as $k\to\infty$. Hence
the right-hand side also has the same limit. □


Remark 3 By the reasoning just given we have verified that the integral on the right-hand
side of the last equality has the same limit for any exhaustion of $D_t$ of the given
special type. It is this proven part of the theorem that we shall be using. But formally,
to complete the proof of the theorem in accordance with Definition 2 it is
necessary to verify that this limit exists for every exhaustion of the domain $D_t$. We
leave this (not entirely elementary) proof to the reader as an excellent exercise. We
remark only that one can already deduce the convergence of the improper integral
of $|f\circ\varphi|\,|\det\varphi'|$ over the set $D_t$ (see Problem 7).

Theorem 2 Let $\varphi:D_t\to D_x$ be a mapping of the open sets $D_t$ and $D_x$. Assume
that there are subsets $S_t$ and $S_x$ of measure zero contained in $D_t$ and $D_x$ respectively
such that $D_t\setminus S_t$ and $D_x\setminus S_x$ are open sets and $\varphi$ is a diffeomorphism of the
former onto the latter. Under these hypotheses, if the improper integral $\int_{D_x}f(x)\,dx$
converges, then the integral $\int_{D_t\setminus S_t}((f\circ\varphi)|\det\varphi'|)(t)\,dt$ also converges to the same
value. If in addition $|\det\varphi'|$ is defined and bounded on compact subsets of $D_t$, then
$(f\circ\varphi)|\det\varphi'|$ is improperly integrable over the set $D_t$, and the following equality
holds:

$$\int_{D_x}f(x)\,dx=\int_{D_t}((f\circ\varphi)|\det\varphi'|)(t)\,dt.$$


Proof The assertion is a direct corollary of Theorem 1 and Theorem 2 of Sect. 11.5,
provided we take account of the fact that when finding an improper integral over 
an open set one may restrict consideration to exhaustions that consist of measurable 
compact sets (see Remark 3). □ 


Example 5 Let us compute the integral $\iint_{x^2+y^2<1}\frac{dx\,dy}{(1-x^2-y^2)^{\alpha}}$, which is an improper
integral when $\alpha>0$, since the integrand is unbounded in that case in a neighborhood
of the circle $x^2+y^2=1$.

Passing to polar coordinates, we obtain from Theorem 2

$$\iint_{x^2+y^2<1}\frac{dx\,dy}{(1-x^2-y^2)^{\alpha}}=\iint_{\substack{0<\varphi<2\pi\\0<r<1}}\frac{r\,dr\,d\varphi}{(1-r^2)^{\alpha}}.$$

For $\alpha>0$ this last integral is also improper, but, since the integrand is nonnegative,
it can be computed as the limit over the special exhaustion of the rectangle
$I=\{(r,\varphi)\in\mathbb{R}^2\mid 0<\varphi<2\pi\wedge 0<r<1\}$ by the rectangles $I_n=\{(r,\varphi)\in\mathbb{R}^2\mid
0<\varphi<2\pi\wedge 0<r<1-\frac1n\}$, $n\in\mathbb{N}$. Using Fubini's theorem, we find that for $\alpha<1$

$$\iint_{\substack{0<\varphi<2\pi\\0<r<1}}\frac{r\,dr\,d\varphi}{(1-r^2)^{\alpha}}=\lim_{n\to\infty}\int_0^{2\pi}d\varphi\int_0^{1-\frac1n}\frac{r\,dr}{(1-r^2)^{\alpha}}=\frac{\pi}{1-\alpha}.$$

By the same considerations, one can deduce that the original integral diverges for
$\alpha\ge 1$.
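The limit over the rectangles $I_n$ in Example 5 can be followed explicitly. The Python sketch below uses the antiderivative obtained from the substitution $u=1-r^2$; the function name `integral_over_I_n` is an assumption introduced only for this illustration.

```python
import math

def integral_over_I_n(alpha, n):
    """Value of 2*pi * (integral of r / (1 - r^2)**alpha over 0 < r < 1 - 1/n),
    computed in closed form via the substitution u = 1 - r^2."""
    rho = 1.0 - 1.0 / n
    u = 1.0 - rho * rho                 # value of 1 - r^2 at the outer radius
    if abs(alpha - 1.0) < 1e-12:        # borderline case: logarithmic divergence
        return -math.pi * math.log(u)
    return math.pi * (1.0 - u ** (1.0 - alpha)) / (1.0 - alpha)

if __name__ == "__main__":
    for alpha in (0.5, 1.0, 2.0):
        print(alpha, [integral_over_I_n(alpha, 10 ** k) for k in (2, 4, 6)])
```

For $\alpha=0.5$ the values approach $\pi/(1-\alpha)=2\pi$, while for $\alpha=1$ and $\alpha=2$ they grow without bound as $n$ increases.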


Example 6 Let us show that the integral $\iint_{|x|+|y|\ge 1}\frac{dx\,dy}{|x|^p+|y|^q}$ converges only under
the condition $\frac1p+\frac1q<1$.

Proof In view of the obvious symmetry it suffices to consider the integral only over
the domain $D$ in which $x\ge 0$, $y\ge 0$ and $x+y\ge 1$.

It is clear that the simultaneous conditions $p>0$ and $q>0$ are necessary for the
integral to converge. Indeed, if $p\le 0$ for example, we would obtain the following
estimate for the integral over the rectangle $I_A=\{(x,y)\in\mathbb{R}^2\mid 1\le x\le A\wedge 0\le
y\le 1\}$ alone, which is contained in $D$:

$$\iint_{I_A}\frac{dx\,dy}{|x|^p+|y|^q}=\int_1^A dx\int_0^1\frac{dy}{|x|^p+|y|^q}\ge\int_1^A dx\int_0^1\frac{dy}{1+|y|^q}=(A-1)\int_0^1\frac{dy}{1+|y|^q},$$

which shows that as $A\to\infty$, this integral increases without bound. Thus from now
on we may assume that $p>0$ and $q>0$.

The integrand has no singularities in the bounded portion of the domain $D$, so that
studying the convergence of this integral is equivalent to studying the convergence
of the integral of the same function over, for example, the portion $G$ of the domain
$D$ where $x^p+y^q\ge a>0$. The number $a$ can be assumed sufficiently large that the
curve $x^p+y^q=a$ lies in $D$ for $x\ge 0$ and $y\ge 0$.

Passing to generalized curvilinear coordinates $(r,\varphi)$ using the formulas

$$x=(r\cos^2\varphi)^{1/p},\qquad y=(r\sin^2\varphi)^{1/q},$$

by Theorem 2 we obtain

$$\iint_G\frac{dx\,dy}{|x|^p+|y|^q}=\frac{2}{p\cdot q}\iint_{\substack{0<\varphi<\pi/2\\a\le r<\infty}}\Bigl(r^{\frac1p+\frac1q-2}\cos^{\frac2p-1}\varphi\,\sin^{\frac2q-1}\varphi\Bigr)\,dr\,d\varphi.$$

Using the exhaustion of the domain $\{(r,\varphi)\in\mathbb{R}^2\mid 0<\varphi<\pi/2\wedge a\le r<\infty\}$
by intervals $I_{\varepsilon A}=\{(r,\varphi)\in\mathbb{R}^2\mid 0<\varepsilon\le\varphi\le\pi/2-\varepsilon\wedge a\le r\le A\}$ and applying
Fubini's theorem, we obtain

$$\iint_{\substack{0<\varphi<\pi/2\\a\le r<\infty}}\Bigl(r^{\frac1p+\frac1q-2}\cos^{\frac2p-1}\varphi\,\sin^{\frac2q-1}\varphi\Bigr)\,dr\,d\varphi=\lim_{\varepsilon\to+0}\int_{\varepsilon}^{\pi/2-\varepsilon}\cos^{\frac2p-1}\varphi\,\sin^{\frac2q-1}\varphi\,d\varphi\cdot\lim_{A\to\infty}\int_a^A r^{\frac1p+\frac1q-2}\,dr.$$

Since $p>0$ and $q>0$, the first of these limits is necessarily finite and the second
is finite only when $\frac1p+\frac1q<1$. □
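The Jacobian behind the change of variables in Example 6 can be checked numerically. The following Python sketch is an illustration only: it compares a central-difference approximation of $\det\frac{\partial(x,y)}{\partial(r,\varphi)}$ with the closed form $\frac{2}{pq}\,r^{\frac1p+\frac1q-1}\cos^{\frac2p-1}\varphi\,\sin^{\frac2q-1}\varphi$; dividing by $x^p+y^q=r$ then gives the integrand of the transformed integral. All function names and the sample point are assumptions made for the check.

```python
import math

def to_xy(r, phi, p, q):
    """Generalized polar coordinates x = (r cos^2 phi)^(1/p), y = (r sin^2 phi)^(1/q)."""
    x = (r * math.cos(phi) ** 2) ** (1.0 / p)
    y = (r * math.sin(phi) ** 2) ** (1.0 / q)
    return x, y

def jacobian_numeric(r, phi, p, q, h=1e-6):
    """Central-difference approximation of det d(x, y)/d(r, phi)."""
    x_rp, y_rp = to_xy(r + h, phi, p, q)
    x_rm, y_rm = to_xy(r - h, phi, p, q)
    x_pp, y_pp = to_xy(r, phi + h, p, q)
    x_pm, y_pm = to_xy(r, phi - h, p, q)
    dx_dr, dy_dr = (x_rp - x_rm) / (2 * h), (y_rp - y_rm) / (2 * h)
    dx_dphi, dy_dphi = (x_pp - x_pm) / (2 * h), (y_pp - y_pm) / (2 * h)
    return dx_dr * dy_dphi - dx_dphi * dy_dr

def jacobian_formula(r, phi, p, q):
    """Closed form (2/(pq)) r^(1/p+1/q-1) cos^(2/p-1)(phi) sin^(2/q-1)(phi)."""
    s = 1.0 / p + 1.0 / q
    return (2.0 / (p * q)) * r ** (s - 1.0) \
        * math.cos(phi) ** (2.0 / p - 1.0) * math.sin(phi) ** (2.0 / q - 1.0)

if __name__ == "__main__":
    r, phi, p, q = 3.0, 0.7, 3.0, 1.5
    print(jacobian_numeric(r, phi, p, q), jacobian_formula(r, phi, p, q))
```

The exponent of $r$ in the transformed integrand is therefore $\frac1p+\frac1q-2$, and $\int_a^{\infty}r^{\frac1p+\frac1q-2}\,dr$ is finite exactly when $\frac1p+\frac1q<1$.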


11.6.4 Problems and Exercises 

1. Give conditions on $p$ and $q$ under which the integral $\iint_{0<|x|+|y|\le 1}\frac{dx\,dy}{|x|^p+|y|^q}$ converges.

2. a) Does the limit $\lim_{A\to\infty}\int_0^A\cos x^2\,dx$ exist?

b) Does the integral $\int_{\mathbb{R}^1}\cos x^2\,dx$ converge in the sense of Definition 2?

c) By verifying that

$$\lim_{n\to\infty}\iint_{|x|\le n,\,|y|\le n}\sin(x^2+y^2)\,dx\,dy=\pi$$

and

$$\lim_{n\to\infty}\iint_{x^2+y^2\le 2\pi n}\sin(x^2+y^2)\,dx\,dy=0$$

verify that the integral of $\sin(x^2+y^2)$ over the plane $\mathbb{R}^2$ diverges.

3. a) Compute the integral /„' / 0 X /J 

b) One must be careful when applying Fubini's theorem to improper integrals
(but of course one must also be careful when applying it to proper integrals). Show
that the integral $\iint_{x\ge 1,\,y\ge 1}\frac{x^2-y^2}{(x^2+y^2)^2}\,dx\,dy$ diverges, while both of the iterated integrals
$\int_1^{\infty}dx\int_1^{\infty}\frac{x^2-y^2}{(x^2+y^2)^2}\,dy$ and $\int_1^{\infty}dy\int_1^{\infty}\frac{x^2-y^2}{(x^2+y^2)^2}\,dx$ converge.

c) Prove that if $f\in C(\mathbb{R}^2,\mathbb{R})$ and $f\ge 0$ in $\mathbb{R}^2$, then the existence of either of
the iterated integrals $\int_{-\infty}^{\infty}dx\int_{-\infty}^{\infty}f(x,y)\,dy$ and $\int_{-\infty}^{\infty}dy\int_{-\infty}^{\infty}f(x,y)\,dx$ implies
that the integral $\iint_{\mathbb{R}^2}f(x,y)\,dx\,dy$ converges to the value of the iterated integral in
question.

4. Show that if $f\in C(\mathbb{R},\mathbb{R})$, then

$$\lim_{h\to+0}\frac{1}{\pi}\int_{-1}^{1}\frac{h}{h^2+x^2}f(x)\,dx=f(0).$$


5. Let $D$ be a bounded domain in $\mathbb{R}^n$ with a smooth boundary and $S$ a smooth
$k$-dimensional surface contained in the boundary of $D$. Show that if the function
$f\in C(D,\mathbb{R})$ admits the estimate $|f|<\frac{1}{d^{\,n-k-\varepsilon}}$, where $d=d(S,x)$ is the distance
from $x\in D$ to $S$ and $\varepsilon>0$, then the integral of $f$ over $D$ converges.

6. As a supplement to Remark 1 show that it remains valid even if the set E is not 
assumed to be measurable. 

7. Let $D$ be an open set in $\mathbb{R}^n$ and let the function $f:D\to\mathbb{R}$ be integrable over
any measurable compact set contained in $D$.

a) Show that if the improper integral of the function $|f|$ over $D$ diverges, then
there exists an exhaustion $\{E_n\}$ of $D$ such that each set $E_n$ is an elementary compact
set, consisting of a finite number of $n$-dimensional intervals, and $\int_{E_n}|f|(x)\,dx\to
+\infty$ as $n\to\infty$.

b) Verify that if the integral of $f$ over a set converges while the integral of $|f|$
diverges, then the integrals of $f_+=\frac12(|f|+f)$ and $f_-=\frac12(|f|-f)$ over the set
both diverge.

c) Show that the exhaustion $\{E_n\}$ obtained in a) can be distributed in such a way
that $\int_{E_{n+1}\setminus E_n}f_+(x)\,dx>\int_{E_n}|f|(x)\,dx+n$ for all $n\in\mathbb{N}$.

d) Using lower Darboux sums, show that if $\int_E f_+(x)\,dx>A$, then there exists
an elementary compact set $F\subset E$ consisting of a finite number of intervals such
that $\int_F f(x)\,dx>A$.

e) Deduce from c) and d) that there exists an elementary compact set $F_n\subset
E_{n+1}\setminus E_n$ for which $\int_{F_n}f(x)\,dx>\int_{E_n}|f|(x)\,dx+n$.

f) Show using e) that the sets $G_n=F_n\cup E_n$ are elementary compact sets (that
is, they consist of a finite number of intervals) contained in $D$ that, taken together,
constitute an exhaustion of $D$, and for which the relation $\int_{G_n}f(x)\,dx\to+\infty$ as
$n\to\infty$ holds.

Thus, if the integral of $|f|$ diverges, then the integral of $f$ (in the sense of Definition 2)
also diverges.

8. Carry out the proof of Theorem 2 in detail. 

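9. We recall that if $x=(x^1,\dots,x^n)$ and $\xi=(\xi^1,\dots,\xi^n)$, then $\langle x,\xi\rangle=x^1\xi^1+
\dots+x^n\xi^n$ is the standard inner product in $\mathbb{R}^n$. Let $A=(a_{ij})$ be a symmetric
$n\times n$ matrix of complex numbers. We denote by $\operatorname{Re}A$ the matrix with elements
$\operatorname{Re}a_{ij}$. Writing $\operatorname{Re}A\ge 0$ (resp. $\operatorname{Re}A>0$) means that $\langle(\operatorname{Re}A)x,x\rangle\ge 0$ (resp.
$\langle(\operatorname{Re}A)x,x\rangle>0$) for every $x\in\mathbb{R}^n$, $x\ne 0$.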

a) Show that if Re A > 0, then for a > 0 and § e R" we have 




Here the branch of $\sqrt{\det A}$ is chosen as follows:

$$(\det A)^{-1/2}=|\det A|^{-1/2}\exp(-i\operatorname{Ind}A),$$




y'=l 


where $\mu_j(A)$ are the eigenvalues of $A$.

b) Let $A$ be a real-valued symmetric nondegenerate $(n\times n)$ matrix. Then for
$\xi\in\mathbb{R}^n$ and $\lambda>0$ we have




Here $\operatorname{sgn}A$ is the signature of the matrix, that is,

$$\operatorname{sgn}A=\nu_+(A)-\nu_-(A),$$

where $\nu_+(A)$ is the number of positive eigenvalues of $A$ and $\nu_-(A)$ the number of
negative eigenvalues.


Chapter 12 

Surfaces and Differential Forms in $\mathbb{R}^n$


In this chapter we discuss the concepts of surface, boundary of a surface, and consistent
orientation of a surface and its boundary; we derive a formula for computing
the area of a surface lying in $\mathbb{R}^n$; and we give some elementary information on differential
forms. Mastery of these concepts is very important in working with line
and surface integrals, to which the next chapter is devoted.


12.1 Surfaces in $\mathbb{R}^n$

The standard model for a $k$-dimensional surface is $\mathbb{R}^k$.

Definition 1 A surface of dimension $k$ (or $k$-dimensional surface or $k$-dimensional
manifold) in $\mathbb{R}^n$ is a subset $S\subset\mathbb{R}^n$ each point of which has a neighborhood¹ in $S$
homeomorphic² to $\mathbb{R}^k$.

Definition 2 The mapping $\varphi:\mathbb{R}^k\to U\subset S$ provided by the homeomorphism referred
to in the definition of a surface is called a chart or a local chart of the surface
$S$, $\mathbb{R}^k$ is called the parameter domain, and $U$ is the range or domain of action
of the chart on the surface $S$.

A local chart introduces curvilinear coordinates in $U$ by assigning to the point
$x=\varphi(t)\in U$ the set of numbers $t=(t^1,\dots,t^k)\in\mathbb{R}^k$. It is clear from the definition
that the set of objects $S$ described by the definition does not change if $\mathbb{R}^k$ is replaced
in it by any topological space homeomorphic to $\mathbb{R}^k$. Most often the standard parameter
region for local charts is assumed to be an open cube $I^k$ or an open ball $B^k$
in $\mathbb{R}^k$. But this makes no substantial difference.

¹ As before, a neighborhood of a point $x\in S\subset\mathbb{R}^n$ in $S$ is a set $U_S(x)=S\cap U(x)$, where $U(x)$ is a
neighborhood of $x$ in $\mathbb{R}^n$. Since we shall be discussing only neighborhoods of a point on a surface
in what follows, we shall simplify the notation where no confusion can arise by writing $U$ or $U(x)$
instead of $U_S(x)$.

² On $S\subset\mathbb{R}^n$ and hence also on $U\subset S$ there is a unique metric induced from $\mathbb{R}^n$, so that one can
speak of a topological mapping of $U$ into $\mathbb{R}^k$.

To carry out certain analogies and in order to make a number of the following 
constructions easier to visualize, we shall as a rule take a cube I k as the canonical 
parameter domain for local charts on a surface. Thus a chart

$$\varphi:I^k\to U\subset S\tag{12.1}$$

gives a local parametric equation $x=\varphi(t)$ for the surface $S\subset\mathbb{R}^n$, and the $k$-dimensional
surface itself thus has the local structure of a deformed standard $k$-dimensional
interval $I^k\subset\mathbb{R}^n$.

The parametric definition of a surface is especially important for computational 
purposes, as will become clear below. Sometimes one can define the entire surface 
by a single chart. Such a surface is usually called elementary. For example, the graph 
of a continuous function $f:I^k\to\mathbb{R}$ in $\mathbb{R}^{k+1}$ is an elementary surface. However,
elementary surfaces are more the exception than the rule. For example, our ordinary 
two-dimensional terrestrial sphere cannot be defined by only one chart. An atlas of 
the surface of the Earth must contain at least two charts (see Problem 3 at the end of 
this section). 

In accordance with this analogy we adopt the following definition. 

Definition 3 A set $A(S):=\{\varphi_i:I^k_i\to U_i,\ i\in\mathbb{N}\}$ of local charts of a surface $S$
whose domains of action together cover the entire surface (that is, $S=\bigcup_i U_i$) is
called an atlas of the surface $S$.

The union of two atlases of the same surface is obviously also an atlas of the 
surface. 

If no restrictions are imposed on the mappings (12.1), the local parametrizations
of the surface, except that they must be homeomorphisms, the surface may be situated
very strangely in $\mathbb{R}^n$. For example, it can happen that a surface homeomorphic
to a two-dimensional sphere, that is, a topological sphere, is contained in $\mathbb{R}^3$, but
the region it bounds is not homeomorphic to a ball (the so-called Alexander horned
sphere).³

To eliminate such complications, which have nothing to do with the questions
considered in analysis, we defined a smooth $k$-dimensional surface in $\mathbb{R}^n$ in Sect. 8.7
to be a set $S\subset\mathbb{R}^n$ such that for each $x_0\in S$ there exists a neighborhood $U(x_0)$
in $\mathbb{R}^n$ and a diffeomorphism $\psi:U(x_0)\to I^n=\{t\in\mathbb{R}^n\mid|t^i|<1,\ i=1,\dots,n\}$ under
which the set $U_S(x_0):=S\cap U(x_0)$ maps into the cube $I^k=I^n\cap\{t\in\mathbb{R}^n\mid
t^{k+1}=\dots=t^n=0\}$.

It is clear that a surface that is smooth in this sense is a surface in the sense
of Definition 1, since the mappings $x=\psi^{-1}(t^1,\dots,t^k,0,\dots,0)=\varphi(t^1,\dots,t^k)$
obviously define a local parametrization of the surface. The converse, as follows
from the example of the horned sphere mentioned above, is generally not true, if
the mappings $\varphi$ are merely homeomorphisms. However, if the mappings (12.1) are
sufficiently regular, the concept of a surface is actually the same in both the old and
new definitions.

³ An example of the surface described here was constructed by the American topologist
J.W. Alexander (1888-1977).

In essence this has already been shown by Example 8 in Sect. 8.7, but considering 
the importance of the question, we give a precise statement of the assertion and 
recall how the answer is obtained. 

Proposition If the mapping (12.1) belongs to class $C^{(1)}(I^k,\mathbb{R}^n)$ and has maximal
rank at each point of the cube $I^k$, there exists a number $\varepsilon>0$ and a diffeomorphism
$\varphi_\varepsilon:I^n_\varepsilon\to\mathbb{R}^n$ of the cube $I^n_\varepsilon:=\{t\in\mathbb{R}^n\mid|t^i|\le\varepsilon_i,\ i=1,\dots,n\}$ of dimension $n$
in $\mathbb{R}^n$ such that $\varphi|_{I^k\cap I^n_\varepsilon}=\varphi_\varepsilon|_{I^k\cap I^n_\varepsilon}$.

In other words, it is asserted that under these hypotheses the mappings (12.1) are
locally the restrictions of diffeomorphisms of the full-dimensional cubes $I^n_\varepsilon$ to the
$k$-dimensional cubes $I^k_\varepsilon=I^k\cap I^n_\varepsilon$.

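Proof Suppose for definiteness that the first $k$ of the $n$ coordinate functions
$x^i = \varphi^i(t^1, \dots, t^k)$, $i = 1, \dots, n$, of the mapping
$x = \varphi(t)$ are such that
$\det\bigl(\frac{\partial \varphi^i}{\partial t^j}\bigr)(0) \ne 0$,
$i, j = 1, \dots, k$. Then by the implicit function theorem the relations

\[
\begin{cases}
x^1 = \varphi^1(t^1, \dots, t^k),\\
\quad\vdots\\
x^k = \varphi^k(t^1, \dots, t^k),\\
x^{k+1} = \varphi^{k+1}(t^1, \dots, t^k),\\
\quad\vdots\\
x^n = \varphi^n(t^1, \dots, t^k)
\end{cases}
\]

near the point $(t_0, x_0) = (0, \varphi(0))$ are equivalent to relations

\[
\begin{cases}
t^1 = f^1(x^1, \dots, x^k),\\
\quad\vdots\\
t^k = f^k(x^1, \dots, x^k),\\
x^{k+1} = f^{k+1}(x^1, \dots, x^k),\\
\quad\vdots\\
x^n = f^n(x^1, \dots, x^k).
\end{cases}
\]

In this case the mapping

\[
\begin{cases}
t^1 = f^1(x^1, \dots, x^k),\\
\quad\vdots\\
t^k = f^k(x^1, \dots, x^k),\\
t^{k+1} = x^{k+1} - f^{k+1}(x^1, \dots, x^k),\\
\quad\vdots\\
t^n = x^n - f^n(x^1, \dots, x^k)
\end{cases}
\]

is a diffeomorphism of a full-dimensional neighborhood of the point
$x_0 \in \mathbb{R}^n$. As $\varphi_\varepsilon$ we can now take the restriction
to some cube $I^n_\varepsilon$ of the diffeomorphism inverse to it. □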


By a change of scale, of course, one can arrange to have $\varepsilon = 1$ and a
unit cube $I^n$ in the last diffeomorphism.
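To see the construction of the proof at work, here is a small worked instance. The particular chart $\varphi(t) = (t, t^2)$ with $k = 1$, $n = 2$ is an assumption chosen only for illustration; it does not come from the text.

```latex
\begin{align*}
&\text{chart of the parabola:} & x^1 &= \varphi^1(t) = t, & x^2 &= \varphi^2(t) = t^2, \qquad \frac{d\varphi^1}{dt}(0) = 1 \ne 0;\\
&\text{equivalent relations:} & t &= f^1(x^1) = x^1, & x^2 &= f^2(x^1) = (x^1)^2;\\
&\text{straightening mapping:} & t^1 &= x^1, & t^2 &= x^2 - (x^1)^2;\\
&\text{its inverse:} & \varphi_\varepsilon(t^1, t^2) &= \bigl(t^1,\ t^2 + (t^1)^2\bigr), & \det \varphi_\varepsilon' &= 1;\\
&\text{restriction to } t^2 = 0: & \varphi_\varepsilon(t^1, 0) &= \bigl(t^1, (t^1)^2\bigr) = \varphi(t^1).
\end{align*}
```

In this instance the inverse mapping happens to be a diffeomorphism of the whole plane, so any $\varepsilon > 0$ will do; in general only a sufficiently small cube is guaranteed.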

Thus we have shown that for a smooth surface in $\mathbb{R}^n$ one can adopt the
following definition, which is equivalent to the previous one.

Definition 4 The $k$-dimensional surface in $\mathbb{R}^n$ introduced by
Definition 1 is smooth (of class $C^{(m)}$, $m \ge 1$) if it has an atlas whose
local charts are smooth mappings (of class $C^{(m)}$, $m \ge 1$) and have rank
$k$ at each point of their domains of definition.

We remark that the condition on the rank of the mappings (12.1) is essential. For
example, the analytic mapping $\mathbb{R} \ni t \mapsto (x^1, x^2) \in \mathbb{R}^2$
defined by $x^1 = t^2$, $x^2 = t^3$ defines a curve in the plane $\mathbb{R}^2$
having a cusp at $(0, 0)$. It is clear that this curve is not a smooth
one-dimensional surface in $\mathbb{R}^2$, since the latter must have a tangent
(a one-dimensional tangent plane) at each point.$^4$
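The failure of the rank condition for this path can be checked numerically. The sketch below is only an illustration; the helper names (`velocity`, `rank_of_velocity`, `unit_velocity`) are hypothetical and not taken from the text. It computes the velocity of $t \mapsto (t^2, t^3)$, reports the rank of its one-column Jacobian, and compares the unit velocity just before and just after $t = 0$.

```python
def velocity(t):
    """Derivative of the path t -> (t**2, t**3)."""
    return (2.0 * t, 3.0 * t ** 2)


def rank_of_velocity(t, tol=1e-12):
    """Rank of the one-column Jacobian: 1 if the velocity is nonzero, else 0."""
    vx, vy = velocity(t)
    return 1 if max(abs(vx), abs(vy)) > tol else 0


def unit_velocity(t):
    """Velocity normalized to length 1 (defined only where the rank is 1)."""
    vx, vy = velocity(t)
    norm = (vx * vx + vy * vy) ** 0.5
    return (vx / norm, vy / norm)


# Rank is 1 away from t = 0 and drops to 0 exactly at the cusp.
ranks = {t: rank_of_velocity(t) for t in (-1.0, 0.0, 1.0)}

# The unit velocity points along -x just before the cusp and along +x just after.
before = unit_velocity(-1e-6)
after = unit_velocity(1e-6)
print(ranks, before, after)
```

The path itself is analytic, yet its velocity vanishes at $t = 0$ and reverses direction there, which is exactly the situation that the rank condition excludes.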

Thus, in particular one should not conflate the concept of a smooth path of class
$C^{(m)}$ and the concept of a smooth curve of class $C^{(m)}$.

In analysis, as a rule, we deal with rather smooth parametrizations (12.1) of
rank $k$. We have verified that in this case Definition 4 adopted here for a
smooth surface agrees with the one considered earlier in Sect. 8.7. However,
while the previous definition was intuitive and eliminated certain unnecessary
complications immediately, the well-known advantage of Definition 4 of a surface,
in accordance with Definition 1, is that it can easily be extended to the
definition of an abstract manifold, not necessarily embedded in $\mathbb{R}^n$.
For the time being, however, we shall be interested only in surfaces in
$\mathbb{R}^n$.

Let us consider some examples of such surfaces. 


4 For the tangent plane see Sect. 8.7. 
Example 1 We recall that if $F^i \in C^{(m)}(\mathbb{R}^n, \mathbb{R})$,
$i = 1, \dots, n - k$, is a set of smooth functions such that the system of
equations

\[
\begin{cases}
F^1(x^1, \dots, x^k, x^{k+1}, \dots, x^n) = 0,\\
\quad\vdots\\
F^{n-k}(x^1, \dots, x^k, x^{k+1}, \dots, x^n) = 0
\end{cases}
\tag{12.2}
\]

has rank $n - k$ at each point in the set $S$ of its solutions, then either this
system has no solutions at all or the set of its solutions forms a
$k$-dimensional $C^{(m)}$-smooth surface $S$ in $\mathbb{R}^n$.


Proof We shall verify that if $S \ne \varnothing$, then $S$ does indeed satisfy
Definition 4. This follows from the implicit function theorem, which says that in
some neighborhood of each point $x_0 \in S$ the system (12.2) is equivalent, up
to a relabeling of the variables, to a system

\[
\begin{cases}
x^{k+1} = f^{k+1}(x^1, \dots, x^k),\\
\quad\vdots\\
x^n = f^n(x^1, \dots, x^k),
\end{cases}
\]

where $f^{k+1}, \dots, f^n \in C^{(m)}$. By writing this last system as

\[
\begin{cases}
x^1 = t^1,\\
\quad\vdots\\
x^k = t^k,\\
x^{k+1} = f^{k+1}(t^1, \dots, t^k),\\
\quad\vdots\\
x^n = f^n(t^1, \dots, t^k),
\end{cases}
\]

we arrive at a parametric equation for the neighborhood of the point $x_0 \in S$
on $S$. By an additional transformation one can obviously turn the domain into a
canonical domain, for example, into $I^k$, and obtain a standard local chart
(12.1). □

Example 2 In particular, the sphere defined in $\mathbb{R}^n$ by the equation

\[
(x^1)^2 + \cdots + (x^n)^2 = r^2 \quad (r > 0) \tag{12.3}
\]

is an $(n - 1)$-dimensional smooth surface in $\mathbb{R}^n$ since the set $S$ of
solutions of Eq. (12.3) is obviously nonempty and the gradient of the left-hand
side of (12.3) is nonzero at each point of $S$.

When $n = 2$, we obtain the circle in $\mathbb{R}^2$ given by

\[
(x^1)^2 + (x^2)^2 = r^2,
\]

which can easily be parametrized locally by the polar angle $\theta$ using the
polar coordinates

\[
\begin{cases}
x^1 = r \cos\theta,\\
x^2 = r \sin\theta.
\end{cases}
\]

For fixed $r > 0$ the mapping $\theta \mapsto (x^1, x^2)(\theta)$ is a
diffeomorphism on every interval of the form
$\theta_0 < \theta < \theta_0 + 2\pi$, and two charts (for example, those
corresponding to values $\theta_0 = 0$ and $\theta_0 = -\pi$) suffice to produce
an atlas of the circle. We could not get by with one canonical chart (12.1) here
because a circle is compact, in contrast to $\mathbb{R}^1$ or $I^1 = B^1$, and
compactness is invariant under topological mappings.

Polar (spherical) coordinates can also be used to parametrize the two-dimensional
sphere

\[
(x^1)^2 + (x^2)^2 + (x^3)^2 = r^2
\]

in $\mathbb{R}^3$. Denoting by $\psi$ the angle between the direction of the
vector $(x^1, x^2, x^3)$ and the positive $x^3$-axis (that is,
$0 \le \psi \le \pi$) and by $\varphi$ the polar angle of the projection of the
radius-vector $(x^1, x^2, x^3)$ onto the $(x^1, x^2)$-plane, we obtain

\[
\begin{cases}
x^3 = r \cos\psi,\\
x^2 = r \sin\psi \sin\varphi,\\
x^1 = r \sin\psi \cos\varphi.
\end{cases}
\]


In general polar coordinates $(r, \theta_1, \dots, \theta_{n-1})$ in
$\mathbb{R}^n$ are introduced via the relations

\[
\begin{cases}
x^1 = r \cos\theta_1,\\
x^2 = r \sin\theta_1 \cos\theta_2,\\
\quad\vdots\\
x^{n-1} = r \sin\theta_1 \sin\theta_2 \cdots \sin\theta_{n-2} \cos\theta_{n-1},\\
x^n = r \sin\theta_1 \sin\theta_2 \cdots \sin\theta_{n-2} \sin\theta_{n-1}.
\end{cases}
\tag{12.4}
\]

We recall the Jacobian

\[
J = r^{n-1} \sin^{n-2}\theta_1 \sin^{n-3}\theta_2 \cdots \sin\theta_{n-2} \tag{12.5}
\]

for the transition (12.4) from polar coordinates
$(r, \theta_1, \dots, \theta_{n-1})$ to Cartesian coordinates
$(x^1, \dots, x^n)$ in $\mathbb{R}^n$. It is clear from the expression for the
Jacobian that it is nonzero if, for example, $0 < \theta_i < \pi$,
$i = 1, \dots, n - 2$, and $r > 0$. Hence, even without invoking the simple
geometric meaning of the parameters $\theta_1, \dots, \theta_{n-1}$, one can
guarantee that for a fixed $r > 0$ the mapping
$(\theta_1, \dots, \theta_{n-1}) \mapsto (x^1, \dots, x^n)$, being the
restriction of a local diffeomorphism
$(r, \theta_1, \dots, \theta_{n-1}) \mapsto (x^1, \dots, x^n)$, is itself a
local diffeomorphism. But the sphere is homogeneous under the group of
orthogonal transformations of $\mathbb{R}^n$, so that the possibility of
constructing a local chart for a neighborhood of any point of the sphere now
follows.
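Formula (12.5) can be compared with a finite-difference Jacobian of the mapping (12.4). The following is a minimal sketch under stated assumptions: the function names are hypothetical, the derivative is approximated by central differences with step `h`, and the determinant is computed by plain Gaussian elimination, so agreement is only up to rounding and truncation error.

```python
import math


def polar_to_cartesian(r, thetas):
    """Map (r, theta_1, ..., theta_{n-1}) to (x^1, ..., x^n) following (12.4)."""
    xs = []
    prod = r
    for theta in thetas:
        xs.append(prod * math.cos(theta))
        prod *= math.sin(theta)
    xs.append(prod)  # x^n = r sin(theta_1) ... sin(theta_{n-1})
    return xs


def determinant(matrix):
    """Determinant by Gaussian elimination with partial pivoting."""
    a = [row[:] for row in matrix]
    n = len(a)
    det = 1.0
    for c in range(n):
        pivot = max(range(c, n), key=lambda i: abs(a[i][c]))
        if abs(a[pivot][c]) < 1e-15:
            return 0.0
        if pivot != c:
            a[c], a[pivot] = a[pivot], a[c]
            det = -det
        det *= a[c][c]
        for i in range(c + 1, n):
            factor = a[i][c] / a[c][c]
            for k in range(c, n):
                a[i][k] -= factor * a[c][k]
    return det


def numeric_jacobian_det(r, thetas, h=1e-6):
    """Determinant of d(x^1..x^n)/d(r, theta_1..theta_{n-1}) by central differences."""
    params = [r] + list(thetas)
    n = len(params)
    columns = []
    for j in range(n):
        plus, minus = params[:], params[:]
        plus[j] += h
        minus[j] -= h
        f_plus = polar_to_cartesian(plus[0], plus[1:])
        f_minus = polar_to_cartesian(minus[0], minus[1:])
        columns.append([(p - m) / (2 * h) for p, m in zip(f_plus, f_minus)])
    matrix = [[columns[j][i] for j in range(n)] for i in range(n)]
    return determinant(matrix)


def formula_jacobian(r, thetas):
    """J = r^(n-1) sin^(n-2)(theta_1) ... sin(theta_{n-2}) from (12.5)."""
    n = len(thetas) + 1
    value = r ** (n - 1)
    for i, theta in enumerate(thetas[:-1], start=1):
        value *= math.sin(theta) ** (n - 1 - i)
    return value


sample = (2.0, [0.7, 1.1, 0.4])  # a point of R^4 in polar coordinates
print(numeric_jacobian_det(*sample), formula_jacobian(*sample))
```

The two printed numbers should agree to several decimal places at any point with $r > 0$ and $0 < \theta_i < \pi$ for $i \le n - 2$.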

Example 3 The cylinder

\[
(x^1)^2 + \cdots + (x^k)^2 = r^2 \quad (r > 0),
\]

for $k < n$ is an $(n - 1)$-dimensional surface in $\mathbb{R}^n$ that is the
direct product of the $(k - 1)$-dimensional sphere in the plane of the variables
$(x^1, \dots, x^k)$ and the $(n - k)$-dimensional plane of the variables
$(x^{k+1}, \dots, x^n)$.

A local parametrization of this surface can obviously be obtained if we take
the first $k - 1$ of the $n - 1$ parameters $(t^1, \dots, t^{n-1})$ to be the
polar coordinates $\theta_1, \dots, \theta_{k-1}$ of a point of the
$(k - 1)$-dimensional sphere in $\mathbb{R}^k$ and set $t^k, \dots, t^{n-1}$
equal to $x^{k+1}, \dots, x^n$ respectively.

Example 4 If we take a curve (a one-dimensional surface) in the plane $y = 0$ of
$\mathbb{R}^3$ endowed with Cartesian coordinates $(x, y, z)$, and the curve does
not intersect the $z$-axis, we can rotate the curve about the $z$-axis and obtain
a 2-dimensional surface. The local coordinates can be taken as the local
coordinates of the original curve (the meridian) and, for example, the angle of
revolution (a local coordinate on a parallel of latitude).

In particular, if the original curve is a circle of radius $a$ with center at
$(b, 0, 0)$, for $a < b$ we obtain the two-dimensional torus (Fig. 12.1). Its
parametric equation can be represented in the form

\[
\begin{cases}
x = (b + a \cos\psi) \cos\varphi,\\
y = (b + a \cos\psi) \sin\varphi,\\
z = a \sin\psi,
\end{cases}
\]

where $\psi$ is the angular parameter on the original circle - the meridian - and
$\varphi$ is the angle parameter on a parallel of latitude.
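A quick numerical check of this parametrization is sketched below. The implicit equation $(\sqrt{x^2 + y^2} - b)^2 + z^2 = a^2$ used for comparison is a standard description of the torus of revolution but is not stated in the text, and the function names are hypothetical. The second helper confirms that the parametrization has rank 2 by computing the length of the cross product of its partial derivatives.

```python
import math


def torus_point(a, b, psi, phi):
    """Point of the torus for meridian angle psi and latitude angle phi."""
    rho = b + a * math.cos(psi)
    return (rho * math.cos(phi), rho * math.sin(phi), a * math.sin(psi))


def implicit_residual(a, b, point):
    """Residual of (sqrt(x^2 + y^2) - b)^2 + z^2 - a^2 (zero on the torus)."""
    x, y, z = point
    return (math.hypot(x, y) - b) ** 2 + z ** 2 - a ** 2


def normal_length(a, b, psi, phi):
    """Length of the cross product of the partial derivatives in psi and phi."""
    rho = b + a * math.cos(psi)
    r_psi = (-a * math.sin(psi) * math.cos(phi),
             -a * math.sin(psi) * math.sin(phi),
             a * math.cos(psi))
    r_phi = (-rho * math.sin(phi), rho * math.cos(phi), 0.0)
    cross = (r_psi[1] * r_phi[2] - r_psi[2] * r_phi[1],
             r_psi[2] * r_phi[0] - r_psi[0] * r_phi[2],
             r_psi[0] * r_phi[1] - r_psi[1] * r_phi[0])
    return math.sqrt(sum(c * c for c in cross))


a, b = 1.0, 3.0  # sample radii with a < b
for psi, phi in [(0.0, 0.0), (1.0, 2.5), (4.0, 5.5)]:
    p = torus_point(a, b, psi, phi)
    print(p, implicit_residual(a, b, p), normal_length(a, b, psi, phi))
```

Since $a < b$, the factor $b + a\cos\psi$ is positive, so the cross product has length $a(b + a\cos\psi) > 0$ everywhere and the chart has rank 2 at every point.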

It is customary to refer to any surface homeomorphic to the torus of revolution 
just constructed as a torus (more precisely, a two-dimensional torus). As one can 
see, a two-dimensional torus is the direct product of two circles. Since a circle can 
be obtained from a closed interval by gluing together (identifying) its endpoints,
a torus can be obtained from the direct product of two closed intervals (that is, a 
rectangle) by gluing the opposite sides together at corresponding points (Fig. 12.2). 

In essence, we have already made use of this device earlier when we established 
that the configuration space of a double pendulum is a two-dimensional torus, and 
that a path on the torus corresponds to a motion of the pendulum. 

Example 5 If a flexible ribbon (rectangle) is glued along the arrows shown
in Fig. 12.3a, one can obtain an annulus (Fig. 12.3c) or a cylindrical surface
(Fig. 12.3b), which are the same from a topological point of view. (These two
surfaces are homeomorphic.) But if the ribbon is glued together along the arrows
shown in Fig. 12.4a, we obtain a surface in $\mathbb{R}^3$ (Fig. 12.4b) called a
Möbius band.$^5$

Local coordinates on this surface can be naturally introduced using the coordi- 
nates on the plane in which the original rectangle lies. 

Example 6 Comparing the results of Examples 4 and 5 in accordance with the
natural analogy, one can now prescribe how to glue a rectangle (Fig. 12.5a) that
combines elements of the torus and elements of the Möbius band. But, just as it
was necessary to go outside $\mathbb{R}^2$ in order to glue the Möbius band
without tearing or self-intersections, the gluing prescribed here cannot be
carried out in $\mathbb{R}^3$. However, this can be done in $\mathbb{R}^4$,
resulting in a surface in $\mathbb{R}^4$ usually called the Klein bottle.$^6$ An
attempt to depict this surface has been undertaken in Fig. 12.5b.


Fig. 12.3


5 A.F. Möbius (1790-1868) - German mathematician and astronomer.

6 F.Ch. Klein (1849-1925) - outstanding German mathematician, the first to make a
rigorous investigation of non-Euclidean geometry. An expert in the history of
mathematics and one of the organizers of the “Encyklopädie der mathematischen
Wissenschaften”.

Fig. 12.5



This last example gives some idea of how a surface can be intrinsically described
more easily than the same surface lying in a particular space $\mathbb{R}^n$.
Moreover, many important surfaces (of different dimensions) originally arise not
as subsets of $\mathbb{R}^n$, but, for example, as the phase spaces of mechanical
systems or the geometric image of continuous transformation groups of
automorphisms, as the quotient spaces with respect to groups of automorphisms of
the original space, and so on. We confine ourselves for the time being to these
introductory remarks, waiting to make them more precise until Chap. 15, where we
shall give a general definition of a surface not necessarily lying in
$\mathbb{R}^n$. But already at this point, before the definition has even been
given, we note that by a well-known theorem of Whitney$^7$ any $k$-dimensional
surface can be mapped homeomorphically onto a surface lying in
$\mathbb{R}^{2k+1}$. Hence in considering surfaces in $\mathbb{R}^n$ we really
lose nothing from the point of view of topological variety and classification.
These questions, however, are somewhat off the topic of our modest requirements
in geometry.


12.1.1 Problems and Exercises 

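1. For each of the sets $E_a$ given by the conditions

\[
\begin{aligned}
E_a &= \{(x, y) \in \mathbb{R}^2 \mid x^2 - y^2 = a\},\\
E_a &= \{(x, y, z) \in \mathbb{R}^3 \mid x^2 - y^2 = a\},\\
E_a &= \{(x, y, z) \in \mathbb{R}^3 \mid x^2 + y^2 - z^2 = a\},\\
E_a &= \{z \in \mathbb{C} \mid |z^2 - 1| = a\},
\end{aligned}
\]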

7 H. Whitney (1907-1989) - American topologist, one of the founders of the theory of fiber bun- 
dles. 
depending on the value of the parameter $a \in \mathbb{R}$, determine

a) whether E a is a surface; 

b) if so, what the dimension of E a is; 

c) whether E a is connected. 

2. Let $f : \mathbb{R}^n \to \mathbb{R}^n$ be a smooth mapping satisfying the
condition $f \circ f = f$.

a) Show that the set $f(\mathbb{R}^n)$ is a smooth surface in $\mathbb{R}^n$.

b) By what property of the mapping $f$ is the dimension of this surface
determined?

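3. Let $e_0, e_1, \dots, e_n$ be an orthonormal basis in the Euclidean space
$\mathbb{R}^{n+1}$, let $x = x^0 e_0 + x^1 e_1 + \cdots + x^n e_n$, let $\{x\}$
be the point $(x^0, x^1, \dots, x^n)$, and let $e_1, \dots, e_n$ be a basis in
$\mathbb{R}^n \subset \mathbb{R}^{n+1}$.

The formulas

\[
\psi_1 = \frac{x - x^0 e_0}{1 - x^0} \ \text{ for } x \ne e_0, \qquad
\psi_2 = \frac{x - x^0 e_0}{1 + x^0} \ \text{ for } x \ne -e_0
\]

define the stereographic projections

\[
\psi_1 : S^n \setminus \{e_0\} \to \mathbb{R}^n, \qquad
\psi_2 : S^n \setminus \{-e_0\} \to \mathbb{R}^n
\]

from the points $\{e_0\}$ and $\{-e_0\}$ respectively.

a) Determine the geometric meaning of these mappings.

b) Verify that if $t \in \mathbb{R}^n$ and $t \ne 0$, then
$(\psi_2 \circ \psi_1^{-1})(t) = \frac{t}{|t|^2}$, where
$\psi_1^{-1} = (\psi_1|_{S^n \setminus \{e_0\}})^{-1}$.

c) Show that the two charts
$\psi_1^{-1} = \varphi_1 : \mathbb{R}^n \to S^n \setminus \{e_0\}$ and
$\psi_2^{-1} = \varphi_2 : \mathbb{R}^n \to S^n \setminus \{-e_0\}$ form an atlas
of the sphere $S^n \subset \mathbb{R}^{n+1}$.

d) Prove that every atlas of the sphere must have at least two charts.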


12.2 Orientation of a Surface 

We recall first of all that the transition from one frame $e_1, \dots, e_n$ in
$\mathbb{R}^n$ to a second frame $\tilde e_1, \dots, \tilde e_n$ is effected by
means of the square matrix $(a^i_j)$ obtained from the expansions
$\tilde e_j = a^i_j e_i$. The determinant of this matrix is always nonzero, and
the set of all frames divides into two equivalence classes, each class containing
all possible frames such that for any two of them the determinant of the
transition matrix is positive. Such equivalence classes are called orientation
classes of frames in $\mathbb{R}^n$.

To define an orientation means to fix one of these orientation classes. Thus, the
oriented space $\mathbb{R}^n$ is the space $\mathbb{R}^n$ itself together with a
fixed orientation class of frames. To specify the orientation class it suffices
to exhibit any of the frames in it, so that one can also say that the oriented
space $\mathbb{R}^n$ is $\mathbb{R}^n$ together with a fixed frame in it.
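The division of frames into two orientation classes can be illustrated with a few lines of code. This is a minimal sketch, with hypothetical function names, in which each frame is given by the standard coordinates of its vectors. It relies on the fact that if the second frame is expressed through the first by a transition matrix, then the determinant of the second frame equals the determinant of the first times the determinant of the transition matrix, so the sign of the transition determinant is the sign of the product of the two frame determinants.

```python
def determinant(rows):
    """Determinant by Laplace expansion along the first row (fine for small n)."""
    if len(rows) == 1:
        return rows[0][0]
    total = 0.0
    for j, entry in enumerate(rows[0]):
        minor = [row[:j] + row[j + 1:] for row in rows[1:]]
        total += (-1) ** j * entry * determinant(minor)
    return total


def same_orientation_class(frame_a, frame_b):
    """True if the transition matrix between the two frames has positive determinant."""
    det_a = determinant(frame_a)
    det_b = determinant(frame_b)
    if det_a == 0 or det_b == 0:
        raise ValueError("vectors are linearly dependent, so this is not a frame")
    return det_a * det_b > 0


standard = [[1, 0, 0], [0, 1, 0], [0, 0, 1]]  # e1, e2, e3
cyclic = [[0, 1, 0], [0, 0, 1], [1, 0, 0]]    # e2, e3, e1
swapped = [[0, 1, 0], [1, 0, 0], [0, 0, 1]]   # e2, e1, e3

print(same_orientation_class(standard, cyclic))   # cyclic permutation of the frame
print(same_orientation_class(standard, swapped))  # transposition of two vectors
```

A cyclic permutation of three vectors keeps the frame in the same class, while interchanging two vectors moves it to the opposite class.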
A frame in $\mathbb{R}^n$ generates a coordinate system in $\mathbb{R}^n$, and
the transition from one such coordinate system to another is effected by the
matrix $(a^j_i)$ that is the transpose of the matrix $(a^i_j)$ that connects the
two frames. Since the determinants of these two matrices are the same, everything
that was said above about orientation can be repeated on the level of orientation
classes of coordinate systems in $\mathbb{R}^n$, placing in one class all the
coordinate systems such that the transition matrix between any two systems in the
same class has a positive Jacobian.

Both of these essentially identical approaches to describing the concept of an
orientation in $\mathbb{R}^n$ will also manifest themselves in describing the
orientation of a surface, to which we now turn.

We recall, however, another connection between coordinates and frames in the 
case of curvilinear coordinate systems, a connection that will be useful in what is to 
follow. 

Let $G$ and $D$ be diffeomorphic domains lying in two copies of the space
$\mathbb{R}^n$ endowed with Cartesian coordinates $(x^1, \dots, x^n)$ and
$(t^1, \dots, t^n)$ respectively. A diffeomorphism $\varphi : D \to G$ can be
regarded as the introduction of curvilinear coordinates $(t^1, \dots, t^n)$ into
the domain $G$ via the rule $x = \varphi(t)$, that is, the point $x \in G$ is
endowed with the Cartesian coordinates $(t^1, \dots, t^n)$ of the point
$t = \varphi^{-1}(x) \in D$. If we consider a frame $e_1, \dots, e_n$ of the
tangent space $T\mathbb{R}^n_t$ at each point $t \in D$ composed of the unit
vectors along the coordinate directions, a field of frames arises in $D$, which
can be regarded as the translations of the orthogonal frame of the original space
$\mathbb{R}^n$ containing $D$, parallel to itself, to the points of $D$. Since
$\varphi : D \to G$ is a diffeomorphism, the mapping
$\varphi'(t) : TD_t \to TG_{x = \varphi(t)}$ of tangent spaces effected by the
rule $TD_t \ni e \mapsto \varphi'(t)e = \xi \in TG_x$ is an isomorphism of the
tangent spaces at each point $t$. Hence from the frame $e_1, \dots, e_n$ in
$TD_t$ we obtain a frame
$\xi_1 = \varphi'(t)e_1, \dots, \xi_n = \varphi'(t)e_n$ in $TG_x$, and the field
of frames on $D$ transforms into a field of frames on $G$ (see Fig. 12.6). Since
$\varphi \in C^{(1)}(D, G)$, the vector field
$\xi(x) = \xi(\varphi(t)) = \varphi'(t)e(t)$ is continuous in $G$ if the vector
field $e(t)$ is continuous in $D$. Thus every continuous field of frames
(consisting of $n$ continuous vector fields) transforms under a diffeomorphism to
a continuous field of frames.

Now let us consider a pair of diffeomorphisms $\varphi_i : D_i \to G$,
$i = 1, 2$, which introduce two systems of curvilinear coordinates
$(t^1_1, \dots, t^n_1)$ and $(t^1_2, \dots, t^n_2)$ into the same domain $G$. The
mutually inverse diffeomorphisms $\varphi_2^{-1} \circ \varphi_1 : D_1 \to D_2$
and $\varphi_1^{-1} \circ \varphi_2 : D_2 \to D_1$ provide mutual transitions
between these coordinate systems. The Jacobians of these mappings at
corresponding points of $D_1$ and $D_2$ are mutually inverse to each other and
consequently have the same sign. If the domain $G$ (and together with it $D_1$
and $D_2$) is connected, then by the continuity and nonvanishing of the Jacobians
under consideration, they have the same sign at all points of the domains $D_1$
and $D_2$ respectively.

Hence the set of all curvilinear coordinate systems introduced in a connected
domain $G$ by this method divides into exactly two equivalence classes when each
class is assigned systems whose mutual transitions are effected with a positive
Jacobian. Such equivalence classes are called the orientation classes of
curvilinear coordinate systems in $G$.
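A small worked instance may make the two classes concrete. The domain (the open right half-plane $G = \{x^1 > 0\} \subset \mathbb{R}^2$) and the two coordinate systems below are assumptions chosen for illustration; they differ only in the order in which the polar coordinates are listed.

```latex
\begin{align*}
\varphi_1 &: (r, \theta) \mapsto (r\cos\theta,\ r\sin\theta), &
\det \varphi_1' &= \det\begin{pmatrix} \cos\theta & -r\sin\theta\\ \sin\theta & r\cos\theta \end{pmatrix} = r > 0,\\
\varphi_2 &: (\theta, r) \mapsto (r\cos\theta,\ r\sin\theta), &
\det \varphi_2' &= \det\begin{pmatrix} -r\sin\theta & \cos\theta\\ r\cos\theta & \sin\theta \end{pmatrix} = -r < 0,\\
\varphi_2^{-1} \circ \varphi_1 &: (r, \theta) \mapsto (\theta, r), &
\det (\varphi_2^{-1} \circ \varphi_1)' &= \det\begin{pmatrix} 0 & 1\\ 1 & 0 \end{pmatrix} = -1,
\end{align*}
```

where $r > 0$ and $|\theta| < \pi/2$ in both systems. The transition Jacobian is negative at every point, so the two systems lie in different orientation classes, while the Cartesian coordinates $(x^1, x^2)$ themselves lie in the same class as $\varphi_1$, because the transition from $\varphi_1$ to them has Jacobian $r > 0$.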

To define an orientation in G means by definition to fix an orientation class of its 
curvilinear coordinate systems. 

It is not difficult to verify that curvilinear coordinate systems belonging to
the same orientation class generate continuous fields of frames in $G$ (as
described above) that are in the same orientation class of the tangent space
$TG_x$ at each point $x \in G$. It can be shown in general that, if $G$ is
connected, the continuous fields of frames on $G$ divide into exactly two
equivalence classes if each class is assigned the fields whose frames belong to
the same orientation class of frames of the space $TG_x$ at each point
$x \in G$ (in this connection, see Problems 3 and 4 at the end of this section).

Thus the same orientation of a domain G can be defined in two completely equiv- 
alent ways: by exhibiting a curvilinear coordinate system in G, or by defining any 
continuous field of frames in G, all belonging to the same orientation class as the 
field of frames generated by this coordinate system. 

It is now clear that the orientation of a connected domain $G$ is completely
determined if a frame that orients $TG_x$ is prescribed at even one point
$x \in G$. This circumstance is widely used in practice. If such an orienting
frame is defined at some point $x_0 \in G$, and a curvilinear coordinate system
$\varphi : D \to G$ is taken in $G$, then after constructing the frame induced by
this coordinate system in $TG_{x_0}$, we compare it with the orienting frame in
$TG_{x_0}$. If the two frames both belong to the same orientation class of
$TG_{x_0}$, we regard the curvilinear coordinates as defining the same
orientation on $G$ as the orienting frame. Otherwise, we regard them as defining
the opposite orientation.

If $G$ is an open set, not necessarily connected, since what has just been said
is applicable to any connected component of $G$, it is necessary to define an
orienting frame in each component of $G$ in order to orient $G$. Hence, if there
are $m$ components, the set $G$ admits $2^m$ different orientations.

What has just been said about the orientation of a domain
$G \subset \mathbb{R}^n$ can be repeated verbatim if instead of the domain $G$ we
consider a smooth $k$-dimensional surface $S$ in $\mathbb{R}^n$ defined by a
single chart (see Fig. 12.7). In this case the curvilinear coordinate systems on
$S$ also divide naturally into two orientation classes in accordance with the
sign of the Jacobian of their mutual transition transformations; fields of frames
also arise on $S$; and the orientation can also be defined by an orienting frame
in some tangent plane $TS_{x_0}$ to $S$.

The only new element that arises here and requires verification is the implicitly 
occurring proposition that follows. 
Proposition 1 The mutual transitions from one curvilinear coordinate system to
another on a smooth surface $S \subset \mathbb{R}^n$ are diffeomorphisms of the
same degree of smoothness as the charts of the surface.

Proof In fact, by the proposition in Sect. 12.1, we can regard any chart
$\varphi : I^k \to U \subset S$ locally as the restriction to $I^k \cap O(t)$ of
a diffeomorphism $\mathcal{F} : O(t) \to O(x)$ from some $n$-dimensional
neighborhood $O(t)$ of the point $t \in I^k \subset \mathbb{R}^n$ to an
$n$-dimensional neighborhood $O(x)$ of $x \in S \subset \mathbb{R}^n$,
$\mathcal{F}$ being of the same degree of smoothness as $\varphi$. If now
$\varphi_1 : I^k_1 \to U_1$ and $\varphi_2 : I^k_2 \to U_2$ are two such charts,
then the action of the mapping $\varphi_2^{-1} \circ \varphi_1$ (the transition
from the first coordinate system to the second) which arises in the common domain
of action can be represented locally as
$\varphi_2^{-1} \circ \varphi_1(t^1, \dots, t^k) = \mathcal{F}_2^{-1} \circ \mathcal{F}_1(t^1, \dots, t^k, 0, \dots, 0)$,
where $\mathcal{F}_1$ and $\mathcal{F}_2$ are the corresponding diffeomorphisms
of the $n$-dimensional neighborhoods. □

We have studied all the essential components of the concept of an orientation
of a surface using the example of an elementary surface defined by a single
chart. We now complete the discussion with the final definitions relating to the
case of an arbitrary smooth surface in $\mathbb{R}^n$.

Let $S$ be a smooth $k$-dimensional surface in $\mathbb{R}^n$, and let
$\varphi_i : I^k_i \to U_i$, $\varphi_j : I^k_j \to U_j$ be two local charts of
the surface $S$ whose domains of action intersect, that is,
$U_i \cap U_j \ne \varnothing$. Then between the sets
$I^k_{ij} = \varphi_i^{-1}(U_j)$ and $I^k_{ji} = \varphi_j^{-1}(U_i)$, as was
just proved, there are natural mutually inverse diffeomorphisms
$\varphi_{ij} : I^k_{ij} \to I^k_{ji}$ and $\varphi_{ji} : I^k_{ji} \to I^k_{ij}$
that realize the transition from one local curvilinear coordinate system on $S$
to the other.

Definition 1 Two local charts of a surface are consistent if their domains of ac- 
tion either do not intersect, or have a nonempty intersection for which the mutual 
transitions are effected by diffeomorphisms with positive Jacobian in their common 
domain of action. 

Definition 2 An atlas of a surface is an orienting atlas of the surface if it consists 
of pairwise consistent charts. 
Definition 3 A surface is orientable if it has an orienting atlas. Otherwise it is
nonorientable.

In contrast to domains of $\mathbb{R}^n$ or elementary surfaces defined by a
single chart, an arbitrary surface may turn out to be nonorientable.

Example 1 The Möbius band, as one can verify (see Problems 2 and 3 at the end of
this section), is a nonorientable surface.

Example 2 The Klein bottle is also a nonorientable surface, since it contains a
Möbius band. This last fact can be seen immediately from the construction of the
Klein bottle shown in Fig. 12.5.

Example 3 A circle and in general a $k$-dimensional sphere are orientable, as can
be proved by exhibiting directly an atlas of the sphere consisting of consistent
charts (see Example 2 of Sect. 12.1).

Example 4 The two-dimensional torus studied in Example 4 of Sect. 12.1 is also an 
orientable surface. Indeed, using the parametric equations of the torus exhibited in 
Example 4 of Sect. 12.1, one can easily exhibit an orienting atlas for it. 

We shall not go into detail, since a more visualizable method of controlling the 
orientability of sufficiently simple surfaces will be exhibited below, making it easy 
to verify the assertions in Examples 1-4. 

The formal description of the concept of orientation of a surface will be finished 
if we add Definitions 4 and 5 below to Definitions 1, 2, and 3. 

Two orienting atlases of a surface are equivalent if their union is also an orienting 
atlas of the surface. 

This relation is indeed an equivalence relation between orienting atlases of an 
orientable surface. 

Definition 4 An equivalence class of orienting atlases of a surface under this rela- 
tion is called an orientation class of atlases or simply an orientation of the surface. 

Definition 5 An oriented surface is a surface with a fixed orientation class of atlases 
(that is, a fixed orientation of the surface). 

Thus orienting a surface means exhibiting a particular orientation class of ori- 
enting atlases of the surface by some means or other. 

Some special manifestations of the following proposition are already familiar to 
us. 

Proposition 2 There exist precisely two orientations on a connected orientable sur- 
face. 
They are usually called opposite orientations.

The proof of Proposition 2 will be given in Sect. 15.2.3. 

If an orientable surface is connected, an orientation of it can be defined by speci- 
fying any local chart of the surface or an orienting frame in any of its tangent planes. 
This fact is widely used in practice. 

When a surface has more than one connected component, such a local chart or 
frame is naturally to be exhibited in each component. 

The following way of defining an orientation of a surface embedded in a space
that already carries an orientation is widely used in practice. Let $S$ be an
orientable $(n - 1)$-dimensional surface embedded in the Euclidean space
$\mathbb{R}^n$ with a fixed orienting frame $e_1, \dots, e_n$ in $\mathbb{R}^n$.
Let $TS_x$ be the $(n - 1)$-dimensional plane tangent to $S$ at $x \in S$, and
$\mathbf{n}$ the vector orthogonal to $TS_x$, that is, the vector normal to the
surface $S$ at $x$. If we agree that for the given vector $\mathbf{n}$ the frame
$\xi_1, \dots, \xi_{n-1}$ is to be chosen in $TS_x$ so that the frames
$(e_1, \dots, e_n)$ and
$(\mathbf{n}, \xi_1, \dots, \xi_{n-1}) = (\tilde e_1, \dots, \tilde e_n)$ belong
to the same orientation class on $\mathbb{R}^n$, then, as one can easily see,
such frames $(\xi_1, \dots, \xi_{n-1})$ of the plane $TS_x$ will themselves all
turn out to belong to the same orientation class for this plane. Hence in this
case defining an orientation class for $TS_x$ and along with it an orientation on
a connected orientable surface can be done by defining the normal vector
$\mathbf{n}$ (Fig. 12.8).

It is not difficult to verify (see Problem 4) that the orientability of an
$(n - 1)$-dimensional surface embedded in the Euclidean space $\mathbb{R}^n$ is
equivalent to the existence of a continuous field of nonzero normal vectors on
the surface.

Hence, in particular, the orientability of the sphere and the torus follows
obviously, as does the nonorientability of the Möbius band, as was stated in
Examples 1-4.

In geometry the connected $(n - 1)$-dimensional surfaces in the Euclidean space
$\mathbb{R}^n$ on which there exists a (single-valued) continuous field of unit
normal vectors are called two-sided.

Thus, for example, a sphere, torus, or plane in $\mathbb{R}^3$ is a two-sided
surface, in contrast to the Möbius band, which is a one-sided surface in this
sense.
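The one-sidedness of the Möbius band can be observed numerically by carrying a unit normal once around its center line. The sketch below is an illustration only: the parametrization $r(\theta, s) = ((1 + s\cos\tfrac{\theta}{2})\cos\theta,\ (1 + s\cos\tfrac{\theta}{2})\sin\theta,\ s\sin\tfrac{\theta}{2})$ is a commonly used one that is assumed here, since the text describes the band only by gluing, and all function names are hypothetical.

```python
import math


def mobius_partials(theta, s=0.0):
    """Partial derivatives of the assumed parametrization r(theta, s)."""
    half = theta / 2.0
    rho = 1.0 + s * math.cos(half)
    d_rho = -0.5 * s * math.sin(half)  # derivative of rho with respect to theta
    r_theta = (d_rho * math.cos(theta) - rho * math.sin(theta),
               d_rho * math.sin(theta) + rho * math.cos(theta),
               0.5 * s * math.cos(half))
    r_s = (math.cos(half) * math.cos(theta),
           math.cos(half) * math.sin(theta),
           math.sin(half))
    return r_theta, r_s


def cross(u, v):
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])


def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def unit_normal(theta, s=0.0):
    """Unit normal obtained from the cross product of the coordinate vectors."""
    n = cross(*mobius_partials(theta, s))
    length = math.sqrt(dot(n, n))
    return tuple(c / length for c in n)


# theta = 0 and theta = 2*pi give the same point (1, 0, 0) of the center line,
# but the normal transported continuously around the band arrives reversed.
start = unit_normal(0.0)
end = unit_normal(2.0 * math.pi)
print(start, end, dot(start, end))
```

The normal varies continuously with $\theta$, yet after one full circuit it returns to the starting point with the opposite sign, so it cannot be extended to a single-valued continuous field on the band.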

To finish our discussion of the concept of orientation of a surface, we make sev- 
eral remarks on the practical use of this concept in analysis. 

In the computations that are connected in analysis with oriented surfaces in
$\mathbb{R}^n$ one usually finds first some local parametrization of the surface
$S$ without bothering about orientation. In some tangent plane $TS_x$ to the
surface one then constructs a frame $\xi_1, \dots, \xi_{n-1}$ consisting of
(velocity) vectors tangent to the coordinate lines of a chosen curvilinear
coordinate system, that is, the orienting frame induced by this coordinate
system.

If the space $\mathbb{R}^n$ has been oriented and an orientation of $S$ has been
defined by a field of normal vectors, one chooses the vector $\mathbf{n}$ of the
given field at the point $x$ and compares the frame
$\mathbf{n}, \xi_1, \dots, \xi_{n-1}$ with the frame $e_1, \dots, e_n$ that
orients the space. If these are in the same orientation class, the local chart
defines the required orientation of the surface in accordance with our
convention. If these two frames are inconsistent, the chosen chart defines an
orientation of the surface opposite to the one prescribed by the normal
$\mathbf{n}$.

It is clear that when there is a local chart of an $(n - 1)$-dimensional surface,
one can obtain a local chart of the required orientation (the one prescribed by
the fixed normal vector $\mathbf{n}$ to the two-sided hypersurface embedded in the
oriented space $\mathbb{R}^n$) by a simple change in the order of the
coordinates.
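The comparison of frames described above can be carried out explicitly for the unit sphere in $\mathbb{R}^3$. In this sketch the orienting frame of the space is taken to be the standard one, the orientation of the sphere is prescribed by the outward normal, and the chart is the spherical parametrization by $(\psi, \varphi)$ from Sect. 12.1; these choices and the function names are assumptions made for illustration.

```python
import math


def sphere_frame(psi, phi):
    """Outward unit normal and coordinate tangent vectors of the unit sphere."""
    normal = (math.sin(psi) * math.cos(phi),
              math.sin(psi) * math.sin(phi),
              math.cos(psi))
    xi_psi = (math.cos(psi) * math.cos(phi),
              math.cos(psi) * math.sin(phi),
              -math.sin(psi))
    xi_phi = (-math.sin(psi) * math.sin(phi),
              math.sin(psi) * math.cos(phi),
              0.0)
    return normal, xi_psi, xi_phi


def triple_product(u, v, w):
    """Determinant of the matrix with columns u, v, w."""
    return (u[0] * (v[1] * w[2] - v[2] * w[1])
            - u[1] * (v[0] * w[2] - v[2] * w[0])
            + u[2] * (v[0] * w[1] - v[1] * w[0]))


def chart_agrees_with_normal(psi, phi, swap=False):
    """Compare (n, xi_1, xi_2) with the standard frame; swap reorders the coordinates."""
    normal, xi_psi, xi_phi = sphere_frame(psi, phi)
    first, second = (xi_phi, xi_psi) if swap else (xi_psi, xi_phi)
    return triple_product(normal, first, second) > 0


print(chart_agrees_with_normal(1.0, 0.3))             # coordinates ordered (psi, phi)
print(chart_agrees_with_normal(1.0, 0.3, swap=True))  # coordinates ordered (phi, psi)
```

For $0 < \psi < \pi$ the determinant equals $\sin\psi$ in the order $(\psi, \varphi)$ and $-\sin\psi$ in the order $(\varphi, \psi)$, so interchanging the two coordinates is enough to pass from one orientation to the other.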

In the one-dimensional case, in which a surface is a curve, the orientation is more 
often defined by the tangent vector to the curve at some point; in that case we often 
say the direction of motion along the curve rather than “the orientation of the curve”. 

If an orienting frame has been chosen in $\mathbb{R}^2$ and a closed curve is
given, the positive direction of circuit around the domain $D$ bounded by the
curve is taken to be the direction such that the frame $\mathbf{n}, \mathbf{v}$,
where $\mathbf{n}$ is the exterior normal to the curve with respect to $D$ and
$\mathbf{v}$ is the velocity of the motion, is consistent with the orienting
frame in $\mathbb{R}^2$.

This means, for example, that for the traditional frame drawn in the plane a pos- 
itive circuit is “counterclockwise”, in which the domain is always “on the left”. 

In this connection the orientation of the plane itself or of a portion of the
plane is often defined by giving the positive direction along some closed curve,
usually a circle, rather than a frame in $\mathbb{R}^2$.

Defining such a direction amounts to exhibiting the direction of shortest rota- 
tion from the first vector in the frame until it coincides with the second, which is 
equivalent to defining an orientation class of frames on the plane. 


12.2.1 Problems and Exercises 

1. Is the atlas of the sphere exhibited in Problem 3c) of Sect. 12.1 an orienting atlas 
of the sphere? 

2. a) Using Example 4 of Sect. 12.1, exhibit an orienting atlas of the two- 
dimensional torus. 

b) Prove that there does not exist an orienting atlas for the Möbius band.

c) Show that under a diffeomorphism $f : D \to \tilde D$ an orientable surface
$S \subset D$ maps to an orientable surface $\tilde S \subset \tilde D$.

3. a) Verify that the curvilinear coordinate systems on a domain
$G \subset \mathbb{R}^n$ belonging to the same orientation class generate
continuous fields of frames in $G$ that determine frames of the same orientation
class on the space $TG_x$ at each point $x \in G$.

b) Show that in a connected domain $G \subset \mathbb{R}^n$ the continuous fields
of frames divide into exactly two orientation classes.

c) Use the example of the sphere to show that a smooth surface
$S \subset \mathbb{R}^n$ may be orientable even though there is no continuous
field of frames in the tangent spaces to $S$.

d) Prove that on a connected orientable surface one can define exactly two
different orientations.

4. a) A subspace $\mathbb{R}^{n-1} \subset \mathbb{R}^n$ has been fixed, a vector
$v \in \mathbb{R}^n \setminus \mathbb{R}^{n-1}$ has been chosen, along with two
frames $(\xi_1, \dots, \xi_{n-1})$ and
$(\tilde\xi_1, \dots, \tilde\xi_{n-1})$ of the subspace $\mathbb{R}^{n-1}$.
Verify that these frames belong to the same orientation class of frames of
$\mathbb{R}^{n-1}$ if and only if the frames $(v, \xi_1, \dots, \xi_{n-1})$ and
$(v, \tilde\xi_1, \dots, \tilde\xi_{n-1})$ define the same orientation on
$\mathbb{R}^n$.

b) Show that a smooth hypersurface $S \subset \mathbb{R}^n$ is orientable if and
only if there exists a continuous field of unit normal vectors to $S$. Hence, in
particular, it follows that a two-sided surface is orientable.

c) Show that if $\operatorname{grad} F \ne 0$, then the surface defined by
$F(x^1, \dots, x^m) = 0$ is orientable (assuming that the equation has
solutions).

d) Generalize the preceding result to the case of a surface defined by a system
of equations.

e) Explain why not every smooth two-dimensional surface in $\mathbb{R}^3$ can be
defined by an equation $F(x, y, z) = 0$, where $F$ is a smooth function having no
critical points (a function for which $\operatorname{grad} F \ne 0$ at all
points).


12.3 The Boundary of a Surface and Its Orientation 
12.3.1 Surfaces with Boundary 

Let $\mathbb{R}^k$ be a Euclidean space of dimension $k$ endowed with Cartesian
coordinates $t^1, \dots, t^k$. Consider the half-space
$H^k := \{t \in \mathbb{R}^k \mid t^1 \le 0\}$ of the space $\mathbb{R}^k$. The
hyperplane $\partial H^k := \{t \in \mathbb{R}^k \mid t^1 = 0\}$ will be called
the boundary of the half-space $H^k$.

We remark that the set $\mathring{H}^k := H^k \setminus \partial H^k$, that is,
the open part of $H^k$, is an elementary $k$-dimensional surface. The half-space
$H^k$ itself does not formally satisfy the definition of a surface because of the
presence of the boundary points from $\partial H^k$. The set $H^k$ is the
standard model for surfaces with boundary, which we shall now describe.

Definition 1 A set $S \subset \mathbb{R}^n$ is a ($k$-dimensional) surface with
boundary if every point $x \in S$ has a neighborhood $U$ in $S$ homeomorphic
either to $\mathbb{R}^k$ or to $H^k$.

Definition 2 If a point $x \in U$ corresponds to a point of the boundary
$\partial H^k$ under the homeomorphism of Definition 1, then $x$ is called a
boundary point of the surface (with boundary) $S$ and of its neighborhood $U$.
The set of all such boundary points is called the boundary of the surface $S$.
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As a rule, the boundary of a surface S will be denoted 3 S. We note that for k = I 
the space 3 H k consists of a single point. Hence, preserving the relation 3 H k — 
R* -1 , we shall from now on take R° to consist of a single point and regard 3R° as 
the empty set. 

We recall that under a homeomorphism (pij : G, — > G of the domain G, C R^ 
onto the domain G ; C R A the interior points of G, map to interior points of the im- 
age <Pij(Gi ) (this is a theorem of Brouwer). Consequently, the concept of a boundary 
point of the surface is independent of the choice of the local chart, that is, the con- 
cept is well defined. 

Formally Definition 1 includes the case of the surface described in Definition 1 
of Sect. 12.1. Comparing these definitions, we see that if S has no boundary points, 
we return to our previous definition of a surface, which can now be regarded as 
the definition of a surface without boundary. In this connection we note that the 
term “surface with boundary” is normally used when the set of boundary points is 
nonempty. 

The concept of a smooth surface $S$ (of class $C^{(m)}$) with boundary can be introduced,
as for surfaces without boundary, by requiring that $S$ have an atlas of charts of
the given smoothness class. When doing this we assume that for charts of the form
$\varphi : H^k \to U$ the partial derivatives of $\varphi$ are computed at points of the boundary
$\partial H^k$ only over the domain $H^k$ of definition of the mapping $\varphi$, that is, these
derivatives are sometimes one-sided, and that the Jacobian of the mapping $\varphi$ is nonzero
throughout $H^k$.

Since $\mathbb{R}^k$ can be mapped to the cube
$I^k = \{t \in \mathbb{R}^k \mid |t^i| < 1,\ i = 1, \ldots, k\}$ by a diffeomorphism of class
$C^{(\infty)}$ and in such a way that $H^k$ maps to the portion $I_H^k$ of the cube $I^k$ defined
by the additional condition $t^1 \le 0$, it is clear that in the definition of a surface with
boundary (even a smooth one) we could have replaced $\mathbb{R}^k$ by $I^k$ and $H^k$ by $I_H^k$
or by the cube $\tilde I^k$ with one of its faces attached:
$I^{k-1} := \{t \in \mathbb{R}^k \mid t^1 = 1,\ |t^i| < 1,\ i = 2, \ldots, k\}$, which is obviously
a cube of dimension one less.
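One concrete choice of such a diffeomorphism, offered here only as an illustrative sketch (the text does not single out a particular map), is the coordinatewise hyperbolic tangent. It is smooth with a smooth inverse, sends $\mathbb{R}^k$ onto the open cube, and preserves the sign of every coordinate, so the half-space $t^1 \le 0$ lands in the portion of the cube with $t^1 \le 0$. The function names `to_cube` and `from_cube` are hypothetical.

```python
import math

def to_cube(t):
    """Map a point of R^k into the open cube (-1, 1)^k, coordinate by coordinate."""
    return tuple(math.tanh(c) for c in t)

def from_cube(s):
    """Inverse map, defined for points with every |s_i| < 1."""
    return tuple(math.atanh(c) for c in s)

# A point of H^3 (first coordinate <= 0) and a point of the hyperplane t^1 = 0.
point = (-2.5, 0.3, 4.0)
image = to_cube(point)
boundary_image = to_cube((0.0, 1.0, -1.0))

# Going to the cube and back recovers the original point up to rounding.
round_trip = from_cube(image)
```

Because tanh preserves signs and fixes zero, boundary points of the half-space go to the face-like slice $t^1 = 0$ of the cube, which is the property the paragraph above relies on.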

Taking account of this always-available freedom in the choice of canonical local 
charts of a surface, comparing Definitions 1 and 2 and Definition 1 of Sect. 12.1, we 
see that the following proposition holds. 

Proposition 1 The boundary of a $k$-dimensional surface of class $C^{(m)}$ is itself a
surface of the same smoothness class, and is a surface without boundary having
dimension one less than the dimension of the original surface with boundary.

Proof Indeed, if $A(S) = \{(H^k, \varphi_i, U_i)\} \cup \{(\mathbb{R}^k, \varphi_j, U_j)\}$ is an atlas
for the surface $S$ with boundary, then
$A(\partial S) = \{(\mathbb{R}^{k-1}, \varphi_i|_{\partial H^k = \mathbb{R}^{k-1}}, \partial U_i)\}$ is
obviously an atlas of the same smoothness class for $\partial S$. □

We now give some simple examples of surfaces with boundary. 

Example 1 A closed $n$-dimensional ball $\bar B^n$ in $\mathbb{R}^n$ is an $n$-dimensional surface
with boundary. Its boundary $\partial \bar B^n$ is the $(n-1)$-dimensional sphere (see Figs. 12.8
and 12.9a). The ball $\bar B^n$, which is often called in analogy with the two-dimensional
case an $n$-dimensional disk, can be homeomorphically mapped to half of an
$n$-dimensional sphere whose boundary is the equatorial $(n-1)$-dimensional sphere
(Fig. 12.9b).

Fig. 12.9

Fig. 12.10

Fig. 12.11

Example 2 The closed cube $\bar I^n$ in $\mathbb{R}^n$ can be homeomorphically mapped to the
closed ball $\bar B^n$ along rays emanating from its center. Consequently $\bar I^n$, like
$\bar B^n$, is an $n$-dimensional surface with boundary, which in this case is formed by the
faces of the cube (Fig. 12.10). We note that on the edges, which are the intersections of
the faces, it is obvious that no mapping of the cube onto the ball can be regular (that
is, smooth and of rank $n$).

Example 3 If the Möbius band is obtained by gluing together two opposite sides of
a closed rectangle, as described in Example 5 of Sect. 12.1, the result is obviously a
surface with boundary in $\mathbb{R}^3$, and the boundary is homeomorphic to a circle (to be
sure, the circle is not knotted in $\mathbb{R}^3$).

Under the other possible gluing of these sides the result is a cylindrical surface
whose boundary consists of two circles. This surface is homeomorphic to the usual
planar annulus (see Fig. 12.3 and Example 5 of Sect. 12.1). Figures 12.11a, 12.11b,
12.12a, 12.12b, 12.13a, and 12.13b, which we will use below, show pairwise
homeomorphic surfaces with boundary embedded in $\mathbb{R}^2$ and $\mathbb{R}^3$. As one can see,
the boundary of a surface may be disconnected, even when the surface itself is
connected.

Fig. 12.12

Fig. 12.13


12.3.2 Making the Orientations of a Surface and Its Boundary 
Consistent 

If an orienting orthoframe $e_1, \ldots, e_k$ that induces Cartesian coordinates
$x^1, \ldots, x^k$ is fixed in $\mathbb{R}^k$, the vectors $e_2, \ldots, e_k$ define an orientation on
the boundary $\partial H^k = \mathbb{R}^{k-1}$ of $H^k = \{x \in \mathbb{R}^k \mid x^1 \le 0\}$, which is
regarded as the orientation of the boundary $\partial H^k$ consistent with the orientation of the
half-space $H^k$ given by the frame $e_1, \ldots, e_k$.

In the case $k = 1$, where $\partial H^k = \mathbb{R}^{k-1} = \mathbb{R}^0$ is a point, a special
convention needs to be made as to how to orient the point. By definition, the point is
oriented by assigning a sign $+$ or $-$ to it. In the case $\partial H^1 = \mathbb{R}^0$, we take
$(\mathbb{R}^0, +)$, or more briefly $+\mathbb{R}^0$.

We now wish to determine what is meant in general by consistency of the orien- 
tation of a surface and its boundary. This is very important in carrying out compu- 
tations connected with surface integrals, which will be discussed below. 

We begin by verifying the following general proposition. 

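Proposition 2 The boundary $\partial S$ of a smooth orientable surface $S$ is itself a smooth
orientable surface (although possibly not connected).

Proof After we take account of Proposition 1 all that remains is to verify that
$\partial S$ is orientable. We shall show that if
$A(S) = \{(H^k, \varphi_i, U_i)\} \cup \{(\mathbb{R}^k, \varphi_j, U_j)\}$ is an orienting atlas for a
surface $S$ with boundary, then the atlas
$A(\partial S) = \{(\mathbb{R}^{k-1}, \varphi_i|_{\partial H^k = \mathbb{R}^{k-1}}, \partial U_i)\}$ of the
boundary also consists of pairwise consistent charts. To do this it obviously suffices to
verify that if $\tilde t = \psi(t)$ is a diffeomorphism with positive Jacobian from an
$H^k$-neighborhood $U_{H^k}(t_0)$ of the point $t_0$ in $\partial H^k$ onto an $H^k$-neighborhood
$\tilde U_{H^k}(\tilde t_0)$ of the point $\tilde t_0 \in \partial H^k$, then the mapping
$\psi|_{\partial U_{H^k}(t_0)}$ from the $\partial H^k$-neighborhood
$U_{\partial H^k}(t_0) = \partial U_{H^k}(t_0)$ of $t_0 \in \partial H^k$ onto the
$\partial H^k$-neighborhood $\tilde U_{\partial H^k}(\tilde t_0) = \partial \tilde U_{H^k}(\tilde t_0)$
of $\tilde t_0 = \psi(t_0) \in \partial H^k$ also has a positive Jacobian.

We remark that at each point $t_0 = (0, t_0^2, \ldots, t_0^k) \in \partial H^k$ the Jacobian $J$ of
the mapping $\psi$ has the form

$$
J(t_0) =
\begin{vmatrix}
\frac{\partial\psi^1}{\partial t^1} & 0 & \cdots & 0 \\
\frac{\partial\psi^2}{\partial t^1} & \frac{\partial\psi^2}{\partial t^2} & \cdots & \frac{\partial\psi^2}{\partial t^k} \\
\vdots & \vdots & \ddots & \vdots \\
\frac{\partial\psi^k}{\partial t^1} & \frac{\partial\psi^k}{\partial t^2} & \cdots & \frac{\partial\psi^k}{\partial t^k}
\end{vmatrix}
= \frac{\partial\psi^1}{\partial t^1}
\begin{vmatrix}
\frac{\partial\psi^2}{\partial t^2} & \cdots & \frac{\partial\psi^2}{\partial t^k} \\
\vdots & \ddots & \vdots \\
\frac{\partial\psi^k}{\partial t^2} & \cdots & \frac{\partial\psi^k}{\partial t^k}
\end{vmatrix},
$$

since for $t^1 = 0$ we must also have $\tilde t^1 = \psi^1(0, t^2, \ldots, t^k) = 0$ (boundary points
map to boundary points under a diffeomorphism), so that all the derivatives
$\partial\psi^1/\partial t^i$ with $i \ge 2$ vanish on $\partial H^k$. It now remains only to remark that
when $t^1 < 0$ we must also have $\tilde t^1 = \psi^1(t^1, t^2, \ldots, t^k) < 0$ (since
$\tilde t = \psi(t) \in H^k$), so that the value of
$\frac{\partial\psi^1}{\partial t^1}(0, t^2, \ldots, t^k)$ cannot be negative. By hypothesis
$J(t_0) > 0$, so this nonnegative factor cannot vanish and is therefore positive. It then
follows from the equality given above connecting the determinants that the Jacobian of the
mapping $\psi|_{\partial U_{H^k}} = \psi(0, t^2, \ldots, t^k)$ is positive. □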
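The block structure used in this proof can be checked on a concrete map. The sketch below uses a hypothetical diffeomorphism of the half-plane $H^2 = \{t^1 \le 0\}$ onto itself, $\psi(t^1, t^2) = (t^1 e^{t^2},\ t^1 + t^2)$, chosen only for illustration; its partial derivatives are written out by hand. At boundary points the entry $\partial\psi^1/\partial t^2$ vanishes, and the full Jacobian factors as $\partial\psi^1/\partial t^1$ times the Jacobian of the restriction to the boundary.

```python
import math

def psi(t1, t2):
    """A hypothetical diffeomorphism of the half-plane {t1 <= 0} onto itself."""
    return (t1 * math.exp(t2), t1 + t2)

def jacobi_matrix(t1, t2):
    """Matrix of partial derivatives of psi, computed by hand."""
    return [[math.exp(t2), t1 * math.exp(t2)],
            [1.0, 1.0]]

def check_boundary_point(t2):
    """Return (d psi^1/d t^2, d psi^1/d t^1, full Jacobian, boundary Jacobian) at (0, t2)."""
    m = jacobi_matrix(0.0, t2)
    full = m[0][0] * m[1][1] - m[0][1] * m[1][0]
    restricted = m[1][1]  # derivative of t2 -> psi^2(0, t2), the map induced on the boundary
    return m[0][1], m[0][0], full, restricted

results = [check_boundary_point(t2) for t2 in (-1.0, 0.0, 2.5)]
```

In every sampled boundary point the first row of the matrix is $(e^{t^2}, 0)$, so a positive full Jacobian forces a positive Jacobian on the boundary, as the proof asserts in general.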

We note that the case of a one-dimensional surface (k = 1 ) in Proposition 2 and 
Definition 3 below must be handled by a special convention in accordance with the 
convention adopted at the beginning of this subsection. 

Definition 3 If $A(S) = \{(H^k, \varphi_i, U_i)\} \cup \{(\mathbb{R}^k, \varphi_j, U_j)\}$ is an
orienting atlas of standard local charts of the surface $S$ with boundary $\partial S$, then
$A(\partial S) = \{(\mathbb{R}^{k-1}, \varphi_i|_{\partial H^k = \mathbb{R}^{k-1}}, \partial U_i)\}$ is an
orienting atlas for the boundary. The orientation of $\partial S$ that
it defines is said to be the orientation consistent with the orientation of the surface.

To finish our discussion of orientation of the boundary of an orientable surface, 
we make two useful remarks. 

Remark 1 In practice, as already noted above, an orientation of a surface embedded
in $\mathbb{R}^n$ is often defined by a frame of tangent vectors to the surface. For that reason,
the verification of the consistency of the orientation of the surface and its boundary
in this case can be carried out as follows. Take a $k$-dimensional plane $TS_{x_0}$ tangent to
the smooth surface $S$ at the point $x_0 \in \partial S$. Since the local structure of $S$ near $x_0$
is the same as the structure of the half-space $H^k$ near $0 \in \partial H^k$, directing the first
vector of the orthoframe $\xi_1, \xi_2, \ldots, \xi_k \in TS_{x_0}$ along the normal to $\partial S$ and
in the direction exterior to the local projection of $S$ on $TS_{x_0}$, we obtain a frame
$\xi_2, \ldots, \xi_k$ in the $(k-1)$-dimensional plane $T\partial S_{x_0}$ tangent to $\partial S$ at
$x_0$, which defines an orientation of $T\partial S_{x_0}$, and hence also of $\partial S$, consistent
with the orientation of the surface $S$ defined by the given frame $\xi_1, \xi_2, \ldots, \xi_k$.


Figures 12.9-12.12 show the process and the result of making the orientations of 
a surface and its boundary consistent using a simple example. 

We note that this scheme presumes that it is possible to translate a frame that 
defines the orientation of S to different points of the surface and its boundary, which, 
as examples show, may be disconnected. 

Remark 2 In the oriented space $\mathbb{R}^k$ we consider the half-spaces
$H_-^k = H^k = \{x \in \mathbb{R}^k \mid x^1 \le 0\}$ and
$H_+^k = \{x \in \mathbb{R}^k \mid x^1 \ge 0\}$ with the orientation induced from $\mathbb{R}^k$. The
hyperplane $\Gamma = \{x \in \mathbb{R}^k \mid x^1 = 0\}$ is the common boundary of $H_-^k$ and
$H_+^k$. It is easy to see that the orientations of the hyperplane $\Gamma$ consistent with the
orientations of $H_-^k$ and $H_+^k$ are opposite to each other. This also applies to the case
$k = 1$, by convention.

Similarly, if an oriented $k$-dimensional surface is cut by some $(k-1)$-dimensional
surface (for example, a sphere intersected by its equator), two opposite orientations
arise on the intersection, induced by the parts of the original surface adjacent to it.

This observation is often used in the theory of surface integrals. 

In addition, it can be used to determine the orientability of a piecewise-smooth 
surface. 

We begin by giving the definition of such a surface. 

Definition 4 (Inductive definition of a piecewise-smooth surface) We agree to call
a point a zero-dimensional surface of any smoothness class.

A piecewise-smooth one-dimensional surface (piecewise-smooth curve) is a curve
in $\mathbb{R}^n$ which breaks into smooth one-dimensional surfaces (curves) when a finite or
countable number of zero-dimensional surfaces are removed from it.

A surface $S \subset \mathbb{R}^n$ of dimension $k$ is piecewise smooth if a finite or countable
number of piecewise-smooth surfaces of dimension at most $k - 1$ can be removed
from it in such a way that the remainder decomposes into smooth $k$-dimensional
surfaces $S_i$ (with boundary or without).

Example 4 The boundary of a plane angle and the boundary of a square are
piecewise-smooth curves.

The boundary of a cube or the boundary of a right circular cone in $\mathbb{R}^3$ are
two-dimensional piecewise-smooth surfaces.

Let us now return to the orientation of a piecewise-smooth surface. 

A point (zero-dimensional surface), as already pointed out, is by convention
oriented by ascribing the sign $+$ or $-$ to it. In particular, the boundary of a closed
interval $[a, b] \subset \mathbb{R}$, which consists of the two points $a$ and $b$, is by convention
consistent with the orientation of the closed interval from $a$ to $b$ if the orientation is
$(a, -)$, $(b, +)$, or, in another notation, $-a$, $+b$.

Now let us consider a $k$-dimensional piecewise-smooth surface $S \subset \mathbb{R}^n$ ($k > 0$).
We assume that the two smooth surfaces $S_{i_1}$ and $S_{i_2}$ in Definition 4 are oriented
and abut each other along a smooth portion $\Gamma$ of a $(k-1)$-dimensional surface
(edge). Orientations then arise on $\Gamma$, which is a boundary, consistent with the
orientations of $S_{i_1}$ and $S_{i_2}$. If these two orientations are opposite on every edge
$\Gamma \subset \bar S_{i_1} \cap \bar S_{i_2}$, the original orientations of $S_{i_1}$ and $S_{i_2}$ are
considered consistent. If $\bar S_{i_1} \cap \bar S_{i_2}$ is empty or has dimension less than
$(k-1)$, all orientations of $S_{i_1}$ and $S_{i_2}$ are consistent.

Definition 5 A piecewise-smooth $k$-dimensional surface ($k > 0$) will be considered
orientable if up to a finite or countable number of piecewise-smooth surfaces of
dimension at most $(k-1)$ it is the union of smooth orientable surfaces $S_i$, any two
of which have a mutually consistent orientation.

Example 5 The surface of a three-dimensional cube, as one can easily verify, is an 
orientable piecewise-smooth surface. In general, all the piecewise-smooth surfaces 
exhibited in Example 4 are orientable. 
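The verification for the cube can be made explicit. In the sketch below (an illustration under our own conventions, not the text's procedure) each face of the unit cube is listed as a cycle of vertices traversed counterclockwise when viewed from outside. The orientations are consistent in the sense described above exactly when every edge is traversed in opposite directions by the two faces that share it. The names `FACES` and `orientations_consistent` are hypothetical.

```python
from collections import defaultdict

# Faces of the unit cube as vertex cycles, each oriented by its outward normal.
FACES = [
    ((0, 0, 0), (0, 1, 0), (1, 1, 0), (1, 0, 0)),  # z = 0, normal -z
    ((0, 0, 1), (1, 0, 1), (1, 1, 1), (0, 1, 1)),  # z = 1, normal +z
    ((0, 0, 0), (1, 0, 0), (1, 0, 1), (0, 0, 1)),  # y = 0, normal -y
    ((0, 1, 0), (0, 1, 1), (1, 1, 1), (1, 1, 0)),  # y = 1, normal +y
    ((0, 0, 0), (0, 0, 1), (0, 1, 1), (0, 1, 0)),  # x = 0, normal -x
    ((1, 0, 0), (1, 1, 0), (1, 1, 1), (1, 0, 1)),  # x = 1, normal +x
]

def directed_edges(cycle):
    """Edges of a face in the direction induced by the orientation of the face."""
    return [(cycle[i], cycle[(i + 1) % len(cycle)]) for i in range(len(cycle))]

def orientations_consistent(faces):
    """True if every edge is shared by two faces that traverse it in opposite directions."""
    seen = defaultdict(list)
    for face in faces:
        for p, q in directed_edges(face):
            seen[frozenset((p, q))].append((p, q))
    return all(len(d) == 2 and d[0] == (d[1][1], d[1][0]) for d in seen.values())

cube_is_consistent = orientations_consistent(FACES)

# Reversing the orientation of a single face breaks consistency along its four edges.
flipped = [FACES[0][::-1]] + FACES[1:]
flipped_is_consistent = orientations_consistent(flipped)
```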

Example 6 The Möbius band can easily be represented as the union of two
orientable smooth surfaces that abut along a piece of the boundary. But these surfaces
cannot be oriented consistently. One can verify that the Möbius band is not an
orientable surface, even from the point of view of Definition 5.
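A combinatorial version of this example can be tested by brute force. The sketch below is an illustration under our own assumptions: the band is modeled by the standard five-triangle triangulation with triangles $(i, i+1, i+2)$ taken modulo 5, and the same construction modulo 6 gives an annulus for comparison. An orientation of each triangle is a choice of sign, and a choice is consistent when no directed edge is used twice, that is, when adjacent triangles traverse their common edge in opposite directions.

```python
from itertools import product

def strip(n):
    """Triangles (i, i+1, i+2) mod n: a Moebius band for n = 5, an annulus for n = 6."""
    return [(i, (i + 1) % n, (i + 2) % n) for i in range(n)]

def directed_edges(triangle, sign):
    """Directed edges of a triangle taken with the given orientation sign."""
    a, b, c = triangle if sign > 0 else triangle[::-1]
    return [(a, b), (b, c), (c, a)]

def is_orientable(triangles):
    """Try every assignment of signs and look for one with no repeated directed edge."""
    for signs in product((1, -1), repeat=len(triangles)):
        used = set()
        consistent = True
        for triangle, sign in zip(triangles, signs):
            for edge in directed_edges(triangle, sign):
                if edge in used:  # two adjacent triangles run along this edge the same way
                    consistent = False
                used.add(edge)
        if consistent:
            return True
    return False

moebius_orientable = is_orientable(strip(5))
annulus_orientable = is_orientable(strip(6))
```

The search is exponential in the number of triangles, which is harmless for these tiny models. The five triangles of the band form an odd cycle of neighbors whose signs would have to alternate, which is why no consistent choice exists.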


12.3.3 Problems and Exercises 

1. a) Is it true that the boundary of a surface $S \subset \mathbb{R}^n$ is the set
$\bar S \setminus S$, where $\bar S$ is the closure of $S$ in $\mathbb{R}^n$?

b) Do the surfaces $S_1 = \{(x, y) \in \mathbb{R}^2 \mid 1 < x^2 + y^2 < 2\}$ and
$S_2 = \{(x, y) \in \mathbb{R}^2 \mid 0 < x^2 + y^2\}$ have a boundary?

c) Give the boundary of the surfaces $S_1 = \{(x, y) \in \mathbb{R}^2 \mid 1 \le x^2 + y^2 < 2\}$ and
$S_2 = \{(x, y) \in \mathbb{R}^2 \mid 1 \le x^2 + y^2\}$.

2. Give an example of a nonorientable surface with an orientable boundary.

3. a) Each face of the cube $I^k = \{x \in \mathbb{R}^k \mid |x^i| < 1,\ i = 1, \ldots, k\}$ is parallel
to the corresponding $(k-1)$-dimensional coordinate hyperplane in $\mathbb{R}^k$, so that one may
consider the same frame and the same coordinate system in the face as in the hyperplane. On
which faces is the resulting orientation consistent with the orientation of the cube
$I^k$ induced by the orientation of $\mathbb{R}^k$, and on which is it not consistent? Consider
successively the cases $k = 2$, $k = 3$, and $k = n$.

b) The local chart $(t^1, t^2) \mapsto (\sin t^1 \cos t^2, \sin t^1 \sin t^2, \cos t^1)$ acts in a
certain domain of the hemisphere
$S = \{(x, y, z) \in \mathbb{R}^3 \mid x^2 + y^2 + z^2 = 1 \wedge z > 0\}$, and
the local chart $t \mapsto (\cos t, \sin t, 0)$ acts in a certain domain of the boundary
$\partial S$ of this hemisphere. Determine whether these charts give a consistent orientation of
the surface $S$ and its boundary $\partial S$.

c) Construct the field of frames on the hemisphere $S$ and its boundary $\partial S$
induced by the local charts shown in b).

d) On the boundary $\partial S$ of the hemisphere $S$ exhibit a frame that defines the
orientation of the boundary consistent with the orientation of the hemisphere given
in c).

e) Define the orientation of the hemisphere $S$ obtained in c) using a normal
vector to $S \subset \mathbb{R}^3$.

4. a) Verify that the Möbius band is not an orientable surface even from the point
of view of Definition 5.

b) Show that if $S$ is a smooth surface in $\mathbb{R}^n$, determining its orientability as a
smooth surface and as a piecewise-smooth surface are equivalent processes.

5. a) We shall say that a set $S \subset \mathbb{R}^n$ is a $k$-dimensional surface with boundary if
for each point $x \in S$ there exists a neighborhood $U(x)$ in $\mathbb{R}^n$ and a diffeomorphism
$\psi : U(x) \to I^n$ of this neighborhood onto the standard cube $I^n \subset \mathbb{R}^n$ under
which $\psi(S \cap U(x))$ coincides either with the cube
$I^k = \{t \in I^n \mid t^{k+1} = \cdots = t^n = 0\}$ or with a portion of it
$I^k \cap \{t \in \mathbb{R}^n \mid t^k \le 0\}$ that is a $k$-dimensional open interval with
one of its faces attached.

Based on what was said in Sect. 12.1 in the discussion of the concept of a surface,
show that this definition of a surface with boundary is equivalent to Definition 1.

b) Is it true that if $f \in C^{(l)}(H^k, \mathbb{R})$, where
$H^k = \{x \in \mathbb{R}^k \mid x^1 \le 0\}$, then for every point $x \in \partial H^k$ one can find a
neighborhood of it $U(x)$ in $\mathbb{R}^k$ and a function $F \in C^{(l)}(U(x), \mathbb{R})$ such that
$F|_{H^k \cap U(x)} = f|_{H^k \cap U(x)}$?

c) If the definition given in part a) is used to describe a smooth surface with
boundary, that is, we regard $\psi$ as a smooth mapping of maximal rank, will this
definition of a smooth surface with boundary be the same as the one adopted in
Sect. 12.3?


12.4 The Area of a Surface in Euclidean Space 

We now turn to the problem of defining the area of a $k$-dimensional piecewise-smooth
surface embedded in the Euclidean space $\mathbb{R}^n$, $n \ge k$.

We begin by recalling that if $\xi_1, \ldots, \xi_k$ are $k$ vectors in Euclidean space
$\mathbb{R}^k$, then the volume $V(\xi_1, \ldots, \xi_k)$ of the parallelepiped spanned by these
vectors as edges can be computed as the determinant

$$ V(\xi_1, \ldots, \xi_k) = \det(\xi_i^j) \tag{12.6} $$

of the matrix $J = (\xi_i^j)$ whose rows are formed by the coordinates of these vectors
in some orthonormal basis $e_1, \ldots, e_k$ of $\mathbb{R}^k$. We note, however, that in actual fact
formula (12.6) gives the so-called oriented volume of the parallelepiped rather than
simply the volume. If $V \neq 0$, the value of $V$ given by (12.6) is positive or negative
according as the frames $e_1, \ldots, e_k$ and $\xi_1, \ldots, \xi_k$ belong to the same or opposite
orientation classes of $\mathbb{R}^k$.

Fig. 12.14
We now remark that the product $JJ^*$ of the matrix $J$ and its transpose $J^*$ has
elements that are none other than the elements of the matrix $G = (g_{ij})$ of pairwise inner
products $g_{ij} = \langle \xi_i, \xi_j \rangle$ of these vectors, that is, the Gram matrix^8 of the
system of vectors $\xi_1, \ldots, \xi_k$. Thus

$$ \det G = \det(JJ^*) = \det J \det J^* = (\det J)^2, \tag{12.7} $$

and hence the nonnegative value of the volume $V(\xi_1, \ldots, \xi_k)$ can be obtained as

$$ V(\xi_1, \ldots, \xi_k) = \sqrt{\det\bigl(\langle \xi_i, \xi_j \rangle\bigr)}. \tag{12.8} $$

This last formula is convenient in that it is essentially coordinate-free, containing
only a set of geometric quantities that characterize the parallelepiped under
consideration. In particular, if these same vectors $\xi_1, \ldots, \xi_k$ are regarded as
embedded in $n$-dimensional Euclidean space $\mathbb{R}^n$ ($n \ge k$), formula (12.8) for the
$k$-dimensional volume (or $k$-dimensional surface area) of the parallelepiped they span
remains unchanged.
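As a small numerical illustration of formula (12.8), the following sketch computes the $k$-dimensional volume of a parallelepiped from the Gram matrix of its edge vectors. The helper names `det` and `gram_volume` are hypothetical, and the determinant is computed by cofactor expansion, which is adequate only for small $k$.

```python
import math

def det(m):
    """Determinant by cofactor expansion along the first row (fine for small matrices)."""
    if len(m) == 1:
        return m[0][0]
    total = 0.0
    for j, entry in enumerate(m[0]):
        minor = [row[:j] + row[j + 1:] for row in m[1:]]
        total += (-1) ** j * entry * det(minor)
    return total

def gram_volume(vectors):
    """k-dimensional volume of the parallelepiped spanned by k vectors in R^n."""
    gram = [[sum(a * b for a, b in zip(u, v)) for v in vectors] for u in vectors]
    # Rounding can push a zero determinant slightly below zero, hence the clamp.
    return math.sqrt(max(det(gram), 0.0))

# Two vectors in R^3: the Gram determinant is 1 * 9 - 1 * 1 = 8.
parallelogram_area = gram_volume([(1.0, 0.0, 0.0), (1.0, 2.0, 2.0)])

# For k = n the result agrees with |det J| from (12.6): here |2 * 3 - 0 * 1| = 6.
square_case = gram_volume([(2.0, 0.0), (1.0, 3.0)])
```

The first value equals the length of the vector product of the two edges, and the second shows the agreement between (12.6) and (12.8) when the number of vectors equals the dimension of the space.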

Now let $r : D \to S \subset \mathbb{R}^n$ be a $k$-dimensional smooth surface $S$ in the Euclidean
space $\mathbb{R}^n$ defined in parametric form $r = r(t^1, \ldots, t^k)$, that is, as a smooth
vector-valued function $r(t) = (x^1, \ldots, x^n)(t)$ defined in the domain
$D \subset \mathbb{R}^k$. Let $e_1, \ldots, e_k$ be the orthonormal basis in $\mathbb{R}^k$ that generates
the coordinate system $(t^1, \ldots, t^k)$. After fixing a point
$t_0 = (t_0^1, \ldots, t_0^k) \in D$, we take the positive numbers $h^1, \ldots, h^k$ to be so small
that the parallelepiped $I$ spanned by the vectors $h^i e_i \in TD_{t_0}$, $i = 1, \ldots, k$,
attached at the point $t_0$ is contained in $D$.

Under the mapping $D \to S$ a figure $I_S$ on the surface $S$, which we may provisionally
call a curvilinear parallelepiped, corresponds to the parallelepiped $I$ (see
Fig. 12.14, which corresponds to the case $k = 2$, $n = 3$). Since

$$
r(t_0^1, \ldots, t_0^{i-1}, t_0^i + h^i, t_0^{i+1}, \ldots, t_0^k) - r(t_0^1, \ldots, t_0^k)
= \frac{\partial r}{\partial t^i}(t_0)\, h^i + o(h^i),
$$

a displacement in $\mathbb{R}^n$ from $r(t_0)$ that can be replaced, up to $o(h^i)$, by the partial
differential $\frac{\partial r}{\partial t^i}(t_0)\, h^i =: \dot r_i h^i$ as $h^i \to 0$ corresponds
to displacement from $t_0$ by $h^i e_i$. Thus, for small values of $h^i$, $i = 1, \ldots, k$, the
curvilinear parallelepiped $I_S$ differs only slightly from the parallelepiped spanned by
the vectors $h^1 \dot r_1, \ldots, h^k \dot r_k$ tangent to the surface $S$ at $r(t_0)$. Assuming on
that basis that the volume $\Delta V$ of the curvilinear parallelepiped $I_S$ must also be close
to the volume of the standard parallelepiped just exhibited, we find the approximate
formula

$$ \Delta V \approx \sqrt{\det(g_{ij})(t_0)}\, \Delta t^1 \cdots \Delta t^k, \tag{12.9} $$

where we have set $g_{ij}(t_0) = \langle \dot r_i, \dot r_j \rangle(t_0)$ and $\Delta t^i = h^i$,
$i, j = 1, \ldots, k$.

^8 See the footnote on p. 497.

If we now tile the entire space $\mathbb{R}^k$ containing the parameter domain $D$ with
$k$-dimensional parallelepipeds of small diameter $d$, take the ones that are contained
in $D$, compute an approximate value of the $k$-dimensional volume of their images
using formula (12.9), and then sum the resulting values, we arrive at the quantity

$$ \sum_\alpha \sqrt{\det(g_{ij})(t_\alpha)}\, \Delta t^1 \cdots \Delta t^k, $$

which can be regarded as an approximation to the $k$-dimensional volume or area of
the surface $S$ under consideration, and this approximate value should become more
precise as $d \to 0$. Thus we adopt the following definition.

Definition 1 The area (or $k$-dimensional volume) of a smooth $k$-dimensional surface
$S$ given parametrically by $D \ni t \mapsto r(t) \in S$ and embedded in the Euclidean
space $\mathbb{R}^n$ is the quantity

$$ V_k(S) := \int_D \sqrt{\det\bigl(\langle \dot r_i, \dot r_j \rangle\bigr)}\; dt^1 \cdots dt^k. \tag{12.10} $$
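The sum that motivates this definition can be evaluated directly for $k = 2$. The sketch below is only an illustration under stated assumptions: the parameter domain is a rectangle, the tangent vectors are approximated by central differences with a hypothetical step `h`, and the integral in (12.10) is replaced by a midpoint sum. The function name `surface_area` and the hemisphere chart are our own choices.

```python
import math

def surface_area(r, u_range, v_range, n=200, h=1e-6):
    """Midpoint sum of sqrt(det(g_ij)) over a rectangular parameter domain (k = 2)."""
    (u0, u1), (v0, v1) = u_range, v_range
    du, dv = (u1 - u0) / n, (v1 - v0) / n
    total = 0.0
    for i in range(n):
        u = u0 + (i + 0.5) * du
        for j in range(n):
            v = v0 + (j + 0.5) * dv
            # Central differences approximate the tangent vectors dr/du and dr/dv.
            ru = [(a - b) / (2 * h) for a, b in zip(r(u + h, v), r(u - h, v))]
            rv = [(a - b) / (2 * h) for a, b in zip(r(u, v + h), r(u, v - h))]
            g11 = sum(a * a for a in ru)
            g12 = sum(a * b for a, b in zip(ru, rv))
            g22 = sum(b * b for b in rv)
            total += math.sqrt(max(g11 * g22 - g12 * g12, 0.0)) * du * dv
    return total

def hemisphere(theta, phi):
    """Spherical-coordinate chart of the upper unit hemisphere."""
    return (math.sin(theta) * math.cos(phi),
            math.sin(theta) * math.sin(phi),
            math.cos(theta))

area = surface_area(hemisphere, (0.0, math.pi / 2), (0.0, 2 * math.pi))
```

For this chart the integrand is $\sin\theta$, so the exact value is $2\pi$, and the midpoint sum approaches it as the grid is refined.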

Let us see how formula (12.10) looks in the cases that we already know about.
For $k = 1$ the domain $D \subset \mathbb{R}^1$ is an interval with certain endpoints $a$ and $b$
($a < b$) on the line $\mathbb{R}^1$, and $S$ is a curve in $\mathbb{R}^n$ in this case. Thus for $k = 1$
formula (12.10) becomes the formula

$$ V_1(S) = \int_a^b |\dot r(t)|\, dt = \int_a^b \sqrt{(\dot x^1)^2 + \cdots + (\dot x^n)^2}\,(t)\, dt $$

for computing the length of a curve.

If $k = n$, then $S$ is an $n$-dimensional domain in $\mathbb{R}^n$ diffeomorphic to $D$. In this
case the Jacobian matrix $J = x'(t)$ of the mapping
$D \ni (t^1, \ldots, t^n) = t \mapsto r(t) = (x^1, \ldots, x^n)(t) \in S$ is a square matrix. Now
using relation (12.7) and the formula for change of variable in a multiple integral, one
can write

$$ V_n(S) = \int_D \sqrt{\det G(t)}\, dt = \int_D |\det x'(t)|\, dt = \int_S dx = V(S). $$

That is, as one should have expected, we have arrived at the volume of the domain
$S$ in $\mathbb{R}^n$.

We note that for $k = 2$, $n = 3$, that is, when $S$ is a two-dimensional surface in
$\mathbb{R}^3$, one often replaces the standard notation
$g_{ij} = \langle \dot r_i, \dot r_j \rangle$ by the following: $\sigma := V_2(S)$,
$E := g_{11} = \langle \dot r_1, \dot r_1 \rangle$,
$F := g_{12} = g_{21} = \langle \dot r_1, \dot r_2 \rangle$,
$G := g_{22} = \langle \dot r_2, \dot r_2 \rangle$; and one writes
$u$, $v$ respectively instead of $t^1$, $t^2$. In this notation formula (12.10) assumes the form

$$ \sigma = \iint_D \sqrt{EG - F^2}\; du\, dv. $$

In particular, if $u = x$, $v = y$, and the surface $S$ is the graph of a smooth real-valued
function $z = f(x, y)$ defined in a domain $D \subset \mathbb{R}^2$, then, as one can easily
compute,

$$ \sigma = \iint_D \sqrt{1 + (f'_x)^2 + (f'_y)^2}\; dx\, dy. $$
We now return once again to Definition 1 and make a number of remarks that 
will be useful later. 

Remark 1 Definition 1 makes sense only when the integral on the right-hand side
of (12.10) exists. It demonstrably exists, for example, if $D$ is a Jordan-measurable
domain and $r \in C^{(1)}(\bar D, \mathbb{R}^n)$.

Remark 2 If the surface $S$ in Definition 1 is partitioned into a finite number of
surfaces $S_1, \ldots, S_m$ with piecewise-smooth boundaries, the same kind of partition of
the domain $D$ into domains $D_1, \ldots, D_m$ corresponding to these surfaces will
correspond to this partition. If the surface $S$ had area in the sense of Eq. (12.10), then the
quantities

$$ V_k(S_\alpha) = \int_{D_\alpha} \sqrt{\det\langle \dot r_i, \dot r_j \rangle(t)}\; dt $$

are defined for each value of $\alpha = 1, \ldots, m$.

By the additivity of the integral, it follows that

$$ V_k(S) = \sum_\alpha V_k(S_\alpha). $$

We have thus established that the area of a $k$-dimensional surface is additive in
the same sense as the ordinary multiple integral.

Remark 3 This last remark allows us to exhaust the domain D when necessary, and 
thereby to extend the meaning of the formula (12.10), in which the integral may 
now be interpreted as an improper integral. 


Remark 4 More importantly, the additivity of area can be used to define the area of
an arbitrary smooth or even piecewise-smooth surface (not necessarily given by a
single chart).

Definition 2 Let $S$ be an arbitrary piecewise-smooth $k$-dimensional surface in
$\mathbb{R}^n$. If, after a finite or countable number of piecewise-smooth surfaces of dimension at
most $k - 1$ are removed, it breaks up into a finite or countable number of smooth
parametrized surfaces $S_1, \ldots, S_m, \ldots$, we set

$$ V_k(S) := \sum_\alpha V_k(S_\alpha). $$

The additivity of the multiple integral makes it possible to verify that the quantity
$V_k(S)$ so defined is independent of the way in which the surface $S$ is partitioned into
smooth pieces $S_1, \ldots, S_m, \ldots$, each of which is contained in the range of some local
chart of the surface $S$.

We further remark that it follows easily from the definitions of smooth and
piecewise-smooth surfaces that the partition of $S$ into parametrized pieces, as described
in Definition 2, is always possible, and can even be done while observing the natural
additional requirement that the partition be locally finite. The latter means that any
compact set $K \subset S$ can intersect only a finite number of the surfaces
$S_1, \ldots, S_m, \ldots$. This can be expressed more vividly in another way: every point of $S$
must have a neighborhood that intersects at most a finite number of the sets
$S_1, \ldots, S_m, \ldots$.


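Remark 5 The basic formula (12.10) contains a system of curvilinear coordinates
$t^1, \ldots, t^k$. For that reason, it is natural to verify that the quantity $V_k(S)$ defined by
(12.10) (and thereby also the quantity $V_k(S)$ from Definition 2) is invariant under a
diffeomorphic transition
$\tilde D \ni (\tilde t^1, \ldots, \tilde t^k) \mapsto t = (t^1, \ldots, t^k) \in D$ to new curvilinear
coordinates $\tilde t^1, \ldots, \tilde t^k$ varying in the domain $\tilde D \subset \mathbb{R}^k$.

Proof For the verification it suffices to remark that the matrices

$$
G = (g_{ij}) = \left( \left\langle \frac{\partial r}{\partial t^i}, \frac{\partial r}{\partial t^j} \right\rangle \right)
\quad\text{and}\quad
\tilde G = (\tilde g_{ij}) = \left( \left\langle \frac{\partial r}{\partial \tilde t^i}, \frac{\partial r}{\partial \tilde t^j} \right\rangle \right)
$$

at corresponding points of the domains $D$ and $\tilde D$ are connected by the relation
$\tilde G = J^* G J$, where $J = \bigl(\frac{\partial t^i}{\partial \tilde t^j}\bigr)$ is the Jacobian matrix
of the mapping $\tilde D \ni \tilde t \mapsto t \in D$ and $J^*$ is the transpose of the matrix $J$.
Thus, $\det \tilde G(\tilde t) = \det G(t)\,(\det J)^2(\tilde t)$, from which it follows that

$$
\int_D \sqrt{\det G(t)}\; dt
= \int_{\tilde D} \sqrt{\det G(t(\tilde t))}\; |\det J(\tilde t)|\; d\tilde t
= \int_{\tilde D} \sqrt{\det \tilde G(\tilde t)}\; d\tilde t.
$$

□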
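The relation between the two Gram matrices can be checked numerically at a single point. The sketch below uses an example of our own choosing: the surface $r = (t^1, t^2, t^1 t^2)$ and the polar change of parameters $t^1 = \rho\cos\varphi$, $t^2 = \rho\sin\varphi$, with all derivatives written out by hand. The helper names are hypothetical.

```python
import math

def gram(vectors):
    """Matrix of pairwise inner products of the given vectors."""
    return [[sum(a * b for a, b in zip(u, v)) for v in vectors] for u in vectors]

def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(len(b))) for j in range(len(b[0]))]
            for i in range(len(a))]

def transpose(a):
    return [list(col) for col in zip(*a)]

rho, phi = 0.7, 0.4
t1, t2 = rho * math.cos(phi), rho * math.sin(phi)

# Gram matrix in the old coordinates: tangent vectors of r = (t1, t2, t1 * t2).
G = gram([(1.0, 0.0, t2), (0.0, 1.0, t1)])

# Gram matrix in the new coordinates, from r = (rho cos, rho sin, rho^2 sin cos).
r_rho = (math.cos(phi), math.sin(phi), 2 * rho * math.sin(phi) * math.cos(phi))
r_phi = (-rho * math.sin(phi), rho * math.cos(phi), rho ** 2 * math.cos(2 * phi))
G_new = gram([r_rho, r_phi])

# Jacobian matrix of (rho, phi) -> (t1, t2); its determinant equals rho.
J = [[math.cos(phi), -rho * math.sin(phi)],
     [math.sin(phi), rho * math.cos(phi)]]
G_transformed = matmul(matmul(transpose(J), G), J)
```

Here `G_new` and `G_transformed` agree entry by entry, and the determinant of `G_new` equals the determinant of `G` multiplied by $\rho^2 = (\det J)^2$, which is exactly the factor absorbed by the change of variable in the integral.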


Thus, we have given a definition of the $k$-dimensional volume or area of a
$k$-dimensional piecewise-smooth surface that is independent of the choice of
coordinate system.

Remark 6 We precede the remark with a definition.

Definition 3 A set $E$ embedded in a $k$-dimensional piecewise-smooth surface $S$ is
a set of $k$-dimensional measure zero or has area zero in the Lebesgue sense if for
every $\varepsilon > 0$ it can be covered by a finite or countable system $S_1, \ldots, S_m, \ldots$ of
(possibly intersecting) surfaces $S_\alpha \subset S$ such that
$\sum_\alpha V_k(S_\alpha) < \varepsilon$.

As one can see, this is a verbatim repetition of the definition of a set of measure
zero in $\mathbb{R}^k$.

It is easy to see that in the parameter domain $D$ of any local chart $\varphi : D \to S$ of
a piecewise-smooth surface $S$ the set $\varphi^{-1}(E) \subset D \subset \mathbb{R}^k$ of
$k$-dimensional measure zero corresponds to such a set $E$. One can even verify that this is
the characteristic property of sets $E \subset S$ of measure zero.

In the practical computation of areas and the surface integrals introduced below,
it is useful to keep in mind that if a piecewise-smooth surface $\tilde S$ has been obtained
from a piecewise-smooth surface $S$ by removing a set $E$ of measure zero from $S$,
then the areas of $\tilde S$ and $S$ are the same.

The usefulness of this remark lies in the fact that it is often easy to remove such
a set of measure zero from a piecewise-smooth surface in such a way that the result
is a smooth surface $\tilde S$ defined by a single chart. But then the area of $\tilde S$, and hence
the area of $S$ also, can be computed directly by formula (12.10).

Let us consider some examples. 

Example 1 The mapping $]0, 2\pi[ \ni t \mapsto (R\cos t, R\sin t) \in \mathbb{R}^2$ is a chart for
the arc $\tilde S$ of the circle $S$, $x^2 + y^2 = R^2$, obtained by removing the single point
$E = (R, 0)$ from that circle. Since $E$ is a set of measure zero on $S$, we can write

$$ V_1(S) = V_1(\tilde S) = \int_0^{2\pi} \sqrt{R^2 \sin^2 t + R^2 \cos^2 t}\; dt = 2\pi R. $$

Example 2 In Example 4 of Sect. 12.1 we exhibited the following parametric
representation of the two-dimensional torus $S$ in $\mathbb{R}^3$:

$$ r(\varphi, \psi) = \bigl((b + a\cos\psi)\cos\varphi,\ (b + a\cos\psi)\sin\varphi,\ a\sin\psi\bigr). $$

In the domain $D = \{(\varphi, \psi) \mid 0 < \varphi < 2\pi,\ 0 < \psi < 2\pi\}$ the mapping
$(\varphi, \psi) \mapsto r(\varphi, \psi)$ is a diffeomorphism. The image $\tilde S$ of the domain $D$
under this diffeomorphism differs from the torus by the set $E$ consisting of the coordinate
line $\varphi = 2\pi$ and the line $\psi = 2\pi$. The set $E$ thus consists of one parallel of
latitude and one meridian of longitude of the torus, and, as one can easily see, has measure
zero. Hence the area of the torus can be found by formula (12.10) starting from this
parametric representation, considered within the domain $D$.

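Let us carry out the necessary computations:

$$
\begin{aligned}
\dot r_\varphi &= \bigl(-(b + a\cos\psi)\sin\varphi,\ (b + a\cos\psi)\cos\varphi,\ 0\bigr), \\
\dot r_\psi &= (-a\sin\psi\cos\varphi,\ -a\sin\psi\sin\varphi,\ a\cos\psi), \\
g_{11} &= \langle \dot r_\varphi, \dot r_\varphi \rangle = (b + a\cos\psi)^2, \\
g_{12} &= g_{21} = \langle \dot r_\varphi, \dot r_\psi \rangle = 0, \\
g_{22} &= \langle \dot r_\psi, \dot r_\psi \rangle = a^2, \\
\det G &= \begin{vmatrix} g_{11} & g_{12} \\ g_{21} & g_{22} \end{vmatrix} = a^2 (b + a\cos\psi)^2.
\end{aligned}
$$

Consequently,

$$ V_2(S) = V_2(\tilde S) = \int_0^{2\pi} d\varphi \int_0^{2\pi} a(b + a\cos\psi)\, d\psi = 4\pi^2 ab. $$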
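The torus computation can be confirmed numerically from the tangent vectors themselves. The sketch below assumes the sample radii `a = 1` and `b = 3` (hypothetical values with $b > a$), forms the coefficients $g_{ij}$ from the two tangent vectors written above, and sums $\sqrt{\det G}$ over a midpoint grid in the parameter square.

```python
import math

a, b = 1.0, 3.0  # hypothetical sample radii with b > a

def tangent_vectors(phi, psi):
    """Partial derivatives of the torus parametrization with respect to phi and psi."""
    r_phi = (-(b + a * math.cos(psi)) * math.sin(phi),
             (b + a * math.cos(psi)) * math.cos(phi),
             0.0)
    r_psi = (-a * math.sin(psi) * math.cos(phi),
             -a * math.sin(psi) * math.sin(phi),
             a * math.cos(psi))
    return r_phi, r_psi

def dot(u, v):
    return sum(x * y for x, y in zip(u, v))

def area_element(phi, psi):
    """sqrt(det G) computed from the Gram matrix of the tangent vectors."""
    r_phi, r_psi = tangent_vectors(phi, psi)
    g11, g12, g22 = dot(r_phi, r_phi), dot(r_phi, r_psi), dot(r_psi, r_psi)
    return math.sqrt(g11 * g22 - g12 * g12)

n = 100
step = 2 * math.pi / n
area = sum(area_element((i + 0.5) * step, (j + 0.5) * step) * step * step
           for i in range(n) for j in range(n))
```

Since the integrand is a trigonometric polynomial in $\psi$ and does not depend on $\varphi$, the midpoint sum over a full period reproduces $4\pi^2 ab$ up to rounding error, and the area element at any point agrees with $a(b + a\cos\psi)$.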

In conclusion we note that the method indicated in Definition 2 can now be used 
to compute the areas of piecewise-smooth curves and surfaces. 


12.4.1 Problems and Exercises 

1. a) Let $P$ and $\tilde P$ be two hyperplanes in the Euclidean space $\mathbb{R}^n$, $D$ a
subdomain of $P$, and $\tilde D$ the orthogonal projection of $D$ on the hyperplane $\tilde P$.
Show that the $(n-1)$-dimensional areas of $D$ and $\tilde D$ are connected by the relation
$\sigma(\tilde D) = \sigma(D) \cos\alpha$, where $\alpha$ is the angle between the hyperplanes $P$
and $\tilde P$.

b) Taking account of the result of a), give the geometric meaning of the formula
$d\sigma = \sqrt{1 + (f'_x)^2 + (f'_y)^2}\; dx\, dy$ for the element of area of the graph of a
smooth function $z = f(x, y)$ in three-dimensional Euclidean space.

c) Show that if the surface $S$ in Euclidean space $\mathbb{R}^3$ is defined as a smooth
vector-valued function $r = r(u, v)$ defined in a domain $D \subset \mathbb{R}^2$, then the area of
the surface $S$ can be found by the formula

$$ \sigma(S) = \iint_D \bigl| [r'_u, r'_v] \bigr|\; du\, dv, $$

where $[r'_u, r'_v]$ is the vector product of $\frac{\partial r}{\partial u}$ and
$\frac{\partial r}{\partial v}$.

d) Verify that if the surface $S \subset \mathbb{R}^3$ is defined by the equation
$F(x, y, z) = 0$ and the domain $U$ of the surface $S$ projects orthogonally in a one-to-one
manner onto the domain $D$ of the $xy$-plane, we have the formula

$$ \sigma(U) = \iint_D \frac{|\operatorname{grad} F|}{|F'_z|}\; dx\, dy. $$
2. Find the area of the spherical rectangle formed by two parallels of latitude and 
two meridians of longitude of the sphere S C R 3 . 

3. a) Let $(r, \varphi, h)$ be cylindrical coordinates in $\mathbb{R}^3$. A smooth curve lying in the
plane $\varphi = \varphi_0$ and defined there by the equation $r = r(s)$, where $s$ is the arc length
parameter, is revolved about the $h$-axis. Show that the area of the surface obtained
by revolving the piece of this curve corresponding to the closed interval $[s_1, s_2]$ of
variation of the parameter $s$ can be found by the formula

$$\sigma = 2\pi \int_{s_1}^{s_2} r(s)\,ds.$$

b) The graph of a smooth nonnegative function $y = f(x)$ defined on a closed
interval $[a,b] \subset \mathbb{R}_+$ is revolved about the $x$-axis, then about the $y$-axis. In each of
these cases, write the formula for the area of the corresponding surface of revolution
as an integral over the closed interval $[a,b]$.

4. a) The center of a ball of radius 1 slides along a smooth closed plane curve of
length $L$. Show that the area of the surface of the tubular body thereby formed is
$2\pi \cdot 1 \cdot L$.

b) Based on the result of part a), find the area of the two-dimensional torus 
obtained by revolving a circle of radius a about an axis lying in the plane of the 
circle and lying at distance b > a from its center. 

5. Describe the helical surface defined in Cartesian coordinates $(x, y, z)$ in $\mathbb{R}^3$ by
the equation

$$y - x\tan\frac{z}{h} = 0, \qquad |z| \le \frac{\pi}{2}h,$$

and find the area of the portion of it for which $r^2 \le x^2 + y^2 \le R^2$.

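6. a) Show that the area $\sigma_{n-1}$ of the unit sphere in $\mathbb{R}^n$ is $\dfrac{2(\sqrt{\pi})^n}{\Gamma(\frac{n}{2})}$, where
$\Gamma(\alpha) = \int_0^\infty e^{-x}x^{\alpha-1}\,dx$. (In particular, if $n$ is even, then $\Gamma(\frac{n}{2}) = \bigl(\frac{n-2}{2}\bigr)!$, while if $n$ is odd,
$\Gamma(\frac{n}{2}) = \dfrac{(n-2)!!}{2^{(n-1)/2}}\sqrt{\pi}$.)

b) By verifying that the volume $V_n(r)$ of the ball of radius $r$ in $\mathbb{R}^n$ is
$\dfrac{(\sqrt{\pi})^n}{\Gamma(\frac{n+2}{2})}\,r^n$, show that $\dfrac{dV_n}{dr}\Big|_{r=1} = \sigma_{n-1}$.

c) Find the limit as $n \to \infty$ of the ratio of the area of the hemisphere
$\{x \in \mathbb{R}^n \mid |x| = 1 \wedge x^n > 0\}$ to the area of its orthogonal projection on the plane $x^n = 0$.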

d) Show that as $n \to \infty$, the majority of the volume of the $n$-dimensional ball is
concentrated in an arbitrarily small neighborhood of the boundary sphere, and the
majority of the area of the sphere is concentrated in an arbitrarily small neighborhood
of its equator.

e) Show that the following beautiful corollary on concentration phenomena follows
from the observation made in d).

A regular function that is continuous on a sphere of large dimension is nearly
constant on it (recall pressure in thermodynamics).

Specifically, let us consider, for example, functions satisfying a Lipschitz condition
with a fixed constant. Then for any $\varepsilon > 0$ and $\delta > 0$ there exists $N$ such that
for $n > N$ and any function $f : S^n \to \mathbb{R}$ there exists a value $c$ with the following
properties: the area of the set on which the value of $f$ differs from $c$ by more than $\varepsilon$
is at most $\delta$ times the area of the whole sphere.
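
The formulas in parts a), b), and d) of Problem 6 can be explored numerically. The following is a minimal sketch, not a solution of the problem: the function names are illustrative, and the shell fraction $1 - (1-\varepsilon)^n$ relies only on the scaling $V_n(r) = V_n(1)\,r^n$ stated in b).

```python
import math

def sphere_area(n: int) -> float:
    """Area of the unit sphere in R^n: 2 * (sqrt(pi))^n / Gamma(n/2)."""
    return 2.0 * math.pi ** (n / 2) / math.gamma(n / 2)

def ball_volume(n: int, r: float = 1.0) -> float:
    """Volume of the ball of radius r in R^n: (sqrt(pi))^n / Gamma((n+2)/2) * r^n."""
    return math.pi ** (n / 2) / math.gamma((n + 2) / 2) * r ** n

def shell_fraction(n: int, eps: float) -> float:
    """Share of the unit ball's volume within distance eps of the boundary sphere."""
    return 1.0 - (1.0 - eps) ** n

if __name__ == "__main__":
    for n in (2, 3, 10, 100):
        # dV_n/dr at r = 1 equals n * V_n(1), which should match sphere_area(n).
        print(n, sphere_area(n), n * ball_volume(n), shell_fraction(n, 0.05))
```

For a fixed $\varepsilon$ the last column tends to 1 as $n$ grows, which is the volume half of the concentration statement in d).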
7. a) Let $x_1, \ldots, x_k$ be a system of vectors in Euclidean space $\mathbb{R}^n$, $n \ge k$. Show that
the Gram determinant of this system can be represented as

$$\det\bigl(\langle x_i, x_j\rangle\bigr) = \sum_{1 \le i_1 < \cdots < i_k \le n} P^2_{i_1\cdots i_k},$$

where

$$P_{i_1\cdots i_k} = \det\begin{pmatrix} x_1^{i_1} & \cdots & x_1^{i_k} \\ \vdots & & \vdots \\ x_k^{i_1} & \cdots & x_k^{i_k} \end{pmatrix}.$$

b) Explain the geometric meaning of the quantities $P_{i_1\cdots i_k}$ from a) and state the
result of a) as the Pythagorean theorem for measures of arbitrary dimension $k$,
$1 \le k \le n$.

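c) Now explain the formula

$$\sigma = \int_D \sqrt{\sum_{1 \le i_1 < \cdots < i_k \le n} \det{}^2 \begin{pmatrix} \dfrac{\partial x^{i_1}}{\partial t^1} & \cdots & \dfrac{\partial x^{i_k}}{\partial t^1} \\ \vdots & & \vdots \\ \dfrac{\partial x^{i_1}}{\partial t^k} & \cdots & \dfrac{\partial x^{i_k}}{\partial t^k} \end{pmatrix}}\; dt^1 \cdots dt^k$$

for the area of a $k$-dimensional surface given in the parametric form
$x = x(t^1, \ldots, t^k)$, $t \in D \subset \mathbb{R}^k$.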
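
The identity in Problem 7 a) is easy to test on concrete vectors before proving it. The sketch below is only a numerical check under the assumption of integer coordinates (so the comparison is exact); the helper names `det`, `gram_det`, and `sum_squared_minors` are illustrative.

```python
from itertools import combinations

def det(m):
    """Determinant by Laplace expansion along the first row (small matrices only)."""
    if len(m) == 1:
        return m[0][0]
    total = 0
    for j, entry in enumerate(m[0]):
        minor = [list(row[:j]) + list(row[j + 1:]) for row in m[1:]]
        total += (-1) ** j * entry * det(minor)
    return total

def gram_det(vectors):
    """Determinant of the Gram matrix of pairwise inner products <x_i, x_j>."""
    gram = [[sum(a * b for a, b in zip(u, v)) for v in vectors] for u in vectors]
    return det(gram)

def sum_squared_minors(vectors):
    """Sum of P^2 over all increasing index sets i_1 < ... < i_k (0-based here)."""
    k, n = len(vectors), len(vectors[0])
    return sum(
        det([[v[i] for i in cols] for v in vectors]) ** 2
        for cols in combinations(range(n), k)
    )

if __name__ == "__main__":
    xs = [(1, 2, 3), (4, 5, 6)]
    print(gram_det(xs), sum_squared_minors(xs))
```

For $k = 1$ the check reduces to the ordinary Pythagorean theorem for the length of a vector, which is the reading suggested in part b).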

8. a) Verify that the quantity $V_k(S)$ in Definition 2 really is independent of the
method of partitioning the surface $S$ into smooth pieces $S_1, \ldots, S_m, \ldots$.

b) Show that a piecewise-smooth surface $S$ admits the locally finite partition
into pieces $S_1, \ldots, S_m, \ldots$ described in Definition 2.

c) Show that a set $E$ of measure 0 can always be removed from a piecewise-smooth
surface $S$ so as to leave a smooth surface $\tilde S = S \setminus E$ that can be described by a single
standard local chart $\varphi : I \to \tilde S$.

9. The length of a curve, like the high-school definition of the circumference of a 
circle, is often defined as the limit of the lengths of suitably inscribed broken lines. 
The limit is taken as the length of the links in the inscribed broken lines tend to 
zero. The following simple example, due to H. Schwarz, shows that the analogous 
procedure in an attempt to define the area of even a very simple smooth surface in 
terms of the areas of polyhedral surfaces “inscribed” in it, may lead to an absurdity. 

In a cylinder of radius R and height H we inscribe a polyhedron as follows. 
Cut the cylinder into m equal cylinders each of height H/m by means of horizontal
planes. Break each of the m + 1 circles of intersection (including the upper and lower 
bases of the original cylinder) into n equal parts so that the points of division on each 
circle lie beneath the midpoints of the points of division of the circle immediately 
above. We now take a pair of division points of each circle and the point lying 
directly above or below the midpoint of the arc whose endpoints they are. 
These three points form a triangle, and the set of all such triangles forms a polyhedral
surface inscribed in the original cylindrical surface (the lateral surface of a
right circular cylinder). In shape this polyhedron resembles the calf of a boot that 
has been crumpled like an accordion. For that reason it is often called the Schwarz 
boot. 


a) Show that if $m$ and $n$ are made to tend to infinity in such a way that the
ratio $n^2/m$ tends to zero, then the area of the polyhedral surface just constructed
will increase without bound, even though the dimensions of each of its faces (each
triangle) tend to zero.

b) If $n$ and $m$ tend to infinity in such a way that the ratio $m/n^2$ tends to some
finite limit $p$, the area of the polyhedral surfaces will tend to a finite limit, which
may be larger than, smaller than, or (when $p = 0$) equal to the area of the original
cylindrical surface.

c) Compare the method of introducing the area of a smooth surface described 
here with what was just done above, and explain why the results are the same in the 
one-dimensional case, but in general not in the two-dimensional case. What are the 
conditions on the sequence of inscribed polyhedral surfaces that guarantee that the 
two results will be the same? 
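
The behavior described in Problem 9 can be observed numerically. The sketch below assumes the following reading of the construction, which is not spelled out in the text: each of the $m$ bands carries $2n$ congruent triangles with base $2R\sin(\pi/n)$ (a chord) and height $\sqrt{(H/m)^2 + R^2(1-\cos(\pi/n))^2}$ (from the chord's midpoint to the apex on the neighboring circle). The function name is illustrative.

```python
import math

def schwarz_boot_area(R: float, H: float, m: int, n: int) -> float:
    """Total area of the inscribed polyhedron: 2*m*n congruent triangles (assumed geometry)."""
    chord = 2.0 * R * math.sin(math.pi / n)                        # base of each triangle
    height = math.hypot(H / m, R * (1.0 - math.cos(math.pi / n)))  # apex to chord midpoint
    return 2 * m * n * 0.5 * chord * height

if __name__ == "__main__":
    R, H = 1.0, 1.0
    print("cylinder:", 2 * math.pi * R * H)
    print("m = n   :", schwarz_boot_area(R, H, 1000, 1000))    # m/n^2 -> 0
    print("m = n^2 :", schwarz_boot_area(R, H, 1000**2, 1000)) # m/n^2 -> 1
    print("m = n^3 :", schwarz_boot_area(R, H, 100**3, 100))   # n^2/m -> 0
```

Under this reading, $m/n^2 \to p$ gives the limit $2\pi R\sqrt{H^2 + \pi^4 R^2 p^2/4}$, which can be compared with the printed values.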


10. The isoperimetric inequality. Let $V(E)$ denote the volume of a set $E \subset \mathbb{R}^n$, and
$A + B$ the (vector) sum of the sets $A, B \subset \mathbb{R}^n$. (The sum in the sense of Minkowski
is meant. See Problem 4 in Sect. 11.2.)

Let $B$ be a ball of radius $h$. Then $A + B =: A_h$ is the $h$-neighborhood of the set $A$.
The quantity

$$\lim_{h \to 0} \frac{V(A_h) - V(A)}{h} =: \mu_+(\partial A)$$

is called the Minkowski outer area of the boundary $\partial A$ of $A$.

a) Show that if $\partial A$ is a smooth or sufficiently regular surface, then $\mu_+(\partial A)$
equals the usual area of the surface $\partial A$.

b) Using the Brunn-Minkowski inequality (Problem 4 of Sect. 11.2), obtain now
the classical isoperimetric inequality in $\mathbb{R}^n$:

$$\mu_+(\partial A) \ge n v^{\frac{1}{n}} V^{\frac{n-1}{n}}(A) =: \mu(S_A);$$

here $v$ is the volume of the unit ball in $\mathbb{R}^n$, and $\mu(S_A)$ the area of the ($(n-1)$-dimensional)
surface of the ball having the same volume as $A$.

The isoperimetric inequality means that a solid $A \subset \mathbb{R}^n$ has boundary area
$\mu_+(\partial A)$ not less than that of a ball of the same volume.


12.5 Elementary Facts About Differential Forms 

We now give an elementary description of the convenient mathematical machinery 
known as differential forms, paying particular attention here to its algorithmic potential
rather than the details of the theoretical constructions, which will be discussed
in Chap. 15.


12.5.1 Differential Forms: Definition and Examples 

Having studied algebra, the reader is well acquainted with the concept of a linear 
form, and we have already made extensive use of that concept in constructing the 
differential calculus. In that process we encountered mostly symmetric forms. In the 
present subsection we will be discussing skew-symmetric (anti-symmetric) forms. 

We recall that a form $L : X^k \to Y$ of degree or order $k$ defined on ordered sets
$\xi_1, \ldots, \xi_k$ of vectors of a vector space $X$ and assuming values in a vector space $Y$
is skew-symmetric or anti-symmetric if the value of the form changes sign when any
pair of its arguments is interchanged, that is,

$$L(\xi_1, \ldots, \xi_i, \ldots, \xi_j, \ldots, \xi_k) = -L(\xi_1, \ldots, \xi_j, \ldots, \xi_i, \ldots, \xi_k).$$

In particular, if $\xi_i = \xi_j$ then the value of the form will be zero, regardless of the
other vectors.

Example 1 The vector (cross) product $[\xi_1, \xi_2]$ of two vectors in $\mathbb{R}^3$ is a skew-symmetric
bilinear form with values in $\mathbb{R}^3$.

Example 2 The oriented volume $V(\xi_1, \ldots, \xi_k)$ of the parallelepiped spanned by the
vectors $\xi_1, \ldots, \xi_k$ of $\mathbb{R}^k$, defined by Eq. (12.6) of Sect. 12.4, is a skew-symmetric
real-valued $k$-form on $\mathbb{R}^k$.

For the time being we shall be interested only in real-valued $k$-forms (the case
$Y = \mathbb{R}$), even though everything that will be discussed below is applicable to the
more general situation, for example, when $Y$ is the field $\mathbb{C}$ of complex numbers.

A linear combination of skew-symmetric forms of the same degree is in turn a 
skew-symmetric form, that is, the skew-symmetric forms of a given degree consti- 
tute a vector space. 

In addition, in algebra one introduces the exterior product $\wedge$ of skew-symmetric
forms, which assigns to an ordered pair $A^p$, $B^q$ of such forms (of degrees $p$ and $q$
respectively) a skew-symmetric form $A^p \wedge B^q$ of degree $p + q$. This operation is

associative: $(A^p \wedge B^q) \wedge C^r = A^p \wedge (B^q \wedge C^r)$,

distributive: $(A^p + B^p) \wedge C^q = A^p \wedge C^q + B^p \wedge C^q$,

skew-commutative: $A^p \wedge B^q = (-1)^{pq} B^q \wedge A^p$.

In particular, in the case of 1-forms $A$ and $B$, the operation is anticommutative,
$A \wedge B = -B \wedge A$, like the vector product shown
in Example 1. The exterior product of forms is in fact a generalization of the vector
product.
Without going into the details of the definition of the exterior product, we take
as known for the time being the properties of this operation just listed and observe
that in the case of the exterior product of 1-forms $L_1, \ldots, L_k \in \mathcal{L}(\mathbb{R}^n, \mathbb{R})$ the result
$L_1 \wedge \cdots \wedge L_k$ is a $k$-form that assumes the value

$$L_1 \wedge \cdots \wedge L_k(\xi_1, \ldots, \xi_k) = \begin{vmatrix} L_1(\xi_1) & \cdots & L_k(\xi_1) \\ \vdots & & \vdots \\ L_1(\xi_k) & \cdots & L_k(\xi_k) \end{vmatrix} = \det\bigl(L_j(\xi_i)\bigr) \tag{12.11}$$

on the set of vectors $\xi_1, \ldots, \xi_k$.

If relation (12.11) is taken as the definition of the left-hand side, it follows from
properties of determinants that in the case of linear forms $A$, $B$, and $C$, we do indeed
have $A \wedge B = -B \wedge A$ and $(A + B) \wedge C = A \wedge C + B \wedge C$.

Let us now consider some examples that will be useful below. 


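Example 3 Let $\pi^i \in \mathcal{L}(\mathbb{R}^n, \mathbb{R})$, $i = 1, \ldots, n$, be the projections. More precisely, the
linear function $\pi^i : \mathbb{R}^n \to \mathbb{R}$ is such that on each vector $\xi = (\xi^1, \ldots, \xi^n) \in \mathbb{R}^n$ it
assumes the value $\pi^i(\xi) = \xi^i$ of the projection of that vector on the corresponding
coordinate axis. Then, in accordance with formula (12.11) we obtain

$$\pi^{i_1} \wedge \cdots \wedge \pi^{i_k}(\xi_1, \ldots, \xi_k) = \begin{vmatrix} \xi_1^{i_1} & \cdots & \xi_1^{i_k} \\ \vdots & & \vdots \\ \xi_k^{i_1} & \cdots & \xi_k^{i_k} \end{vmatrix}. \tag{12.12}$$
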


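Example 4 The Cartesian coordinates of the vector product $[\xi_1, \xi_2]$ of the vectors
$\xi_1 = (\xi_1^1, \xi_1^2, \xi_1^3)$ and $\xi_2 = (\xi_2^1, \xi_2^2, \xi_2^3)$ in the Euclidean space $\mathbb{R}^3$, as is known, are
defined by the equality

$$[\xi_1, \xi_2] = \left( \begin{vmatrix} \xi_1^2 & \xi_1^3 \\ \xi_2^2 & \xi_2^3 \end{vmatrix},\ \begin{vmatrix} \xi_1^3 & \xi_1^1 \\ \xi_2^3 & \xi_2^1 \end{vmatrix},\ \begin{vmatrix} \xi_1^1 & \xi_1^2 \\ \xi_2^1 & \xi_2^2 \end{vmatrix} \right).$$

Thus, in accordance with the result of Example 3 we can write

$$\pi^1([\xi_1, \xi_2]) = \pi^2 \wedge \pi^3(\xi_1, \xi_2),$$

$$\pi^2([\xi_1, \xi_2]) = \pi^3 \wedge \pi^1(\xi_1, \xi_2),$$

$$\pi^3([\xi_1, \xi_2]) = \pi^1 \wedge \pi^2(\xi_1, \xi_2).$$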
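
Formula (12.11) and Examples 3 and 4 translate directly into a few lines of code. The following sketch takes (12.11) as the definition of the exterior product of 1-forms; the names `wedge`, `projection`, and `cross` are illustrative, and 1-forms are modeled as plain Python callables on coordinate tuples.

```python
def det(m):
    """Determinant by Laplace expansion along the first row (small matrices only)."""
    if len(m) == 1:
        return m[0][0]
    total = 0
    for j, entry in enumerate(m[0]):
        minor = [row[:j] + row[j + 1:] for row in m[1:]]
        total += (-1) ** j * entry * det(minor)
    return total

def wedge(*forms):
    """Exterior product of 1-forms per (12.11): the value is det(L_j(xi_i))."""
    def value(*vectors):
        # Row i holds the values of all forms on the i-th vector.
        return det([[L(xi) for L in forms] for xi in vectors])
    return value

def projection(i):
    """Coordinate projection pi^i, with a 1-based index as in the text."""
    return lambda xi: xi[i - 1]

def cross(a, b):
    """Vector product in R^3, written out coordinate by coordinate."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

if __name__ == "__main__":
    xi1, xi2 = (1, 2, 3), (4, 5, 6)
    p1, p2, p3 = projection(1), projection(2), projection(3)
    print(cross(xi1, xi2))
    print(wedge(p2, p3)(xi1, xi2), wedge(p3, p1)(xi1, xi2), wedge(p1, p2)(xi1, xi2))
```

Swapping the two forms, or the two vectors, swaps two columns or rows of the determinant, so the sign change is visible directly in `wedge`.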


Example 5 Let $f : D \to \mathbb{R}$ be a function that is defined in a domain $D \subset \mathbb{R}^n$ and
differentiable at $x_0 \in D$. As is known, the differential $df(x_0)$ of the function at a point
is a linear function defined on displacement vectors $\xi$ from that point. More precisely,
on vectors of the tangent space $TD_{x_0}$ to $D$ (or $\mathbb{R}^n$) at the point under consideration.
We recall that if $x^1, \ldots, x^n$ are the coordinates in $\mathbb{R}^n$ and $\xi = (\xi^1, \ldots, \xi^n)$,
then

$$df(x_0)(\xi) = \frac{\partial f}{\partial x^1}(x_0)\xi^1 + \cdots + \frac{\partial f}{\partial x^n}(x_0)\xi^n = D_\xi f(x_0).$$

In particular $dx^i(\xi) = \xi^i$, or, more formally, $dx^i(x_0)(\xi) = \xi^i$. If $f_1, \ldots, f_k$ are
real-valued functions defined in $G$ and differentiable at the point $x_0 \in G$, then in
accordance with (12.11) we obtain

$$df_1 \wedge \cdots \wedge df_k(\xi_1, \ldots, \xi_k) = \begin{vmatrix} df_1(\xi_1) & \cdots & df_k(\xi_1) \\ \vdots & & \vdots \\ df_1(\xi_k) & \cdots & df_k(\xi_k) \end{vmatrix} \tag{12.13}$$

at the point $x_0$ for the set $\xi_1, \ldots, \xi_k$ of vectors in the space $TG_{x_0}$; and, in particular,

$$dx^{i_1} \wedge \cdots \wedge dx^{i_k}(\xi_1, \ldots, \xi_k) = \begin{vmatrix} \xi_1^{i_1} & \cdots & \xi_1^{i_k} \\ \vdots & & \vdots \\ \xi_k^{i_1} & \cdots & \xi_k^{i_k} \end{vmatrix}. \tag{12.14}$$

In this way skew-symmetric forms of degree $k$ defined on the space $TD_{x_0} \approx
T\mathbb{R}^n_{x_0} \approx \mathbb{R}^n$ have been obtained from the linear forms $df_1, \ldots, df_k$ defined on this
space.


Example 6 If $f \in C^{(1)}(D, \mathbb{R})$, where $D$ is a domain in $\mathbb{R}^n$, then the differential
$df(x)$ of the function $f$ is defined at any point $x \in D$, and this differential, as
has been stated, is a linear function $df(x) : TD_x \to T\mathbb{R}_{f(x)} \approx \mathbb{R}$ on the tangent
space $TD_x$ to $D$ at $x$. In general the form $df(x) = f'(x)$ varies in passage from one
point to another in $D$. Thus a smooth scalar-valued function $f : D \to \mathbb{R}$ generates a
linear form $df(x)$ at each point, or, as we say, generates a field of linear forms in $D$,
defined on the corresponding tangent spaces $TD_x$.


Definition 1 We shall say that a real-valued differential $p$-form $\omega$ is defined in the
domain $D \subset \mathbb{R}^n$ if a skew-symmetric form $\omega(x) : (TD_x)^p \to \mathbb{R}$ is defined at each
point $x \in D$.


The number $p$ is usually called the degree or order of $\omega$. In this connection the
$p$-form $\omega$ is often denoted $\omega^p$.

Thus, the field of the differential $df$ of a smooth function $f : D \to \mathbb{R}$ considered
in Example 6 is a differential 1-form in $D$, and $\omega = dx^{i_1} \wedge \cdots \wedge dx^{i_p}$ is the simplest
example of a differential form of degree $p$.

Example 7 Suppose a vector field $F$ is defined in a domain $D \subset \mathbb{R}^n$, that is, a vector
$F(x)$ is attached to each point $x \in D$. When there is a Euclidean structure in $\mathbb{R}^n$ this vector
field generates the following differential 1-form $\omega_F^1$ in $D$.
If $\xi$ is a vector attached to $x \in D$, that is, $\xi \in TD_x$, we set

$$\omega_F^1(x)(\xi) = \langle F(x), \xi \rangle.$$

It follows from properties of the inner product that $\omega_F^1(x) = \langle F(x), \cdot \rangle$ is indeed a
linear form at each point $x \in D$.

Such differential forms arise very frequently. For example, if $F$ is a continuous
force field in $D$ and $\xi$ an infinitesimal displacement vector from the point $x \in D$,
the element of work corresponding to this displacement, as is known from physics,
is defined precisely by the quantity $\langle F(x), \xi \rangle$.

Thus a force field $F$ in a domain $D$ of the Euclidean space $\mathbb{R}^n$ naturally generates
a differential 1-form $\omega_F^1$ in $D$, which it is natural to call the work form of the field $F$
in this case.

We remark that in Euclidean space the differential $df$ of a smooth function
$f : D \to \mathbb{R}$ in the domain $D \subset \mathbb{R}^n$ can also be regarded as the 1-form generated by a
vector field, in this case the field $F = \operatorname{grad} f$. In fact, by definition $\operatorname{grad} f$ is such
that $df(x)(\xi) = \langle \operatorname{grad} f(x), \xi \rangle$ for every vector $\xi \in TD_x$.

Example 8 A vector field $V$ defined in a domain $D$ of the Euclidean space $\mathbb{R}^n$ can
also be regarded as a differential form $\omega_V^{n-1}$ of degree $n - 1$. If at a point $x \in D$ we
take the vector $V(x)$ of the field and $n - 1$ additional vectors $\xi_1, \ldots, \xi_{n-1} \in TD_x$ attached to
the point $x$, then the oriented volume of the parallelepiped spanned by the vectors
$V(x), \xi_1, \ldots, \xi_{n-1}$, which is the determinant of the matrix whose rows are the
coordinates of these vectors, will obviously be a skew-symmetric $(n-1)$-form with
respect to the variables $\xi_1, \ldots, \xi_{n-1}$.

For $n = 3$ the form $\omega_V^2$ is the usual scalar triple product $(V(x), \xi_1, \xi_2)$ of vectors,
one of which, $V(x)$, is given, resulting in a skew-symmetric 2-form $\omega_V^2 = (V, \cdot, \cdot)$.

For example, if a steady flow of a fluid is taking place in the domain $D$ and
$V(x)$ is the velocity vector at the point $x \in D$, the quantity $(V(x), \xi_1, \xi_2)$ is the
element of volume of the fluid passing through the (parallelogram) area spanned
by the small vectors $\xi_1 \in TD_x$ and $\xi_2 \in TD_x$ in unit time. By choosing different
vectors $\xi_1$ and $\xi_2$ we shall obtain areas (parallelograms) of different configuration,
differently situated in space, all having one vertex at $x$. For each such area there
will be, in general, a different value $(V(x), \xi_1, \xi_2)$ of the form $\omega_V^2(x)$. As has been
stated, this value shows how much fluid has flowed through the surface in unit time,
that is, it characterizes the flux across the chosen element of area. For that reason
we often call the form $\omega_V^2$ (and indeed its multidimensional analogue $\omega_V^{n-1}$) the flux
form of the vector field $V$ in $D$.


12.5.2 Coordinate Expression of a Differential Form 

Let us now investigate the coordinate expression of skew-symmetric algebraic and 
differential forms and show, in particular, that every differential k-form is in a certain 
sense a linear combination of standard differential forms of the form (12.14). 
To abbreviate the notation, we shall assume summation over the range of allowable
values for indices that occur as both superscripts and subscripts (as we did
earlier in similar situations).

Let $L$ be a $k$-linear form in $\mathbb{R}^n$. If a basis $e_1, \ldots, e_n$ is fixed in $\mathbb{R}^n$, then each
vector $\xi \in \mathbb{R}^n$ gets a coordinate representation $\xi = \xi^i e_i$ in that basis, and the form $L$
acquires the coordinate expression

$$L(\xi_1, \ldots, \xi_k) = L(\xi_1^{i_1} e_{i_1}, \ldots, \xi_k^{i_k} e_{i_k}) = L(e_{i_1}, \ldots, e_{i_k})\,\xi_1^{i_1} \cdots \xi_k^{i_k}. \tag{12.15}$$

The numbers $a_{i_1 \ldots i_k} = L(e_{i_1}, \ldots, e_{i_k})$ characterize the form $L$ completely if the
basis in which they have been obtained is known. These numbers are obviously
symmetric or skew-symmetric with respect to their indices if and only if the form $L$
possesses the corresponding type of symmetry.

In the case of a skew-symmetric form $L$ the coordinate representation can be
transformed slightly. To make the direction of that transformation clear and natural,
let us consider the special case of (12.15) that occurs when $L$ is a skew-symmetric
2-form in $\mathbb{R}^3$. Then for the vectors $\xi_1 = \xi_1^{i_1} e_{i_1}$ and $\xi_2 = \xi_2^{i_2} e_{i_2}$, where $i_1, i_2 = 1, 2, 3$,
we obtain

$$\begin{aligned}
L(\xi_1, \xi_2) &= L(\xi_1^{i_1} e_{i_1}, \xi_2^{i_2} e_{i_2}) = L(e_{i_1}, e_{i_2})\,\xi_1^{i_1}\xi_2^{i_2} = \\
&= L(e_1, e_1)\xi_1^1\xi_2^1 + L(e_1, e_2)\xi_1^1\xi_2^2 + L(e_1, e_3)\xi_1^1\xi_2^3 + \\
&\quad + L(e_2, e_1)\xi_1^2\xi_2^1 + L(e_2, e_2)\xi_1^2\xi_2^2 + L(e_2, e_3)\xi_1^2\xi_2^3 + \\
&\quad + L(e_3, e_1)\xi_1^3\xi_2^1 + L(e_3, e_2)\xi_1^3\xi_2^2 + L(e_3, e_3)\xi_1^3\xi_2^3 = \\
&= L(e_1, e_2)(\xi_1^1\xi_2^2 - \xi_1^2\xi_2^1) + L(e_1, e_3)(\xi_1^1\xi_2^3 - \xi_1^3\xi_2^1) + \\
&\quad + L(e_2, e_3)(\xi_1^2\xi_2^3 - \xi_1^3\xi_2^2) = \sum_{1 \le i_1 < i_2 \le 3} L(e_{i_1}, e_{i_2}) \begin{vmatrix} \xi_1^{i_1} & \xi_1^{i_2} \\ \xi_2^{i_1} & \xi_2^{i_2} \end{vmatrix},
\end{aligned}$$

where the summation extends over all combinations of indices $i_1$ and $i_2$ that satisfy
the inequalities written under the summation sign.

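Similarly in the general case we can also obtain the following representation for
a skew-symmetric form $L$:

$$L(\xi_1, \ldots, \xi_k) = \sum_{1 \le i_1 < \cdots < i_k \le n} L(e_{i_1}, \ldots, e_{i_k}) \begin{vmatrix} \xi_1^{i_1} & \cdots & \xi_1^{i_k} \\ \vdots & & \vdots \\ \xi_k^{i_1} & \cdots & \xi_k^{i_k} \end{vmatrix}. \tag{12.16}$$

Then, in accordance with formula (12.12) this last equality can be rewritten as

$$L(\xi_1, \ldots, \xi_k) = \sum_{1 \le i_1 < \cdots < i_k \le n} L(e_{i_1}, \ldots, e_{i_k})\,\pi^{i_1} \wedge \cdots \wedge \pi^{i_k}(\xi_1, \ldots, \xi_k).$$
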
Thus, any skew-symmetric form $L$ can be represented as a linear combination

$$L = \sum_{1 \le i_1 < \cdots < i_k \le n} a_{i_1 \ldots i_k}\,\pi^{i_1} \wedge \cdots \wedge \pi^{i_k} \tag{12.17}$$

of the $k$-forms $\pi^{i_1} \wedge \cdots \wedge \pi^{i_k}$, which are the exterior products formed from the
elementary 1-forms $\pi^1, \ldots, \pi^n$ in $\mathbb{R}^n$.

Now suppose that a differential $k$-form $\omega$ is defined in some domain $D \subset \mathbb{R}^n$
along with a curvilinear coordinate system $x^1, \ldots, x^n$. At each point $x \in D$ we fix
the basis $e_1(x), \ldots, e_n(x)$ of the space $TD_x$, formed from the unit vectors along the
coordinate axes. (For example, if $x^1, \ldots, x^n$ are Cartesian coordinates in $\mathbb{R}^n$, then
$e_1(x), \ldots, e_n(x)$ is simply the frame $e_1, \ldots, e_n$ in $\mathbb{R}^n$ translated parallel to itself
from the origin to $x$.) Then at each point $x \in D$ we find by formulas (12.14) and
(12.16) that

$$\omega(x)(\xi_1, \ldots, \xi_k) = \sum_{1 \le i_1 < \cdots < i_k \le n} \omega\bigl(e_{i_1}(x), \ldots, e_{i_k}(x)\bigr)\,dx^{i_1} \wedge \cdots \wedge dx^{i_k}(\xi_1, \ldots, \xi_k)$$

or

$$\omega(x) = \sum_{1 \le i_1 < \cdots < i_k \le n} a_{i_1 \ldots i_k}(x)\,dx^{i_1} \wedge \cdots \wedge dx^{i_k}. \tag{12.18}$$

Thus, every differential $k$-form is a combination of the elementary $k$-forms
$dx^{i_1} \wedge \cdots \wedge dx^{i_k}$ formed from the differentials of the coordinates. As a matter of fact, that
is the reason for the term “differential form”.

The coefficients $a_{i_1 \ldots i_k}(x)$ of the linear combination (12.18) generally depend on
the point $x$, that is, they are functions defined in the domain in which the form $\omega^k$ is
given.

In particular, we have long known the expansion of the differential

$$df(x) = \frac{\partial f}{\partial x^1}(x)\,dx^1 + \cdots + \frac{\partial f}{\partial x^n}(x)\,dx^n, \tag{12.19}$$

and, as can be seen from the equalities

$$\begin{aligned}
\langle F, \xi \rangle &= \langle F^{i_1} e_{i_1}(x), \xi^{i_2} e_{i_2}(x) \rangle = \\
&= \langle e_{i_1}(x), e_{i_2}(x) \rangle F^{i_1}(x)\xi^{i_2} = g_{i_1 i_2}(x) F^{i_1}(x)\xi^{i_2} = \\
&= g_{i_1 i_2}(x) F^{i_1}(x)\,dx^{i_2}(\xi),
\end{aligned}$$

the expansion

$$\omega_F^1(x) = \langle F(x), \cdot \rangle = \bigl(g_{i_1 i}(x) F^{i_1}(x)\bigr)\,dx^i = a_i(x)\,dx^i \tag{12.20}$$

also holds. In Cartesian coordinates this expansion looks especially simple:

$$\omega_F^1(x) = \langle F(x), \cdot \rangle = \sum_{i=1}^n F^i(x)\,dx^i.$$


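Next, the following equality holds in $\mathbb{R}^3$:

$$\omega_V^2(x)(\xi_1, \xi_2) = \begin{vmatrix} V^1(x) & V^2(x) & V^3(x) \\ \xi_1^1 & \xi_1^2 & \xi_1^3 \\ \xi_2^1 & \xi_2^2 & \xi_2^3 \end{vmatrix} = V^1(x) \begin{vmatrix} \xi_1^2 & \xi_1^3 \\ \xi_2^2 & \xi_2^3 \end{vmatrix} + V^2(x) \begin{vmatrix} \xi_1^3 & \xi_1^1 \\ \xi_2^3 & \xi_2^1 \end{vmatrix} + V^3(x) \begin{vmatrix} \xi_1^1 & \xi_1^2 \\ \xi_2^1 & \xi_2^2 \end{vmatrix}, \tag{12.21}$$

from which it follows that

$$\omega_V^2(x) = V^1(x)\,dx^2 \wedge dx^3 + V^2(x)\,dx^3 \wedge dx^1 + V^3(x)\,dx^1 \wedge dx^2. \tag{12.22}$$
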


Similarly, expanding the determinant of order $n$ for the form $\omega_V^{n-1}$ by minors
along the first row, we obtain the expansion

$$\omega_V^{n-1} = \sum_{i=1}^n (-1)^{i+1} V^i(x)\,dx^1 \wedge \cdots \wedge \widehat{dx^i} \wedge \cdots \wedge dx^n, \tag{12.23}$$

where the sign $\widehat{\ }$ stands over the differential that is to be omitted in the indicated
term.


12.5.3 The Exterior Differential of a Form 

All that has been said up to now about differential forms essentially involved each 
individual point x of the domain of definition of the form and had a purely algebraic 
character. The operation of (exterior) differentiation of such forms is specific to 
analysis. 

Let us agree from now on to define the 0-forms in a domain to be functions
$f : D \to \mathbb{R}$ defined in that domain.

Definition 2 The (exterior) differential of a 0-form $f$, when $f$ is a differentiable
function, is the usual differential $df$ of that function.

If a differential $p$-form ($p \ge 1$) defined in a domain $D \subset \mathbb{R}^n$

$$\omega(x) = a_{i_1 \ldots i_p}(x)\,dx^{i_1} \wedge \cdots \wedge dx^{i_p}$$

has differentiable coefficients $a_{i_1 \ldots i_p}(x)$, then its (exterior) differential is the form

$$d\omega(x) = da_{i_1 \ldots i_p}(x) \wedge dx^{i_1} \wedge \cdots \wedge dx^{i_p}.$$

Using the expansion (12.19) for the differential of a function, and relying on the
distributivity of the exterior product of 1-forms, which follows from relation (12.11),
we conclude that

$$d\omega(x) = \frac{\partial a_{i_1 \ldots i_p}}{\partial x^i}(x)\,dx^i \wedge dx^{i_1} \wedge \cdots \wedge dx^{i_p} = \alpha_{i i_1 \ldots i_p}(x)\,dx^i \wedge dx^{i_1} \wedge \cdots \wedge dx^{i_p},$$

that is, the (exterior) differential of a $p$-form ($p \ge 0$) is always a form of degree
$p + 1$.

We note that Definition 1 given above for a differential $p$-form in a domain
$D \subset \mathbb{R}^n$, as one can now understand, is too general, since it does not in any way
connect the forms $\omega(x)$ corresponding to different points of the domain $D$. In actuality,
the only forms used in analysis are those whose coordinates $a_{i_1 \ldots i_p}(x)$ in
a coordinate representation are sufficiently regular (most often infinitely differentiable)
functions in the domain $D$. The order of smoothness of the form $\omega$ in the
domain $D \subset \mathbb{R}^n$ is customarily characterized by the smallest order of smoothness
of its coefficients. The totality of all forms of degree $p \ge 0$ with coefficients of class
$C^{(\infty)}(D, \mathbb{R})$ is most often denoted $\Omega^p(D, \mathbb{R})$ or $\Omega^p$.

Thus the operation of differentiation of forms that we have defined effects a mapping
$d : \Omega^p \to \Omega^{p+1}$.

Let us consider several useful specific examples. 


Example 9 For a 0-form $\omega = f(x, y, z)$ - a differentiable function - defined in a
domain $D \subset \mathbb{R}^3$, we obtain

$$d\omega = \frac{\partial f}{\partial x}\,dx + \frac{\partial f}{\partial y}\,dy + \frac{\partial f}{\partial z}\,dz.$$


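Example 10 Let

$$\omega(x, y) = P(x, y)\,dx + Q(x, y)\,dy$$

be a differential 1-form in a domain $D$ of $\mathbb{R}^2$ endowed with coordinates $(x, y)$.
Assuming that $P$ and $Q$ are differentiable in $D$, by Definition 2 we obtain

$$\begin{aligned}
d\omega(x, y) &= dP \wedge dx + dQ \wedge dy = \\
&= \Bigl(\frac{\partial P}{\partial x}\,dx + \frac{\partial P}{\partial y}\,dy\Bigr) \wedge dx + \Bigl(\frac{\partial Q}{\partial x}\,dx + \frac{\partial Q}{\partial y}\,dy\Bigr) \wedge dy = \\
&= \frac{\partial P}{\partial y}\,dy \wedge dx + \frac{\partial Q}{\partial x}\,dx \wedge dy = \Bigl(\frac{\partial Q}{\partial x} - \frac{\partial P}{\partial y}\Bigr)(x, y)\,dx \wedge dy.
\end{aligned}$$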
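
The result of Example 10 can be checked numerically on concrete coefficients. The sketch below approximates the coefficient $\partial Q/\partial x - \partial P/\partial y$ of $dx \wedge dy$ by central differences; the function name `d_coefficient` and the step `h` are illustrative choices, not part of the text.

```python
def d_coefficient(P, Q, x, y, h=1e-5):
    """Approximate the coefficient of dx ^ dy in d(P dx + Q dy) at the point (x, y)."""
    dQ_dx = (Q(x + h, y) - Q(x - h, y)) / (2 * h)  # central difference in x
    dP_dy = (P(x, y + h) - P(x, y - h)) / (2 * h)  # central difference in y
    return dQ_dx - dP_dy

if __name__ == "__main__":
    # omega = -y dx + x dy, for which the coefficient of dx ^ dy is 2.
    print(d_coefficient(lambda x, y: -y, lambda x, y: x, 0.3, -1.2))
    # omega = xy dx + x^2 dy, for which the coefficient is 2x - x = x.
    print(d_coefficient(lambda x, y: x * y, lambda x, y: x * x, 0.3, -1.2))
```

Central differences are exact up to rounding for polynomials of degree at most two, so both printed values agree with the formula to many digits.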
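Example 11 For a 1-form

$$\omega = P\,dx + Q\,dy + R\,dz$$

defined in a domain $D$ in $\mathbb{R}^3$ we obtain

$$d\omega = \Bigl(\frac{\partial R}{\partial y} - \frac{\partial Q}{\partial z}\Bigr)\,dy \wedge dz + \Bigl(\frac{\partial P}{\partial z} - \frac{\partial R}{\partial x}\Bigr)\,dz \wedge dx + \Bigl(\frac{\partial Q}{\partial x} - \frac{\partial P}{\partial y}\Bigr)\,dx \wedge dy.$$

Example 12 Computing the differential of the 2-form

$$\omega = P\,dy \wedge dz + Q\,dz \wedge dx + R\,dx \wedge dy,$$

where $P$, $Q$, and $R$ are differentiable in the domain $D \subset \mathbb{R}^3$, leads to the relation

$$d\omega = \Bigl(\frac{\partial P}{\partial x} + \frac{\partial Q}{\partial y} + \frac{\partial R}{\partial z}\Bigr)\,dx \wedge dy \wedge dz.$$
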

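If $(x^1, x^2, x^3)$ are Cartesian coordinates in the Euclidean space $\mathbb{R}^3$ and $x \mapsto f(x)$,
$x \mapsto F(x) = (F^1, F^2, F^3)(x)$, and $x \mapsto V(x) = (V^1, V^2, V^3)(x)$ are smooth
scalar and vector fields in the domain $D \subset \mathbb{R}^3$, then along with these fields, we
often consider the respective vector fields

$$\operatorname{grad} f = \Bigl(\frac{\partial f}{\partial x^1}, \frac{\partial f}{\partial x^2}, \frac{\partial f}{\partial x^3}\Bigr) \quad \text{- the gradient of } f, \tag{12.24}$$

$$\operatorname{curl} F = \Bigl(\frac{\partial F^3}{\partial x^2} - \frac{\partial F^2}{\partial x^3},\ \frac{\partial F^1}{\partial x^3} - \frac{\partial F^3}{\partial x^1},\ \frac{\partial F^2}{\partial x^1} - \frac{\partial F^1}{\partial x^2}\Bigr) \quad \text{- the curl of } F, \tag{12.25}$$

and the scalar field

$$\operatorname{div} V = \frac{\partial V^1}{\partial x^1} + \frac{\partial V^2}{\partial x^2} + \frac{\partial V^3}{\partial x^3} \quad \text{- the divergence of } V. \tag{12.26}$$
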


We have already mentioned the gradient of a scalar field earlier. Without dwelling 
on the physical content of the curl and divergence of a vector field at the moment, 
we note only the connections that these classical operators have with the operation 
of differentiating forms. 

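In the oriented Euclidean space $\mathbb{R}^3$ there is a one-to-one correspondence between
vector fields and 1- and 2-forms:

$$F \leftrightarrow \omega_F^1 = \langle F, \cdot \rangle, \qquad V \leftrightarrow \omega_V^2 = (V, \cdot, \cdot).$$

We remark also that every 3-form in the domain $D \subset \mathbb{R}^3$ has the form
$\rho(x^1, x^2, x^3)\,dx^1 \wedge dx^2 \wedge dx^3$. Taking this circumstance into account, one can introduce the
following definitions for $\operatorname{grad} f$, $\operatorname{curl} F$, and $\operatorname{div} V$:

$$f \mapsto \omega^0\ (= f) \mapsto d\omega^0\ (= df) = \omega_g^1 \mapsto g := \operatorname{grad} f, \tag{12.24'}$$

$$F \mapsto \omega_F^1 \mapsto d\omega_F^1 = \omega_r^2 \mapsto r := \operatorname{curl} F, \tag{12.25'}$$

$$V \mapsto \omega_V^2 \mapsto d\omega_V^2 = \omega_\rho^3 \mapsto \rho := \operatorname{div} V. \tag{12.26'}$$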

Examples 9, 11, and 12 show that when we do this in Cartesian coordinates, we
arrive at the expressions (12.24), (12.25), and (12.26) above for $\operatorname{grad} f$, $\operatorname{curl} F$, and
$\operatorname{div} V$. Thus these operators in field theory can be regarded as concrete manifestations
of the operation of differentiation of exterior forms, which is carried out in
a single manner on forms of any degree. More details on the gradient, curl, and
divergence will be given in Chap. 14.


12.5.4 Transformation of Vectors and Forms Under Mappings 

Let us consider in more detail what happens with functions (0-forms) under a map- 
ping of their domains. 

Let $\varphi : U \to V$ be a mapping of the domain $U \subset \mathbb{R}^m$ into the domain $V \subset \mathbb{R}^n$.
Under the mapping $\varphi$ each point $t \in U$ maps to a definite point $x = \varphi(t)$ of the
domain $V$.

If a function $f$ is defined on $V$, then, because of the mapping $\varphi : U \to V$, a
function $\varphi^* f$ naturally arises on the domain $U$, defined by the relation

$$(\varphi^* f)(t) := f(\varphi(t)),$$

that is, to find the value of $\varphi^* f$ at a point $t \in U$ one must send $t$ to the point
$x = \varphi(t) \in V$ and compute the value of $f$ at that point.

Thus, if the domain $U$ maps to the domain $V$ under the mapping $\varphi : U \to V$,
then the set of functions defined on $V$ maps (in the opposite direction) to the set of
functions defined on $U$ under the correspondence $f \mapsto \varphi^* f$ just defined.

In other words, we have shown that a mapping $\varphi^* : \Omega^0(V) \to \Omega^0(U)$ transforming
0-forms defined on $V$ into 0-forms defined on $U$ naturally arises from a mapping
$\varphi : U \to V$.

Now let us consider the general case of transformation of forms of any degree.
Let $\varphi : U \to V$ be a smooth mapping of a domain $U \subset \mathbb{R}^m$ into a domain
$V \subset \mathbb{R}^n$, and $\varphi'(t) : TU_t \to TV_{x=\varphi(t)}$ the mapping of tangent spaces corresponding
to $\varphi$, and let $\omega$ be a $p$-form in the domain $V$. Then one can assign to $\omega$ the $p$-form
$\varphi^*\omega$ in the domain $U$ defined at $t \in U$ on the set of vectors $\tau_1, \ldots, \tau_p \in TU_t$ by the
equality

$$\varphi^*\omega(t)(\tau_1, \ldots, \tau_p) := \omega(\varphi(t))\bigl(\varphi'(t)\tau_1, \ldots, \varphi'(t)\tau_p\bigr). \tag{12.27}$$

Thus to each smooth mapping $\varphi : U \to V$ there corresponds a mapping
$\varphi^* : \Omega^p(V) \to \Omega^p(U)$ that transforms forms defined on $V$ into forms defined on $U$. It
obviously follows from (12.27) that

$$\varphi^*(\omega' + \omega'') = \varphi^*(\omega') + \varphi^*(\omega''), \tag{12.28}$$

$$\varphi^*(\lambda\omega) = \lambda\varphi^*\omega, \quad \text{if } \lambda \in \mathbb{R}. \tag{12.29}$$

Recalling the rule $(\psi \circ \varphi)' = \psi' \circ \varphi'$ for differentiating the composition of the
mappings $\varphi : U \to V$, $\psi : V \to W$, we conclude in addition from (12.27) that

$$(\psi \circ \varphi)^* = \varphi^* \circ \psi^* \tag{12.30}$$

(the natural reverse path: the composition of the mappings
$\psi^* : \Omega^p(W) \to \Omega^p(V)$, $\varphi^* : \Omega^p(V) \to \Omega^p(U)$).
Now let us consider how to carry out the transformation of forms in practice. 


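Example 13 In the domain $V \subset \mathbb{R}^n$ let us take the 2-form $\omega = dx^{i_1} \wedge dx^{i_2}$. Let
$x^i = x^i(t^1, \ldots, t^m)$, $i = 1, \ldots, n$, be the coordinate expression for the mapping
$\varphi : U \to V$ of a domain $U \subset \mathbb{R}^m$ into $V$.

We wish to find the coordinate representation of the form $\varphi^*\omega$ in $U$. We take
a point $t \in U$ and vectors $\tau_1, \tau_2 \in TU_t$. The vectors $\xi_1 = \varphi'(t)\tau_1$ and
$\xi_2 = \varphi'(t)\tau_2$ correspond to them in the space $TV_{x=\varphi(t)}$. The coordinates $(\xi_1^1, \ldots, \xi_1^n)$
and $(\xi_2^1, \ldots, \xi_2^n)$ of these vectors can be expressed in terms of the coordinates
$(\tau_1^1, \ldots, \tau_1^m)$ and $(\tau_2^1, \ldots, \tau_2^m)$ of $\tau_1$ and $\tau_2$ using the Jacobian matrix via the
formulas

$$\xi_1^i = \frac{\partial x^i}{\partial t^j}(t)\,\tau_1^j, \qquad \xi_2^i = \frac{\partial x^i}{\partial t^j}(t)\,\tau_2^j, \qquad i = 1, \ldots, n.$$

(The summation on $j$ runs from 1 to $m$.)

Thus,

$$\begin{aligned}
\varphi^*\omega(t)(\tau_1, \tau_2) &:= \omega(\varphi(t))(\xi_1, \xi_2) = dx^{i_1} \wedge dx^{i_2}(\xi_1, \xi_2) = \begin{vmatrix} \xi_1^{i_1} & \xi_1^{i_2} \\ \xi_2^{i_1} & \xi_2^{i_2} \end{vmatrix} = \\
&= \begin{vmatrix} \dfrac{\partial x^{i_1}}{\partial t^{j_1}}\tau_1^{j_1} & \dfrac{\partial x^{i_2}}{\partial t^{j_2}}\tau_1^{j_2} \\ \dfrac{\partial x^{i_1}}{\partial t^{j_1}}\tau_2^{j_1} & \dfrac{\partial x^{i_2}}{\partial t^{j_2}}\tau_2^{j_2} \end{vmatrix} = \sum_{j_1, j_2 = 1}^m \frac{\partial x^{i_1}}{\partial t^{j_1}} \frac{\partial x^{i_2}}{\partial t^{j_2}} \begin{vmatrix} \tau_1^{j_1} & \tau_1^{j_2} \\ \tau_2^{j_1} & \tau_2^{j_2} \end{vmatrix} = \\
&= \sum_{j_1, j_2 = 1}^m \frac{\partial x^{i_1}}{\partial t^{j_1}} \frac{\partial x^{i_2}}{\partial t^{j_2}}\,dt^{j_1} \wedge dt^{j_2}(\tau_1, \tau_2) = \\
&= \sum_{1 \le j_1 < j_2 \le m} \Bigl(\frac{\partial x^{i_1}}{\partial t^{j_1}} \frac{\partial x^{i_2}}{\partial t^{j_2}} - \frac{\partial x^{i_1}}{\partial t^{j_2}} \frac{\partial x^{i_2}}{\partial t^{j_1}}\Bigr)\,dt^{j_1} \wedge dt^{j_2}(\tau_1, \tau_2) = \\
&= \sum_{1 \le j_1 < j_2 \le m} \begin{vmatrix} \dfrac{\partial x^{i_1}}{\partial t^{j_1}} & \dfrac{\partial x^{i_2}}{\partial t^{j_1}} \\ \dfrac{\partial x^{i_1}}{\partial t^{j_2}} & \dfrac{\partial x^{i_2}}{\partial t^{j_2}} \end{vmatrix}(t)\,dt^{j_1} \wedge dt^{j_2}(\tau_1, \tau_2).
\end{aligned}$$

Consequently, we have shown that

$$\varphi^*(dx^{i_1} \wedge dx^{i_2}) = \sum_{1 \le j_1 < j_2 \le m} \frac{\partial(x^{i_1}, x^{i_2})}{\partial(t^{j_1}, t^{j_2})}(t)\,dt^{j_1} \wedge dt^{j_2}.$$
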


If we use properties (12.28) and (12.29) for the operation of transformation of
forms⁹ and repeat the reasoning of the last example, we obtain the following equality:

$$\varphi^*\Bigl(\sum_{1 \le i_1 < \cdots < i_p \le n} a_{i_1 \ldots i_p}(x)\,dx^{i_1} \wedge \cdots \wedge dx^{i_p}\Bigr) = \sum_{\substack{1 \le i_1 < \cdots < i_p \le n \\ 1 \le j_1 < \cdots < j_p \le m}} a_{i_1 \ldots i_p}(x(t))\,\frac{\partial(x^{i_1}, \ldots, x^{i_p})}{\partial(t^{j_1}, \ldots, t^{j_p})}\,dt^{j_1} \wedge \cdots \wedge dt^{j_p}. \tag{12.31}$$

We remark that if we make the formal change of variable $x = x(t)$ in the form
that is the argument of $\varphi^*$ on the left, express the differentials $dx^1, \ldots, dx^n$ in terms
of the differentials $dt^1, \ldots, dt^m$, and gather like terms in the resulting expression,
using the properties of the exterior product, we obtain precisely the right-hand side
of Eq. (12.31).

Indeed, for each fixed choice of indices $i_1, \ldots, i_p$ we have

$$\begin{aligned}
a_{i_1 \ldots i_p}(x)\,dx^{i_1} \wedge \cdots \wedge dx^{i_p} &= a_{i_1 \ldots i_p}(x(t))\,\frac{\partial x^{i_1}}{\partial t^{j_1}} \cdots \frac{\partial x^{i_p}}{\partial t^{j_p}}\,dt^{j_1} \wedge \cdots \wedge dt^{j_p} = \\
&= \sum_{1 \le j_1 < \cdots < j_p \le m} a_{i_1 \ldots i_p}(x(t))\,\frac{\partial(x^{i_1}, \ldots, x^{i_p})}{\partial(t^{j_1}, \ldots, t^{j_p})}\,dt^{j_1} \wedge \cdots \wedge dt^{j_p}.
\end{aligned}$$

Summing such equalities over all ordered sets $1 \le i_1 < \cdots < i_p \le n$, we obtain the
right-hand side of (12.31).

Thus we have proved the following proposition, of great technical importance. 


Proposition If a differential form $\omega$ is defined in a domain $V \subset \mathbb{R}^n$ and $\varphi : U \to V$
is a smooth mapping of a domain $U \subset \mathbb{R}^m$ into $V$, then the coordinate expression
of the form $\varphi^*\omega$ can be obtained from the coordinate expression

$$\sum_{1 \le i_1 < \cdots < i_p \le n} a_{i_1 \ldots i_p}(x)\,dx^{i_1} \wedge \cdots \wedge dx^{i_p}$$

of the form $\omega$ by the direct change of variable $x = \varphi(t)$ (with subsequent transformations
in accordance with the properties of the exterior product).


⁹ If (12.29) is used pointwise, one can see that $\varphi^*(a(x)\omega) = a(\varphi(t))\varphi^*\omega$.
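
The coordinate formula of Example 13 can be tried out on a familiar change of variables. The sketch below computes the coefficients of $\varphi^*(dx^{i_1} \wedge dx^{i_2})$ from a Jacobian matrix supplied by the caller; the function names are illustrative, indices are 0-based in the code, and the polar-coordinate Jacobian is written out by hand as an assumed test case.

```python
import math
from itertools import combinations

def pullback_dx_wedge_dx(jac, i1, i2):
    """Coefficients of dt^{j1} ^ dt^{j2} (j1 < j2) in the pullback of dx^{i1} ^ dx^{i2}.

    jac[i][j] is the partial derivative of x^i with respect to t^j (0-based indices).
    """
    m = len(jac[0])
    return {
        (j1, j2): jac[i1][j1] * jac[i2][j2] - jac[i1][j2] * jac[i2][j1]
        for j1, j2 in combinations(range(m), 2)
    }

def polar_jacobian(r, phi):
    """Jacobian of x = r cos(phi), y = r sin(phi) with respect to (r, phi)."""
    return [[math.cos(phi), -r * math.sin(phi)],
            [math.sin(phi), r * math.cos(phi)]]

if __name__ == "__main__":
    # Pullback of dx ^ dy under polar coordinates: expected coefficient r.
    print(pullback_dx_wedge_dx(polar_jacobian(2.0, 0.7), 0, 1))
    # A curve t -> (x(t), y(t)) has m = 1, so no 2-form terms survive.
    print(pullback_dx_wedge_dx([[-0.5], [0.8]], 0, 1))
```

The first call reproduces the familiar $dx \wedge dy = r\,dr \wedge d\varphi$; the second returns an empty dictionary because a 2-form pulled back to a one-dimensional domain has no terms.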


Example 14 In particular, if $m=n=p$, relation (12.31) reduces to the equality

$$\varphi^*(dx^1\wedge\cdots\wedge dx^n) = \det\varphi'(t)\,dt^1\wedge\cdots\wedge dt^n. \tag{12.32}$$

Hence, if we write $f(x)\,dx^1\wedge\cdots\wedge dx^n$ in a multiple integral instead of $f(x)\,dx^1\cdots dx^n$, the formula

$$\int_{V=\varphi(U)} f(x)\,dx = \int_U f(\varphi(t))\det\varphi'(t)\,dt$$

for change of variable in a multiple integral via an orientation-preserving diffeomorphism (that is, when $\det\varphi'(t)>0$) could be obtained automatically by the formal substitution $x=\varphi(t)$, just as happened in the one-dimensional case, and it could be given the following form:

$$\int_{\varphi(U)}\omega = \int_U\varphi^*\omega. \tag{12.33}$$
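The change-of-variable formula of Example 14 can be checked numerically. The sketch below is only an illustration under stated assumptions: the polar map `phi`, the integrand $x^2+y^2$, the helper names, and the midpoint rule with a finite-difference Jacobian are all choices made here, not part of the text. It integrates $f(\varphi(t))\det\varphi'(t)$ over the parameter rectangle $0<r<1$, $0<\theta<2\pi$, which should approximate the integral of $f$ over the unit disk.

```python
import math

def phi(r, theta):
    """Polar-coordinate map (r, theta) -> (x, y); an illustrative choice."""
    return (r * math.cos(theta), r * math.sin(theta))

def jacobian_det(mapping, t1, t2, h=1e-6):
    """Central-difference estimate of det of the 2x2 Jacobian of `mapping`."""
    xa, ya = mapping(t1 + h, t2)
    xb, yb = mapping(t1 - h, t2)
    xc, yc = mapping(t1, t2 + h)
    xd, yd = mapping(t1, t2 - h)
    dx_dt1, dy_dt1 = (xa - xb) / (2 * h), (ya - yb) / (2 * h)
    dx_dt2, dy_dt2 = (xc - xd) / (2 * h), (yc - yd) / (2 * h)
    return dx_dt1 * dy_dt2 - dx_dt2 * dy_dt1

def integrate_pullback(f, mapping, a1, b1, a2, b2, n=200):
    """Midpoint rule for the integral of f(mapping(t)) * det mapping'(t)."""
    h1, h2 = (b1 - a1) / n, (b2 - a2) / n
    total = 0.0
    for i in range(n):
        t1 = a1 + (i + 0.5) * h1
        for j in range(n):
            t2 = a2 + (j + 0.5) * h2
            x, y = mapping(t1, t2)
            total += f(x, y) * jacobian_det(mapping, t1, t2)
    return total * h1 * h2

# Integral of x^2 + y^2 over the unit disk, computed in the (r, theta) rectangle.
disk_integral = integrate_pullback(lambda x, y: x * x + y * y,
                                   phi, 0.0, 1.0, 0.0, 2 * math.pi)
print(disk_integral, math.pi / 2)
```

The two printed numbers should agree to several digits, since the exact value of the integral is $\pi/2$.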


We remark in conclusion that if the degree $p$ of the form $\omega$ in the domain $V\subset\mathbb{R}^n$ is larger than the dimension $m$ of the domain $U\subset\mathbb{R}^m$ that is mapped into $V$ via $\varphi: U\to V$, then the form $\varphi^*\omega$ on $U$ corresponding to $\omega$ is obviously zero. Thus the mapping $\varphi^*:\Omega^p(V)\to\Omega^p(U)$ is not necessarily injective in general.

On the other hand, if $\varphi: U\to V$ has a smooth inverse $\varphi^{-1}: V\to U$, then by (12.30) and the equalities $\varphi^{-1}\circ\varphi = e_U$, $\varphi\circ\varphi^{-1} = e_V$, we find that $\varphi^*\circ(\varphi^{-1})^* = e_U^*$ and $(\varphi^{-1})^*\circ\varphi^* = e_V^*$. And, since $e_U^*$ and $e_V^*$ are the identity mappings on $\Omega^p(U)$ and $\Omega^p(V)$ respectively, the mappings $\varphi^*:\Omega^p(V)\to\Omega^p(U)$ and $(\varphi^{-1})^*:\Omega^p(U)\to\Omega^p(V)$, as one would expect, turn out to be inverses of each other. That is, in this case, the mapping $\varphi^*:\Omega^p(V)\to\Omega^p(U)$ is bijective.

We note finally that along with the properties (12.28)–(12.30) the mapping $\varphi^*$ that transfers forms, as one can verify, also satisfies the relation

$$\varphi^*(d\omega) = d(\varphi^*\omega). \tag{12.34}$$

This theoretically important equality shows in particular that the operation of differentiation of forms, which we defined in coordinate notation, is actually independent of the coordinate system in which the differential form $\omega$ is written. This will be discussed in more detail in Chap. 15.

12.5.5 Forms on Surfaces


Definition 3 We say that a differential $p$-form $\omega$ is defined on a smooth surface $S\subset\mathbb{R}^n$ if a $p$-form $\omega(x)$ is defined on the vectors of the tangent plane $TS_x$ to $S$ at each point $x\in S$.

Example 15 If the smooth surface $S$ is contained in the domain $D\subset\mathbb{R}^n$ in which a form $\omega$ is defined, then, since the inclusion $TS_x\subset TD_x$ holds at each point $x\in S$, one can consider the restriction of $\omega(x)$ to $TS_x$. In this way a form $\omega|_S$ arises, which it is natural to call the restriction of $\omega$ to $S$.

As we know, a surface can be defined parametrically, either locally or globally. Let $\varphi: U\to S=\varphi(U)\subset D$ be a parametrized smooth surface in the domain $D$ and $\omega$ a form on $D$. Then we can transfer the form $\omega$ to the domain $U$ of parameters and write $\varphi^*\omega$ in coordinate form in accordance with the algorithm given above. It is clear that the form $\varphi^*\omega$ in $U$ obtained in this way coincides with the form $\varphi^*(\omega|_S)$.

We remark that, since $\varphi'(t): TU_t\to TS_x$ is an isomorphism between $TU_t$ and $TS_x$ at every point $t\in U$, we can transfer forms both from $S$ to $U$ and from $U$ to $S$, and so, just as the smooth surfaces themselves are usually defined locally or globally by parameters, the forms on them, in the final analysis, are usually defined in the parameter domains of local charts.

Example 16 Let $\omega_V^2$ be the flux form considered in Example 8, generated by the velocity field $V$ of a flow in the domain $D$ of the oriented Euclidean space $\mathbb{R}^3$. If $S$ is a smooth oriented surface in $D$, one may consider the restriction $\omega_V^2|_S$ of the form $\omega_V^2$ to $S$. The form $\omega_V^2|_S$ so obtained characterizes the flux across each element of the surface $S$.

If $\varphi: I\to S$ is a local chart of the surface $S$, then, making the change of variable $x=\varphi(t)$ in the coordinate expression (12.22) for the form $\omega_V^2$, we obtain the coordinate expression for the form $\varphi^*\omega_V^2 = \varphi^*(\omega_V^2|_S)$, which is defined on the square $I$, in these local coordinates of the surface.

Example 17 Let $\omega_F^1$ be the work form considered in Example 7, generated by the force field $F$ acting in a domain $D$ of Euclidean space. Let $\varphi: I\to\varphi(I)\subset D$ be a smooth path ($\varphi$ is not necessarily a homeomorphism). Then, in accordance with the general principle of restriction and transfer of forms, a form $\varphi^*\omega_F^1$ arises on the closed interval $I$, whose coordinate representation $a(t)\,dt$ can be obtained by the change of variable $x=\varphi(t)$ in the coordinate expression (12.21) for the form $\omega_F^1$.


12.5.6 Problems and Exercises 

1. Compute the values of the differential forms $\omega$ in $\mathbb{R}^n$ given below on the indicated sets of vectors:

a) $\omega = x^2\,dx^1$ on the vector $\xi=(1,2,3)\in T\mathbb{R}^3_{(3,2,1)}$.

b) $\omega = dx^1\wedge dx^3 + x^1\,dx^2\wedge dx^4$ on the ordered pair of vectors $\xi_1,\xi_2\in T\mathbb{R}^4_{(1,0,0,0)}$.

c) $\omega = df$, where $f = x^1+2x^2+\cdots+nx^n$, and $\xi=(1,-1,\ldots,(-1)^{n-1})\in T\mathbb{R}^n_{(1,1,\ldots,1)}$.

2. a) Verify that the form $dx^{i_1}\wedge\cdots\wedge dx^{i_k}$ is identically zero if the indices $i_1,\ldots,i_k$ are not all distinct.

b) Explain why there are no nonzero skew-symmetric forms of degree $p>n$ on an $n$-dimensional vector space.

c) Simplify the expression for the form

$$2\,dx^1\wedge dx^3\wedge dx^2 + 3\,dx^3\wedge dx^1\wedge dx^2 - dx^2\wedge dx^3\wedge dx^1.$$

d) Remove the parentheses and gather like terms:

$$(x^1\,dx^2 + x^2\,dx^1)\wedge(x^3\,dx^1\wedge dx^2 + x^2\,dx^1\wedge dx^3 + x^1\,dx^2\wedge dx^3).$$

e) Write the form $df\wedge dg$, where $f=\ln(1+|x|^2)$, $g=\sin|x|$, and $x=(x^1,x^2,x^3)$, as a linear combination of the forms $dx^{i_1}\wedge dx^{i_2}$, $1\le i_1<i_2\le 3$.

f) Verify that in $\mathbb{R}^n$

$$df^1\wedge\cdots\wedge df^n(x) = \det\Bigl(\frac{\partial f^i}{\partial x^j}\Bigr)(x)\,dx^1\wedge\cdots\wedge dx^n.$$

g) Carry out all the computations and show that for $1\le k\le n$

$$df^1\wedge\cdots\wedge df^k = \sum_{1\le i_1<\cdots<i_k\le n}\det\begin{pmatrix}\dfrac{\partial f^1}{\partial x^{i_1}} & \cdots & \dfrac{\partial f^1}{\partial x^{i_k}}\\ \vdots & \ddots & \vdots\\ \dfrac{\partial f^k}{\partial x^{i_1}} & \cdots & \dfrac{\partial f^k}{\partial x^{i_k}}\end{pmatrix}dx^{i_1}\wedge\cdots\wedge dx^{i_k}.$$


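3. a) Show that a form $\alpha$ of even degree commutes with any form $\beta$, that is, $\alpha\wedge\beta = \beta\wedge\alpha$.

b) Let $\omega = \sum_{i=1}^n dp_i\wedge dq^i$ and $\omega^n = \omega\wedge\cdots\wedge\omega$ ($n$ factors). Verify that

$$\omega^n = n!\,dp_1\wedge dq^1\wedge\cdots\wedge dp_n\wedge dq^n = (-1)^{\frac{n(n-1)}{2}}\,n!\,dp_1\wedge\cdots\wedge dp_n\wedge dq^1\wedge\cdots\wedge dq^n.$$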

4. a) Write the form $\omega = df$, where $f(x) = (x^1)^2+(x^2)^2+\cdots+(x^n)^2$, as a combination of the forms $dx^1,\ldots,dx^n$ and find the differential $d\omega$ of $\omega$.

b) Verify that $d^2f = 0$ for any function $f\in C^{(2)}(D,\mathbb{R})$, where $d^2 = d\circ d$, and $d$ is exterior differentiation.

c) Show that if the coefficients $a_{i_1,\ldots,i_k}$ of the form $\omega = a_{i_1,\ldots,i_k}(x)\,dx^{i_1}\wedge\cdots\wedge dx^{i_k}$ belong to the class $C^{(2)}(D,\mathbb{R})$, then $d^2\omega = 0$ in the domain $D$.

d) Find the exterior differential of the form $\dfrac{y\,dx - x\,dy}{x^2+y^2}$ in its domain of definition.

5. If the product $dx^1\cdots dx^n$ in the multiple integral $\int_D f(x)\,dx^1\cdots dx^n$ is interpreted as the form $dx^1\wedge\cdots\wedge dx^n$, then, by the result of Example 14, we have the possibility of formally obtaining the integrand in the formula for change of variable in a multiple integral. Using this recommendation, carry out the following changes of variable from Cartesian coordinates:

a) to polar coordinates in $\mathbb{R}^2$,

b) to cylindrical coordinates in $\mathbb{R}^3$,

c) to spherical coordinates in $\mathbb{R}^3$.

6. Find the restriction of the following forms:

a) $dx^i$ to the hyperplane $x^i = 1$.

b) $dx\wedge dy$ to the curve $x=x(t)$, $y=y(t)$, $a<t<b$.

c) $dx\wedge dy$ to the plane in $\mathbb{R}^3$ defined by the equation $x=c$.

d) $dy\wedge dz + dz\wedge dx + dx\wedge dy$ to the faces of the standard unit cube in $\mathbb{R}^3$.

e) $\omega_i = dx^1\wedge\cdots\wedge dx^{i-1}\wedge\widehat{dx^i}\wedge dx^{i+1}\wedge\cdots\wedge dx^n$ to the faces of the standard unit cube in $\mathbb{R}^n$. The symbol $\widehat{\ }$ stands over the differential $dx^i$ that is to be omitted in the product.

7. Express the restriction of the following forms to the sphere of radius $R$ with center at the origin in spherical coordinates on $\mathbb{R}^3$:

a) $dx$,

b) $dy$,

c) $dy\wedge dz$.

8. The mapping $\varphi:\mathbb{R}^2\to\mathbb{R}^2$ is given in the form $(u,v)\mapsto(u\cdot v, 1) = (x,y)$. Find:

a) $\varphi^*(dx)$,

b) $\varphi^*(dy)$,

c) $\varphi^*(y\,dx)$.

9. Verify that the exterior differential $d:\Omega^p(D)\to\Omega^{p+1}(D)$ has the following properties:

a) $d(\omega_1+\omega_2) = d\omega_1 + d\omega_2$,

b) $d(\omega_1\wedge\omega_2) = d\omega_1\wedge\omega_2 + (-1)^{\deg\omega_1}\omega_1\wedge d\omega_2$, where $\deg\omega_1$ is the degree of the form $\omega_1$.

c) $\forall\omega\in\Omega^p$: $d(d\omega) = 0$.

d) $\forall f\in\Omega^0$: $df = \sum_{i=1}^n\dfrac{\partial f}{\partial x^i}\,dx^i$.

Show that there is only one mapping $d:\Omega^p(D)\to\Omega^{p+1}(D)$ having properties a), b), c), and d).

10. Verify that the mapping $\varphi^*:\Omega^p(V)\to\Omega^p(U)$ corresponding to a mapping $\varphi: U\to V$ has the following properties:

a) $\varphi^*(\omega_1+\omega_2) = \varphi^*\omega_1+\varphi^*\omega_2$.

b) $\varphi^*(\omega_1\wedge\omega_2) = \varphi^*\omega_1\wedge\varphi^*\omega_2$.

c) $d\varphi^*\omega = \varphi^*d\omega$.

d) If there is a mapping $\psi: V\to W$, then $(\psi\circ\varphi)^* = \varphi^*\circ\psi^*$.

11. Show that a smooth $k$-dimensional surface is orientable if and only if there exists a $k$-form on it that is not degenerate at any point.



Chapter 13 

Line and Surface Integrals 


13.1 The Integral of a Differential Form 

13.1.1 The Original Problems, Suggestive Considerations, 
Examples 

a. The Work of a Field 

Let $F(x)$ be a continuous force field acting in a domain $G$ of the Euclidean space $\mathbb{R}^n$. The displacement of a test particle in the field is accompanied by work. We ask how we can compute the work done by the field in moving a unit test particle along a given trajectory, more precisely, a smooth path $\gamma: I\to\gamma(I)\subset G$.

We have already touched on this problem when we studied the applications of the 
definite integral. For that reason we can merely recall the solution of the problem 
at this point, noting certain elements of the construction that will be useful in what 
follows. 

It is known that in a constant field $F$ the displacement by a vector $\xi$ is associated with an amount of work $\langle F,\xi\rangle$.

Let $t\mapsto x(t)$ be a smooth mapping $\gamma: I\to G$ defined on the closed interval $I=\{t\in\mathbb{R}\mid a\le t\le b\}$.

We take a sufficiently fine partition of the closed interval $[a,b]$. Then on each interval $I_i=\{t\in I\mid t_{i-1}\le t\le t_i\}$ of the partition we have the equality $x(t)-x(t_i)\approx x'(t_i)(t-t_i)$ up to infinitesimals of higher order. To the displacement vector $\tau_i = t_{i+1}-t_i$ from the point $t_i$ (Fig. 13.1) there corresponds a displacement of $x(t_i)$ in $\mathbb{R}^n$ by the vector $\Delta x_i = x_{i+1}-x_i$, which can be regarded as equal to the tangent vector $\xi_i = \dot x(t_i)\tau_i$ to the trajectory at $x(t_i)$ with the same precision. Since the field $F(x)$ is continuous, it can be regarded as locally constant, and for that reason we can compute the work $\Delta A_i$ corresponding to the (time) interval $I_i$ with small relative error as

$$\Delta A_i\approx\langle F(x_i),\xi_i\rangle$$

or

$$\Delta A_i\approx\langle F(x(t_i)),\dot x(t_i)\tau_i\rangle.$$

Hence,

$$A = \sum_i\Delta A_i\approx\sum_i\langle F(x(t_i)),\dot x(t_i)\rangle\,\Delta t_i,$$

and so, passing to the limit as the partition of the closed interval $I$ is refined, we find that

$$A = \int_a^b\langle F(x(t)),\dot x(t)\rangle\,dt. \tag{13.1}$$

If the expression $\langle F(x(t)),\dot x(t)\rangle\,dt$ is rewritten as $\langle F(x),dx\rangle$, then, assuming the coordinates in $\mathbb{R}^n$ are Cartesian coordinates, we can give this expression the form $F^1\,dx^1+\cdots+F^n\,dx^n$, after which we can write (13.1) as

$$A = \int_\gamma F^1\,dx^1+\cdots+F^n\,dx^n \tag{13.2}$$

or as

$$A = \int_\gamma\omega_F^1. \tag{13.2'}$$

Formula (13.1) provides the precise meaning of the integrals of the work 1-form along the path $\gamma$ written in formulas (13.2) and (13.2').

Example 1 Consider the force field $F = \bigl(-\frac{y}{x^2+y^2},\ \frac{x}{x^2+y^2}\bigr)$ defined at all points of the plane $\mathbb{R}^2$ except the origin. Let us compute the work of this field along the curve $\gamma_1$ defined as $x=\cos t$, $y=\sin t$, $0\le t\le 2\pi$, and along the curve $\gamma_2$ defined by $x=2+\cos t$, $y=\sin t$, $0\le t\le 2\pi$. According to formulas (13.1), (13.2), and (13.2'), we find

$$\int_{\gamma_1}\omega_F^1 = \int_{\gamma_1}-\frac{y}{x^2+y^2}\,dx+\frac{x}{x^2+y^2}\,dy = \int_0^{2\pi}\Bigl(-\frac{\sin t\cdot(-\sin t)}{\cos^2t+\sin^2t}+\frac{\cos t\cdot\cos t}{\cos^2t+\sin^2t}\Bigr)dt = 2\pi$$

and

$$\int_{\gamma_2}\omega_F^1 = \int_{\gamma_2}\frac{-y\,dx+x\,dy}{x^2+y^2} = \int_0^{2\pi}\frac{-\sin t(-\sin t)+(2+\cos t)\cos t}{(2+\cos t)^2+\sin^2t}\,dt = \int_0^{2\pi}\frac{1+2\cos t}{5+4\cos t}\,dt =$$

$$= \int_0^{2\pi}\Bigl(\frac12-\frac32\cdot\frac{1}{5+4\cos t}\Bigr)dt = \pi-\frac32\cdot\frac{2\pi}{3} = 0.$$

Here we have used the standard integral $\int_0^{2\pi}\frac{dt}{a+b\cos t} = \frac{2\pi}{\sqrt{a^2-b^2}}$, valid for $a>|b|$, with $a=5$ and $b=4$.
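The two values in Example 1 can be reproduced numerically from formula (13.1). The following is a minimal sketch, assuming a midpoint-rule quadrature; the function names `force`, `work`, and `circle_velocity` are introduced here for illustration only.

```python
import math

def force(x, y):
    """Field F = (-y, x) / (x^2 + y^2) of Example 1 (undefined at the origin)."""
    r2 = x * x + y * y
    return (-y / r2, x / r2)

def work(path, velocity, a, b, n=20000):
    """Midpoint-rule approximation of formula (13.1) on [a, b]."""
    h = (b - a) / n
    total = 0.0
    for i in range(n):
        t = a + (i + 0.5) * h
        fx, fy = force(*path(t))
        vx, vy = velocity(t)
        total += (fx * vx + fy * vy) * h
    return total

def circle_velocity(t):
    # Both circles have the same derivative (-sin t, cos t).
    return (-math.sin(t), math.cos(t))

# gamma_1 encircles the origin; gamma_2 is the same circle shifted to (2, 0).
work_gamma1 = work(lambda t: (math.cos(t), math.sin(t)),
                   circle_velocity, 0.0, 2 * math.pi)
work_gamma2 = work(lambda t: (2 + math.cos(t), math.sin(t)),
                   circle_velocity, 0.0, 2 * math.pi)
print(work_gamma1, work_gamma2)
```

The first value should be close to $2\pi$ and the second close to zero, in agreement with the computation above: the work over a closed path depends on whether the path encircles the singular point of the field.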


Example 2 Let $\mathbf{r}$ be the radius vector of a point $(x,y,z)\in\mathbb{R}^3$ and $r=|\mathbf{r}|$. Suppose a force field $F=f(r)\mathbf{r}$ is defined everywhere in $\mathbb{R}^3$ except at the origin. This is a so-called central force field. Let us find the work of $F$ on a path $\gamma:[0,1]\to\mathbb{R}^3\setminus 0$. Using (13.2), we find

$$\int_\gamma f(r)(x\,dx+y\,dy+z\,dz) = \frac12\int_\gamma f(r)\,d(x^2+y^2+z^2) = \frac12\int_0^1 f(r(t))\,dr^2(t) = \frac12\int_0^1 f\bigl(\sqrt{u(t)}\bigr)\,du(t) = \frac12\int_{r_0^2}^{r_1^2}f(\sqrt u)\,du = \Phi(r_0,r_1).$$

Here, as one can see, we have set $x^2(t)+y^2(t)+z^2(t) = r^2(t)$, $r^2(t)=u(t)$, $r_0=r(0)$, and $r_1=r(1)$.

Thus in any central field the work on a path $\gamma$ has turned out to depend only on the distances $r_0$ and $r_1$ of the beginning and end of the path from the center $0$ of the field.

In particular, for the gravitational field $\frac{1}{r^3}\mathbf{r}$ of a unit point mass located at the origin, we obtain

$$\Phi(r_0,r_1) = \frac12\int_{r_0^2}^{r_1^2}\frac{1}{u^{3/2}}\,du = \frac{1}{r_0}-\frac{1}{r_1}.$$
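The conclusion of Example 2 can be illustrated numerically. The sketch below takes the field $\frac{1}{r^3}\mathbf{r}$ and two different paths that both start at distance $r_0=1$ and end at distance $r_1=2$ from the origin; the paths `segment` and `spiral`, the helper names, and the quadrature (midpoint rule with a central-difference velocity) are assumptions made here for illustration.

```python
import math

def central_field(x):
    """F = f(r) r with f(r) = 1 / r^3, as in the last formula of Example 2."""
    r = math.sqrt(sum(c * c for c in x))
    return [c / r**3 for c in x]

def work_along(path, field, n=20000, h=1e-6):
    """Midpoint-rule value of (13.1) on [0, 1]; velocity by central differences."""
    dt = 1.0 / n
    total = 0.0
    for i in range(n):
        t = (i + 0.5) * dt
        velocity = [(p - q) / (2 * h) for p, q in zip(path(t + h), path(t - h))]
        total += sum(f * v for f, v in zip(field(path(t)), velocity)) * dt
    return total

def segment(t):
    # Straight segment from (1, 0, 0) to (0, 2, 0); it avoids the origin.
    return (1.0 - t, 2.0 * t, 0.0)

def spiral(t):
    # Distance from the origin is exactly 1 + t; the direction is a unit vector.
    r = 1.0 + t
    return (r * math.cos(3 * t) * math.cos(t),
            r * math.sin(3 * t) * math.cos(t),
            r * math.sin(t))

work_segment = work_along(segment, central_field)
work_spiral = work_along(spiral, central_field)
print(work_segment, work_spiral)
```

Both values should be close to $\frac{1}{r_0}-\frac{1}{r_1} = \frac12$, even though the two paths have different endpoints and shapes.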


b. The Flux Across a Surface 

Suppose there is a steady flow of liquid (or gas) in a domain $G$ of the oriented Euclidean space $\mathbb{R}^3$ and that $x\mapsto V(x)$ is the velocity field of this flow. In addition, suppose that a smooth oriented surface $S$ has been chosen in $G$. For definiteness we shall suppose that the orientation of $S$ is given by a field of normal vectors. We ask how to determine the (volumetric) outflow or flux of fluid across the surface $S$. More precisely, we ask how to find the volume of fluid that flows across the surface $S$ per unit time in the direction indicated by the orienting field of normals to the surface.

[Fig. 13.2]

To solve the problem, we remark that if the velocity field of the flow is constant and equal to $V$, then the flow per unit time across a parallelogram $\Pi$ spanned by vectors $\xi_1$ and $\xi_2$ equals the volume of the parallelepiped constructed on the vectors $V,\xi_1,\xi_2$. If $\eta$ is normal to $\Pi$ and we seek the flux across $\Pi$ in the direction of $\eta$, it equals the scalar triple product $(V,\xi_1,\xi_2)$, provided $\eta$ and the frame $\xi_1,\xi_2$ give $\Pi$ the same orientation (that is, if $\eta,\xi_1,\xi_2$ is a frame having the given orientation in $\mathbb{R}^3$). If the frame $\xi_1,\xi_2$ gives the orientation opposite to the one given by $\eta$ in $\Pi$, then the flow in the direction of $\eta$ is $-(V,\xi_1,\xi_2)$.

We now return to the original statement of the problem. For simplicity let us assume that the entire surface $S$ admits a smooth parametrization $\varphi: I\to S\subset G$, where $I$ is a two-dimensional interval in the plane $\mathbb{R}^2$. We partition $I$ into small intervals $I_i$ (Fig. 13.2). We approximate the image $\varphi(I_i)$ of each such interval by the parallelogram spanned by the images $\xi_1=\varphi'(t_i)\tau_1$ and $\xi_2=\varphi'(t_i)\tau_2$ of the displacement vectors $\tau_1,\tau_2$ along the coordinate directions. Assuming that $V(x)$ varies by only a small amount inside the piece of surface $\varphi(I_i)$ and replacing $\varphi(I_i)$ by this parallelogram, we may assume that the flux $\Delta\mathcal{F}_i$ across the piece $\varphi(I_i)$ of the surface is equal, with small relative error, to the flux of a constant velocity field $V(x_i)=V(\varphi(t_i))$ across the parallelogram spanned by the vectors $\xi_1,\xi_2$.

Assuming that the frame $\xi_1,\xi_2$ gives the same orientation on $S$ as $\eta$, we find

$$\Delta\mathcal{F}_i\approx(V(x_i),\xi_1,\xi_2).$$

Summing the elementary fluxes, we obtain

$$\mathcal{F} = \sum_i\Delta\mathcal{F}_i\approx\sum_i\omega_V^2(x_i)(\xi_1,\xi_2),$$

where $\omega_V^2(x) = (V(x),\cdot,\cdot)$ is the flux 2-form (studied in Example 8 of Sect. 12.5). If we pass to the limit, taking ever finer partitions $P$ of the interval $I$, it is natural to assume that

$$\mathcal{F} := \lim_{\lambda(P)\to0}\sum_i\omega_V^2(x_i)(\xi_1,\xi_2) =: \int_S\omega_V^2. \tag{13.3}$$

This last symbol is the integral of the 2-form $\omega_V^2$ over the oriented surface $S$.

Recalling (formula (12.22) of Sect. 12.5) the coordinate expression for the flux form $\omega_V^2$ in Cartesian coordinates, we may now also write

$$\mathcal{F} = \int_S V^1\,dx^2\wedge dx^3+V^2\,dx^3\wedge dx^1+V^3\,dx^1\wedge dx^2. \tag{13.4}$$

We have discussed here only the general principle for solving this problem. In essence all we have done is to give the precise definition (13.3) of the flux $\mathcal{F}$ and introduce certain notation (13.3) and (13.4); we have still not obtained any effective computational formula similar to formula (13.1) for the work.

We remark that formula (13.1) can be obtained from (13.2) by replacing $x^1,\ldots,x^n$ with the functions $(x^1,\ldots,x^n)(t) = x(t)$ that define the path $\gamma$. We recall (Sect. 12.5) that such a substitution can be interpreted as the transfer of the form $\omega$ defined in $G$ to the closed interval $I=[a,b]$.

In a completely analogous way, a computational formula for the flux can be obtained by direct substitution of the parametric equations of the surface into (13.4).

In fact,

$$\omega_V^2(x_i)(\xi_1,\xi_2) = \omega_V^2(\varphi(t_i))(\varphi'(t_i)\tau_1,\varphi'(t_i)\tau_2) = (\varphi^*\omega_V^2)(t_i)(\tau_1,\tau_2)$$

and

$$\sum_i\omega_V^2(x_i)(\xi_1,\xi_2) = \sum_i(\varphi^*\omega_V^2)(t_i)(\tau_1,\tau_2).$$

The form $\varphi^*\omega_V^2$ is defined on a two-dimensional interval $I\subset\mathbb{R}^2$. Any 2-form in $I$ has the form $f(t)\,dt^1\wedge dt^2$, where $f$ is a function on $I$ depending on the form. Therefore

$$\varphi^*\omega_V^2(t_i)(\tau_1,\tau_2) = f(t_i)\,dt^1\wedge dt^2(\tau_1,\tau_2).$$

But $dt^1\wedge dt^2(\tau_1,\tau_2) = \tau_1^1\cdot\tau_2^2$ is the area of the rectangle $I_i$ spanned by the orthogonal vectors $\tau_1,\tau_2$.

Thus,

$$\sum_i f(t_i)\,dt^1\wedge dt^2(\tau_1,\tau_2) = \sum_i f(t_i)|I_i|.$$

As the partition is refined we obtain in the limit

$$\int_I f(t)\,dt^1\wedge dt^2 = \int_I f(t)\,dt^1\,dt^2, \tag{13.5}$$

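where, according to (13.3), the left-hand side contains the integral of the 2-form $\omega^2 = f(t)\,dt^1\wedge dt^2$ over the elementary oriented surface $I$, and the right-hand side the integral of the function $f$ over the rectangle $I$.

It remains only to recall that the coordinate representation $f(t)\,dt^1\wedge dt^2$ of the form $\varphi^*\omega_V^2$ is obtained from the coordinate expression for the form $\omega_V^2$ by the direct substitution $x=\varphi(t)$, where $\varphi: I\to G$ is a chart of the surface $S$.

Carrying out this change of variable, we obtain from (13.4)

$$\int_{S=\varphi(I)}\omega_V^2 = \int_I\varphi^*\omega_V^2 = \int_I\Biggl(V^1(\varphi(t))\begin{vmatrix}\dfrac{\partial x^2}{\partial t^1} & \dfrac{\partial x^3}{\partial t^1}\\[1ex] \dfrac{\partial x^2}{\partial t^2} & \dfrac{\partial x^3}{\partial t^2}\end{vmatrix} + V^2(\varphi(t))\begin{vmatrix}\dfrac{\partial x^3}{\partial t^1} & \dfrac{\partial x^1}{\partial t^1}\\[1ex] \dfrac{\partial x^3}{\partial t^2} & \dfrac{\partial x^1}{\partial t^2}\end{vmatrix} + V^3(\varphi(t))\begin{vmatrix}\dfrac{\partial x^1}{\partial t^1} & \dfrac{\partial x^2}{\partial t^1}\\[1ex] \dfrac{\partial x^1}{\partial t^2} & \dfrac{\partial x^2}{\partial t^2}\end{vmatrix}\Biggr)dt^1\wedge dt^2.$$

This last integral, as Eq. (13.5) shows, is the ordinary Riemann integral over the rectangle $I$.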

Thus we have found that

$$\mathcal{F} = \int_I\begin{vmatrix}V^1(\varphi(t)) & V^2(\varphi(t)) & V^3(\varphi(t))\\[1ex] \dfrac{\partial\varphi^1}{\partial t^1}(t) & \dfrac{\partial\varphi^2}{\partial t^1}(t) & \dfrac{\partial\varphi^3}{\partial t^1}(t)\\[1ex] \dfrac{\partial\varphi^1}{\partial t^2}(t) & \dfrac{\partial\varphi^2}{\partial t^2}(t) & \dfrac{\partial\varphi^3}{\partial t^2}(t)\end{vmatrix}dt^1\,dt^2, \tag{13.6}$$

where $x=\varphi(t) = (\varphi^1,\varphi^2,\varphi^3)(t^1,t^2)$ is a chart of the surface $S$ defining the same orientation as the field of normals we have given. If the chart $\varphi: I\to S$ gives $S$ the opposite orientation, Eq. (13.6) does not generally hold. But, as follows from the considerations at the beginning of this subsection, the left- and right-hand sides will differ only in sign in that case.

The final formula (13.6) is obviously merely the limit of the sums of the elementary fluxes $\Delta\mathcal{F}_i\approx(V(x_i),\xi_1,\xi_2)$ familiar to us, written accurately in the coordinates $t^1$ and $t^2$.

We have considered the case of a surface defined by a single chart. In general a smooth surface can be decomposed into smooth pieces $S_i$ having essentially no intersections with one another, and then we can find the flux through $S$ as the sum of the fluxes through the pieces $S_i$.


Example 3 Suppose a medium is advancing with constant velocity $V=(1,0,0)$. If we take any closed surface in the domain of the flow, then, since the density of the medium does not change, the amount of matter in the volume bounded by this surface must remain constant. Hence the total flux of the medium through such a surface must be zero.

In this case, let us check formula (13.6) by taking $S$ to be the sphere $x^2+y^2+z^2=R^2$.

Up to a set of area zero, which is therefore negligible, this sphere can be defined parametrically

$$x = R\cos\psi\cos\varphi,\qquad y = R\cos\psi\sin\varphi,\qquad z = R\sin\psi,$$

where $0<\varphi<2\pi$ and $-\pi/2<\psi<\pi/2$.

After these relations and the relation $V=(1,0,0)$ are substituted in (13.6), we obtain

$$\mathcal{F} = \int_I\begin{vmatrix}\dfrac{\partial x^2}{\partial t^1} & \dfrac{\partial x^3}{\partial t^1}\\[1ex] \dfrac{\partial x^2}{\partial t^2} & \dfrac{\partial x^3}{\partial t^2}\end{vmatrix}d\varphi\,d\psi = R^2\int_{-\pi/2}^{\pi/2}\cos^2\psi\,d\psi\int_0^{2\pi}\cos\varphi\,d\varphi = 0,$$

where $(t^1,t^2)=(\varphi,\psi)$ and $(x^1,x^2,x^3)=(x,y,z)$.

Since the integral equals zero, we have not even bothered to consider whether it was the inward or outward flow we were computing.


Example 4 Suppose the velocity field of a medium moving in $\mathbb{R}^3$ is defined in Cartesian coordinates $x,y,z$ by the equality $V(x,y,z) = (V^1,V^2,V^3)(x,y,z) = (x,y,z)$. Let us find the flux through the sphere $x^2+y^2+z^2=R^2$ into the ball that it bounds (that is, in the direction of the inward normal) in this case.

Taking the parametrization of the sphere given in the last example, and carrying out the substitution in the right-hand side of (13.6), we find that

$$\int_0^{2\pi}d\varphi\int_{-\pi/2}^{\pi/2}\begin{vmatrix}R\cos\psi\cos\varphi & R\cos\psi\sin\varphi & R\sin\psi\\ -R\cos\psi\sin\varphi & R\cos\psi\cos\varphi & 0\\ -R\sin\psi\cos\varphi & -R\sin\psi\sin\varphi & R\cos\psi\end{vmatrix}d\psi = \int_0^{2\pi}d\varphi\int_{-\pi/2}^{\pi/2}R^3\cos\psi\,d\psi = 4\pi R^3.$$

We now check to see whether the orientation of the sphere given by the curvilinear coordinates $(\varphi,\psi)$ agrees with that given by the inward normal. It is easy to verify that they do not agree. Hence the required flux is given by $\mathcal{F} = -4\pi R^3$.

In this case the result is easy to verify: the velocity vector $V$ of the flow has magnitude equal to $R$ at each point of the sphere, is orthogonal to the sphere, and points outward. Therefore the outward flux from the inside equals the area of the sphere $4\pi R^2$ multiplied by $R$. The flux in the opposite direction is then $-4\pi R^3$.
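Formula (13.6) lends itself to a direct numerical check of Examples 3 and 4. The sketch below is an illustration under assumptions made here: the radius `R = 2.0`, the helper names, the midpoint rule, and the central-difference tangent vectors are not part of the text. It also evaluates the radial field with the two parameters interchanged, which reverses the orientation of the chart.

```python
import math

def sphere_chart(phi, psi, radius):
    """Parametrization of the sphere used in Examples 3 and 4."""
    return (radius * math.cos(psi) * math.cos(phi),
            radius * math.cos(psi) * math.sin(phi),
            radius * math.sin(psi))

def det3(a, b, c):
    """Determinant of the 3x3 matrix with rows a, b, c."""
    return (a[0] * (b[1] * c[2] - b[2] * c[1])
            - a[1] * (b[0] * c[2] - b[2] * c[0])
            + a[2] * (b[0] * c[1] - b[1] * c[0]))

def flux(field, chart, a1, b1, a2, b2, n=200, h=1e-6):
    """Midpoint-rule approximation of the right-hand side of (13.6)."""
    h1, h2 = (b1 - a1) / n, (b2 - a2) / n
    total = 0.0
    for i in range(n):
        t1 = a1 + (i + 0.5) * h1
        for j in range(n):
            t2 = a2 + (j + 0.5) * h2
            # Tangent vectors along t1 and t2 by central differences.
            d1 = [(p - q) / (2 * h)
                  for p, q in zip(chart(t1 + h, t2), chart(t1 - h, t2))]
            d2 = [(p - q) / (2 * h)
                  for p, q in zip(chart(t1, t2 + h), chart(t1, t2 - h))]
            total += det3(field(chart(t1, t2)), d1, d2)
    return total * h1 * h2

R = 2.0  # illustrative radius

def chart(phi, psi):
    return sphere_chart(phi, psi, R)

def swapped_chart(psi, phi):
    # Same surface, parameters in the opposite order: opposite orientation.
    return sphere_chart(phi, psi, R)

half_pi = math.pi / 2
flux_constant = flux(lambda x: (1.0, 0.0, 0.0), chart,
                     0.0, 2 * math.pi, -half_pi, half_pi)
flux_radial = flux(lambda x: x, chart, 0.0, 2 * math.pi, -half_pi, half_pi)
flux_radial_swapped = flux(lambda x: x, swapped_chart,
                           -half_pi, half_pi, 0.0, 2 * math.pi)
print(flux_constant, flux_radial, flux_radial_swapped, 4 * math.pi * R**3)
```

The constant field should give a value close to zero, the radial field a value close to $4\pi R^3$ in the chart $(\varphi,\psi)$, and the interchanged chart a value close to $-4\pi R^3$, in line with the remark that a chart of the opposite orientation changes only the sign.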


13.1.2 Definition of the Integral of a Form over an Oriented 
Surface 

The solution of the problems considered in Sect. 13.1.1 leads to the definition of the integral of a $k$-form over a $k$-dimensional surface.

First let $S$ be a smooth $k$-dimensional surface in $\mathbb{R}^n$, defined by one standard chart $\varphi: I\to S$. Suppose a $k$-form $\omega$ is defined on $S$. The integral of the form $\omega$ over the parametrized surface $\varphi: I\to S$ is then constructed as follows.

Take a partition $P$ of the $k$-dimensional standard interval $I\subset\mathbb{R}^k$ induced by partitions of its projections on the coordinate axes (closed intervals). In each interval $I_i$ of the partition $P$ take the vertex $t_i$ having minimal coordinate values and attach to it the $k$ vectors $\tau_1,\ldots,\tau_k$ that go along the direction of the coordinate axes to the $k$ vertices of $I_i$ adjacent to $t_i$ (Fig. 13.2). Find the vectors $\xi_1=\varphi'(t_i)\tau_1,\ldots,\xi_k=\varphi'(t_i)\tau_k$ of the tangent space $TS_{x_i=\varphi(t_i)}$, then compute $\omega(x_i)(\xi_1,\ldots,\xi_k) =: (\varphi^*\omega)(t_i)(\tau_1,\ldots,\tau_k)$, and form the Riemann sum $\sum_i\omega(x_i)(\xi_1,\ldots,\xi_k)$. Then pass to the limit as the mesh $\lambda(P)$ of the partition tends to zero.

Thus we adopt the following definition: 


Definition 1 (Integral of a $k$-form $\omega$ over a given chart $\varphi: I\to S$ of a smooth $k$-dimensional surface.)

$$\int_S\omega := \lim_{\lambda(P)\to0}\sum_i\omega(x_i)(\xi_1,\ldots,\xi_k) = \lim_{\lambda(P)\to0}\sum_i(\varphi^*\omega)(t_i)(\tau_1,\ldots,\tau_k). \tag{13.7}$$

If we apply this definition to the $k$-form $f(t)\,dt^1\wedge\cdots\wedge dt^k$ on $I$ (when $\varphi$ is the identity mapping), we obviously find that

$$\int_I f(t)\,dt^1\wedge\cdots\wedge dt^k = \int_I f(t)\,dt^1\cdots dt^k. \tag{13.8}$$

It thus follows from (13.7) that

$$\int_{S=\varphi(I)}\omega = \int_I\varphi^*\omega, \tag{13.9}$$

and the last integral, as Eq. (13.8) shows, reduces to the ordinary multiple integral over the interval $I$ of the function $f$ corresponding to the form $\varphi^*\omega$.

We have derived the important relations (13.8) and (13.9) from Definition 1, but they themselves could have been adopted as the original definitions. In particular, if $D$ is an arbitrary domain in $\mathbb{R}^k$ (not necessarily an interval), then, so as not to repeat the summation procedure, we set

$$\int_D f(t)\,dt^1\wedge\cdots\wedge dt^k := \int_D f(t)\,dt^1\cdots dt^k, \tag{13.8'}$$

and for a smooth surface given in the form $\varphi: D\to S$ and a $k$-form $\omega$ on it we set

$$\int_{S=\varphi(D)}\omega := \int_D\varphi^*\omega. \tag{13.9'}$$

If $S$ is an arbitrary piecewise-smooth $k$-dimensional surface and $\omega$ is a $k$-form defined on the smooth pieces of $S$, then, representing $S$ as the union $\bigcup_i S_i$ of smooth parametrized surfaces that intersect only in sets of lower dimension, we set

$$\int_S\omega := \sum_i\int_{S_i}\omega. \tag{13.10}$$

In the absence of substantive physical or other problems that can be solved using (13.10), such a definition raises the question whether the value of the integral is independent of the partition $\bigcup_i S_i$ and of the choice of the parametrization of its pieces.

Let us verify that this definition is unambiguous.


Proof We begin by considering the simplest case in which $S$ is a domain $D_x$ in $\mathbb{R}^k$ and $\varphi: D_t\to D_x$ is a diffeomorphism of a domain $D_t\subset\mathbb{R}^k$ onto $D_x$. In $D_x=S$ the $k$-form $\omega$ has the form $f(x)\,dx^1\wedge\cdots\wedge dx^k$. Then, on the one hand (13.8) implies

$$\int_{D_x}f(x)\,dx^1\wedge\cdots\wedge dx^k = \int_{D_x}f(x)\,dx^1\cdots dx^k.$$

On the other hand, by (13.9') and (13.8'),

$$\int_{D_x}\omega := \int_{D_t}\varphi^*\omega = \int_{D_t}f(\varphi(t))\det\varphi'(t)\,dt^1\cdots dt^k.$$

But if $\det\varphi'(t)>0$ in $D_t$, then by the theorem on change of variable in a multiple integral we have

$$\int_{D_x=\varphi(D_t)}f(x)\,dx^1\cdots dx^k = \int_{D_t}f(\varphi(t))\det\varphi'(t)\,dt^1\cdots dt^k.$$

Hence, assuming that there were coordinates $x^1,\ldots,x^k$ in $S=D_x$ and curvilinear coordinates $t^1,\ldots,t^k$ of the same orientation class, we have shown that the value of the integral $\int_S\omega$ is the same, no matter which of these two coordinate systems is used to compute it.

We note that if the curvilinear coordinates $t^1,\ldots,t^k$ had defined the opposite orientation on $S$, that is, $\det\varphi'(t)<0$, the right- and left-hand sides of the last equality would have had opposite signs. Thus, one can say that the integral is well-defined only in the case of an oriented surface of integration.

Now let $\varphi_x: D_x\to S$ and $\varphi_t: D_t\to S$ be two parametrizations of the same smooth $k$-dimensional surface $S$ and $\omega$ a $k$-form on $S$. Let us compare the integrals

$$\int_{D_x}\varphi_x^*\omega\quad\text{and}\quad\int_{D_t}\varphi_t^*\omega. \tag{13.11}$$

Since $\varphi_t = \varphi_x\circ(\varphi_x^{-1}\circ\varphi_t) = \varphi_x\circ\varphi$, where $\varphi = \varphi_x^{-1}\circ\varphi_t: D_t\to D_x$ is a diffeomorphism of $D_t$ onto $D_x$, it follows that $\varphi_t^*\omega = \varphi^*(\varphi_x^*\omega)$ (see Eq. (12.30) of Sect. 12.5). Hence one can obtain the form $\varphi_t^*\omega$ in $D_t$ by the change of variable $x=\varphi(t)$ in the form $\varphi_x^*\omega$. But, as we have just verified, in this case the integrals (13.11) are equal if $\det\varphi'(t)>0$ and differ in sign if $\det\varphi'(t)<0$.

Thus it has been shown that if $\varphi_t: D_t\to S$ and $\varphi_x: D_x\to S$ are parametrizations of the surface $S$ belonging to the same orientation class, the integrals (13.11) are equal. The fact that the integral is independent of the choice of curvilinear coordinates on the surface $S$ has now been verified.

The fact that the integral (13.10) over an oriented piecewise-smooth surface $S$ is independent of the method of partitioning $\bigcup_i S_i$ into smooth pieces follows from the additivity of the ordinary multiple integral (it suffices to consider a finer partition obtained by superimposing two partitions and verify that the value of the integral over the finer partition equals the value over each of the two original partitions). □

On the basis of these considerations, it now makes sense to adopt the following 
chain of formal definitions corresponding to the construction of the integral of a 
form explained in Definition 1. 

Definition 1' (Integral of a form over an oriented surface $S\subset\mathbb{R}^n$.)

a) If the form $f(x)\,dx^1\wedge\cdots\wedge dx^k$ is defined in a domain $D\subset\mathbb{R}^k$, then

$$\int_D f(x)\,dx^1\wedge\cdots\wedge dx^k := \int_D f(x)\,dx^1\cdots dx^k.$$

b) If $S\subset\mathbb{R}^n$ is a smooth $k$-dimensional oriented surface, $\varphi: D\to S$ is a parametrization of it, and $\omega$ is a $k$-form on $S$, then

$$\int_S\omega := \pm\int_D\varphi^*\omega,$$

where the + sign is taken if the parametrization $\varphi$ agrees with the given orientation of $S$ and the − sign in the opposite case.

c) If $S$ is a piecewise-smooth $k$-dimensional oriented surface in $\mathbb{R}^n$ and $\omega$ is a $k$-form on $S$ (defined where $S$ has a tangent plane), then

$$\int_S\omega := \sum_i\int_{S_i}\omega,$$

where $S_1,\ldots,S_m,\ldots$ is a decomposition of $S$ into smooth parametrizable $k$-dimensional pieces intersecting at most in piecewise-smooth surfaces of smaller dimension.

We see in particular that changing the orientation of a surface leads to a change in the sign of the integral.

13.1.3 Problems and Exercises

1. a) Let $x,y$ be Cartesian coordinates on the plane $\mathbb{R}^2$. Exhibit the vector field whose work form is $\omega = -\dfrac{y}{x^2+y^2}\,dx+\dfrac{x}{x^2+y^2}\,dy$.

b) Find the integral of the form $\omega$ in a) along the following paths $\gamma_i$:

$$[0,\pi]\ni t\overset{\gamma_1}{\longmapsto}(\cos t,\sin t)\in\mathbb{R}^2;\qquad[0,\pi]\ni t\overset{\gamma_2}{\longmapsto}(\cos t,-\sin t)\in\mathbb{R}^2;$$

$\gamma_3$ consists of a motion along the closed intervals joining the points $(1,0)$, $(1,1)$, $(-1,1)$, $(-1,0)$ in that order; $\gamma_4$ consists of a motion along the closed intervals joining $(1,0)$, $(1,-1)$, $(-1,-1)$, $(-1,0)$ in that order.

2. Let $f$ be a smooth function in the domain $D\subset\mathbb{R}^n$ and $\gamma$ a smooth path in $D$ with initial point $p_0\in D$ and terminal point $p_1\in D$. Find the integral of the form $\omega = df$ over $\gamma$.

3. a) Find the integral of the form $\omega = dy\wedge dz+dz\wedge dx$ over the boundary of the standard unit cube in $\mathbb{R}^3$ oriented by an outward-pointing normal.

b) Exhibit a velocity field for which the form $\omega$ in a) is the flux form.

4. a) Let $x,y,z$ be Cartesian coordinates in $\mathbb{R}^3$. Exhibit a velocity field for which the flux form is

$$\omega = \frac{x\,dy\wedge dz+y\,dz\wedge dx+z\,dx\wedge dy}{(x^2+y^2+z^2)^{3/2}}.$$

b) Find the integral of the form $\omega$ in a) over the sphere $x^2+y^2+z^2=R^2$ oriented by the outward normal.

c) Show that the flux of the field $\dfrac{(x,y,z)}{(x^2+y^2+z^2)^{3/2}}$ across the sphere $(x-2)^2+y^2+z^2=1$ is zero.

d) Verify that the flux of the field in c) across the torus whose parametric equations are given in Example 4 of Sect. 12.1 is also zero.

5. It is known that the pressure P, volume V. and temperature T of a given quantity 
of a substance are connected by an equation f{P, V, T) — 0, called the equation of 
state in thermodynamics. For example, for one mole of an ideal gas the equation 
of state is given by Clapeyron’s formula — R — 0, where R is the universal gas 
constant. 

Since P, V, T are connected by the equation of state, knowing any pair of them, 
one can theoretically determine the remaining one. Hence the state of any system 
can be characterized, for example, by points (V, P) of the plane R 2 with coordi- 
nates V, P . Then the evolution of the state of the system as a function of time will 
correspond to some path y in this plane. 

Suppose the gas is located in a cylinder in which a frictionless piston can move. 
By changing the position of the piston, we can change the state of the gas enclosed 
by the piston and the cylinder walls at the cost of doing mechanical work. Con- 
versely, by changing the state of the gas (heating it, for example) we can force the 
gas to do mechanical work (lifting a weight by expanding, for example). In this 
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Fig. 13.3 



problem and in Problems 6, 7, and 8 below, all processes are assumed to take place 
so slowly that the temperature and pressure are able to average out at each particular 
instant of time; thus at each instant of time the system satisfies the equation of state. 
These are the so-called quasi-static processes. 

a) Let $\gamma$ be a path in the VP-plane corresponding to a quasi-static transition of
the gas enclosed by the piston and the cylinder walls from state $(V_0, P_0)$ to $(V_1, P_1)$.
Show that the quantity A of mechanical work performed on this path is defined by
the line integral $A = \int_\gamma P\,dV$.

b) Find the mechanical work performed by one mole of an ideal gas in passing
from the state $(V_0, P_0)$ to state $(V_1, P_1)$ along each of the following paths (Fig. 13.3):
$\gamma_{OLI}$, consisting of the isobar OL ($P = P_0$) followed by the isochore LI ($V = V_1$);
$\gamma_{OKI}$, consisting of the isochore OK ($V = V_0$) followed by the isobar KI ($P = P_1$);
$\gamma_{OI}$, consisting of the isotherm T = const (assuming that $P_0V_0 = P_1V_1$).

c) Show that the formula obtained in a) for the mechanical work performed by 
the gas enclosed by the piston and the cylinder walls is actually general, that is, it 
remains valid for the work of a gas enclosed in any deformable container. 
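
The line integral of part a) can be checked numerically. The sketch below is only an illustration under assumptions that are not in the text: the states $(V_0, P_0)$, $(V_1, P_1)$ are hypothetical values in SI units chosen so that $P_0V_0 = P_1V_1$, and the helper name `work_along` is invented here. It approximates $\int_\gamma P\,dV$ by a trapezoid sum along each of the three paths of part b).

```python
import math

def work_along(path, n=100_000):
    """Trapezoid approximation of A = ∫_γ P dV for a path s -> (V, P), s in [0, 1]."""
    total = 0.0
    V_prev, P_prev = path(0.0)
    for i in range(1, n + 1):
        V, P = path(i / n)
        total += 0.5 * (P + P_prev) * (V - V_prev)
        V_prev, P_prev = V, P
    return total

# Hypothetical states with P0*V0 == P1*V1 (m^3 and Pa); not taken from the text.
V0, P0 = 0.010, 2.0e5
V1, P1 = 0.020, 1.0e5

def gamma_OLI(s):
    # isobar P = P0 for s in [0, 1/2], then isochore V = V1
    if s <= 0.5:
        return V0 + 2 * s * (V1 - V0), P0
    return V1, P0 + (2 * s - 1) * (P1 - P0)

def gamma_OKI(s):
    # isochore V = V0 for s in [0, 1/2], then isobar P = P1
    if s <= 0.5:
        return V0, P0 + 2 * s * (P1 - P0)
    return V0 + (2 * s - 1) * (V1 - V0), P1

def gamma_OI(s):
    # isotherm P V = P0 V0
    V = V0 + s * (V1 - V0)
    return V, P0 * V0 / V

A_OLI = work_along(gamma_OLI)  # closed form: P0 (V1 - V0)
A_OKI = work_along(gamma_OKI)  # closed form: P1 (V1 - V0)
A_OI = work_along(gamma_OI)    # closed form: P0 V0 ln(V1 / V0)
```

The three values differ although the endpoints coincide, which illustrates that the work depends on the path and not only on the initial and final states.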

6. The quantity of heat acquired by a system in some process of varying its states, 
like the mechanical work performed by the system (see Problem 5), depends not 
only on the initial and final states of the system, but also on the transition path. An 
important characteristic of a substance and the thermodynamic process performed 
by (or on) it is its heat capacity , the ratio of the heat acquired by the substance to 
the change in its temperature. A precise definition of heat capacity can be given as 
follows. Let x be a point in the plane of states F (with coordinates V, P or V, T
or P, T) and $e \in TF_x$ a vector indicating the direction of displacement from the
point x. Let t be a small parameter. Let us consider the displacement from the state
x to the state x + te along the closed interval in the plane F whose endpoints are
these states. Let $\Delta Q(x, te)$ be the heat acquired by the substance in this process and
$\Delta T(x, te)$ the change in the temperature of the substance.

The heat capacity C = C(x, e) of the substance (or system) corresponding to the
state x and the direction e of displacement from that state is

$$C(x, e) = \lim_{t \to 0} \frac{\Delta Q(x, te)}{\Delta T(x, te)}.$$


In particular, if the system is thermally insulated, its evolution takes place without
any exchange of heat with the surrounding medium. This is a so-called adiabatic
process. The curve in the plane of states F corresponding to such a process is called
an adiabatic. Hence, zero heat capacity of the system corresponds to displacement
from a given state x along an adiabatic.

Infinite heat capacity corresponds to displacement along an isotherm T = const.

The heat capacities at constant volume $C_V = C(x, e_V)$ and at constant pressure
$C_P = C(x, e_P)$, which correspond respectively to displacement along an isochore
V = const and an isobar P = const, are used particularly often. Experiment shows
that in a rather wide range of states of a given mass of substance, each of the
quantities $C_V$ and $C_P$ can be considered practically constant. The heat capacity
corresponding to one mole of a given substance is customarily called the molecular heat
capacity and is denoted (in contrast to the others) by upper case letters rather than
lower case. We shall assume that we are dealing with one mole of a substance.

Between the quantity $\Delta Q$ of heat acquired by the substance in the process, the
change $\Delta U$ in its internal energy, and the mechanical work $\Delta A$ it performs, the
law of conservation of energy provides the connection $\Delta Q = \Delta U + \Delta A$. Thus,
under a small displacement te from state $x \in F$ the heat acquired can be found as
the value of the form $\delta Q := dU + P\,dV$ at the point x on the vector $te \in TF_x$ (for
the formula P dV for the work see Problem 5c)). Hence if T and V are regarded
as the coordinates of the state and the displacement parameter (in a nonisothermal
direction) is taken as T, then we can write

$$C = \lim_{t \to 0} \frac{\Delta Q}{\Delta T} = \frac{\partial U}{\partial T} + \frac{\partial U}{\partial V} \cdot \frac{dV}{dT} + P\,\frac{dV}{dT}.$$


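The derivative $\frac{dV}{dT}$ determines the direction of displacement from the state $x \in F$
in the plane of states with coordinates T and V. In particular, if $\frac{dV}{dT} = 0$ then the
displacement is in the direction of the isochore V = const, and we find that
$C_V = \frac{\partial U}{\partial T}$. If P = const, then $\frac{dV}{dT} = \left(\frac{\partial V}{\partial T}\right)_{P = \mathrm{const}}$. (In the general case V = V(P, T) is the
equation of state f(P, V, T) = 0 solved for V.) Hence

$$C_P = \left(\frac{\partial U}{\partial T}\right)_V + \left(\left(\frac{\partial U}{\partial V}\right)_T + P\right)\left(\frac{\partial V}{\partial T}\right)_P,$$
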
where the subscripts P, V, and T on the right-hand side indicate the parameter 
of state that is fixed when the partial derivative is taken. Comparing the resulting 
expressions for Cy and Cp, we see that 


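$$C_P - C_V = \left(\left(\frac{\partial U}{\partial V}\right)_T + P\right)\left(\frac{\partial V}{\partial T}\right)_P.$$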


By experiments on gases (the Joule¹-Thomson experiments) it was established
and then postulated in the model of an ideal gas that its internal energy depends only
on the temperature, that is, $\left(\frac{\partial U}{\partial V}\right)_T = 0$. Thus for an ideal gas $C_P - C_V = P\left(\frac{\partial V}{\partial T}\right)_P$.
Taking account of the equation PV = RT for one mole of an ideal gas, we obtain the
relation $C_P - C_V = R$ from this, known as Mayer's equation² in thermodynamics.

¹ J.P. Joule (1818-1889) - British physicist who discovered the law of thermal action of a current
and also determined, independently of Mayer, the mechanical equivalent of heat.

The fact that the internal energy of a mole of gas depends only on temperature
makes it possible to write the form $\delta Q$ as

$$\delta Q = \frac{\partial U}{\partial T}\,dT + P\,dV = C_V\,dT + P\,dV.$$

To compute the quantity of heat acquired by a mole of gas when its state varies
over the path $\gamma$ one must consequently find the integral of the form $C_V\,dT + P\,dV$
over $\gamma$. It is sometimes convenient to have this form in terms of the variables V and
P. If we use the equation of state PV = RT and the relation $C_P - C_V = R$, we
obtain

$$\delta Q = C_P\,\frac{P}{R}\,dV + C_V\,\frac{V}{R}\,dP.$$


a) Write the formula for the quantity Q of heat acquired by a mole of gas as its
state varies along the path $\gamma$ in the plane of states F.

b) Assuming the quantities $C_P$ and $C_V$ are constant, find the quantity Q
corresponding to the paths $\gamma_{OLI}$, $\gamma_{OKI}$, and $\gamma_{OI}$ in Problem 5b).

c) Find (following Poisson) the equation of the adiabatic passing through the
point $(V_0, P_0)$ in the plane of states F with coordinates V and P. (Poisson found that
$PV^{C_P/C_V} = \mathrm{const}$ on an adiabatic. The quantity $C_P/C_V$ is the adiabatic constant
of the gas. For air $C_P/C_V \approx 1.4$.) Now compute the work one must do in order to
confine a thermally isolated mole of air in the state $(V_0, P_0)$ to the volume $V_1 = \frac{1}{2}V_0$.
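
The heat form in the variables V and P lends itself to a quick numerical experiment. The following sketch rests on assumptions that are not in the text: a diatomic ideal gas with $C_V = \frac{5}{2}R$ (so that $C_P/C_V = 1.4$), hypothetical initial values $V_0$, $P_0$, and an invented helper `heat_and_work`. It integrates $\delta Q = C_P\frac{P}{R}\,dV + C_V\frac{V}{R}\,dP$ along Poisson's adiabatic and compares the work of compression to $V_0/2$ with the closed form $P_0V_0\,(2^{\kappa-1}-1)/(\kappa-1)$, where $\kappa = C_P/C_V$.

```python
R = 8.314            # universal gas constant, J/(mol K)
C_V = 2.5 * R        # assumption: diatomic ideal gas
C_P = C_V + R        # Mayer's equation
kappa = C_P / C_V    # adiabatic constant, about 1.4

def heat_and_work(path, n=200_000):
    """Trapezoid sums of δQ = C_P (P/R) dV + C_V (V/R) dP and of P dV along path(s)."""
    Q = A = 0.0
    V_prev, P_prev = path(0.0)
    for i in range(1, n + 1):
        V, P = path(i / n)
        V_mid, P_mid = 0.5 * (V + V_prev), 0.5 * (P + P_prev)
        dV, dP = V - V_prev, P - P_prev
        Q += (C_P * P_mid * dV + C_V * V_mid * dP) / R
        A += P_mid * dV
        V_prev, P_prev = V, P
    return Q, A

V0, P0 = 0.024, 1.0e5   # hypothetical initial state (m^3, Pa)

def adiabat(s):
    # compression from V0 to V0/2 along P V^kappa = const
    V = V0 * (1.0 - 0.5 * s)
    return V, P0 * (V0 / V) ** kappa

Q_adiabat, A_adiabat = heat_and_work(adiabat)
work_on_gas = -A_adiabat   # the gas itself performs negative work when compressed
work_on_gas_closed_form = P0 * V0 * (2 ** (kappa - 1) - 1) / (kappa - 1)
```

Up to discretization error the heat exchanged along the adiabatic vanishes, as it should, and the two values for the work agree.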

7. We recall that a Carnot cycle³ of variation in the state of the working body of a
heat engine (for example, the gas under the piston in a cylinder) consists of the
following (Fig. 13.4). There are two energy-storing bodies, a heater and a cooler (for
example, a steam boiler and the atmosphere) maintained at constant temperatures $T_1$
and $T_2$ respectively ($T_1 > T_2$). The working body (gas) of this heat engine, having
temperature $T_1$ in State 1, is brought into contact with the heater, and by decreasing
the external pressure along an isotherm, expands quasi-statically and moves to
State 2. In the process the engine borrows a quantity of heat $Q_1$ from the heater and
performs mechanical work $A_{12}$ against the external pressure. In State 2 the gas is
thermally insulated and forced to expand quasi-statically to State 3, until its
temperature reaches $T_2$, the temperature of the cooler. In this process the engine also
performs a certain quantity of work $A_{23}$ against the external pressure. In State 3 the
gas is brought into contact with the cooler and compressed isothermally to State 4
by increasing the pressure. In this process work is done on the gas (the gas itself
performs negative work $A_{34}$), and the gas gives up a certain quantity of heat $Q_2$ to the
cooler. State 4 is chosen so that it is possible to return from it to State 1 by a
quasi-static compression along an adiabatic. Thus the gas is returned to State 1. In the
process it is necessary to perform some work on the gas (and the gas itself performs
negative work $A_{41}$). As a result of this cyclic process (a Carnot cycle) the internal
energy of the gas (the working body of the engine) obviously does not change (after
all, we have returned to the initial state). Therefore the work performed by the
engine is $A = A_{12} + A_{23} + A_{34} + A_{41} = Q_1 - Q_2$.

² J.R. Mayer (1814-1878) - German scholar, a physician by training; he stated the law of
conservation and transformation of energy and found the mechanical equivalent of heat.

³ N.L.S. Carnot (1796-1832) - French engineer, one of the founders of thermodynamics.

The heat $Q_1$ acquired from the heater went only partly to perform the work A.
It is natural to call the quantity $\eta = \frac{A}{Q_1} = \frac{Q_1 - Q_2}{Q_1}$ the efficiency of the heat engine
under consideration.

a) Using the results obtained in a) and c) of Problem 6, show that the equality
$\frac{Q_1}{T_1} = \frac{Q_2}{T_2}$ holds for a Carnot cycle.

b) Now prove the following theorem, the first of Carnot's two famous theorems.
The efficiency of a heat engine working along a Carnot cycle depends only on the
temperatures $T_1$ and $T_2$ of the heater and cooler. (It is independent of the structure
of the engine or the form of its working body.)
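
For an ideal gas the statements of Problem 7 can be tested numerically. The sketch below is an illustration under assumptions that are not in the text: one mole of a diatomic ideal gas ($C_V = \frac{5}{2}R$), hypothetical temperatures and volumes, and invented helper names. It integrates P dV over the four legs of the cycle with Simpson's rule; on the isotherms the internal energy of an ideal gas is constant, so the heat exchanged there equals the work.

```python
R = 8.314
C_V = 2.5 * R              # assumption: diatomic ideal gas
kappa = (C_V + R) / C_V    # adiabatic constant C_P / C_V

def simpson(f, a, b, n=20_000):
    """Composite Simpson rule for ∫_a^b f (n even); b < a gives the reversed sign."""
    h = (b - a) / n
    s = f(a) + f(b)
    for i in range(1, n):
        s += (4 if i % 2 else 2) * f(a + i * h)
    return s * h / 3

def isotherm(T):
    return lambda V: R * T / V

def adiabat(T_a, V_a):
    # adiabatic through the state (V_a, P_a) with P_a = R T_a / V_a: P V^kappa = const
    return lambda V: R * T_a * V_a ** (kappa - 1) / V ** kappa

def carnot_cycle(T1, T2, V1, V2):
    k = (T1 / T2) ** (1 / (kappa - 1))   # volume ratio along an adiabatic
    V3, V4 = V2 * k, V1 * k
    A12 = simpson(isotherm(T1), V1, V2)
    A23 = simpson(adiabat(T1, V2), V2, V3)
    A34 = simpson(isotherm(T2), V3, V4)
    A41 = simpson(adiabat(T2, V4), V4, V1)
    Q1, Q2 = A12, -A34                   # δQ = P dV on an isotherm of an ideal gas
    return Q1, Q2, A12 + A23 + A34 + A41

T1, T2 = 400.0, 300.0                    # hypothetical heater and cooler temperatures, K
Q1, Q2, A = carnot_cycle(T1, T2, V1=0.010, V2=0.020)
efficiency = A / Q1
```

With these values the efficiency comes out as $1 - T_2/T_1$, independently of the volumes chosen for States 1 and 2.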

8. Let $\gamma$ be a closed path in the plane of states F of the working body of an arbitrary
heat engine (see Problem 7) corresponding to a closed cycle of work performed by
it. The quantity of heat that the working body (a gas, for example) exchanges with
the surrounding medium and the temperature at which the heat exchange takes place
are connected by the Clausius inequality $\oint_\gamma \frac{\delta Q}{T} \le 0$. Here $\delta Q$ is the heat exchange
form mentioned in Problem 6.

a) Show that for a Carnot cycle (see Problem 7), the Clausius inequality becomes
equality.

b) Show that if the work cycle $\gamma$ can be run in reverse, then the Clausius
inequality becomes equality.

c) Let $\gamma_1$ and $\gamma_2$ be the parts of the path $\gamma$ on which the working body of a
heat engine acquires heat from without and imparts it to the surrounding medium
respectively. Let $T_1$ be the maximal temperature of the working body on $\gamma_1$ and $T_2$
its minimal temperature on $\gamma_2$. Finally, let $Q_1$ be the heat acquired on $\gamma_1$ and $Q_2$
the heat given up on $\gamma_2$. Based on Clausius' inequality, show that $\frac{Q_1}{T_1} \le \frac{Q_2}{T_2}$.

d) Obtain the estimate $\eta \le \frac{T_1 - T_2}{T_1}$ for the efficiency of any heat engine (see
Problem 7). This is Carnot's second theorem. (Estimate separately the efficiency of a
steam engine in which the maximal temperature of the steam is at most 150 °C, that
is, $T_1 = 423$ K, and the temperature of the cooler, the surrounding medium, is of
the order 20 °C, that is, $T_2 = 291$ K.)

e) Compare the results of Problems 7b) and 8d) and verify that a heat engine
working in a Carnot cycle has the maximum possible efficiency for given values of
$T_1$ and $T_2$.
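
The numerical estimate requested in part d) is a one-line computation. The helper name `carnot_bound` below is invented for this illustration; the temperatures are the ones quoted in the problem.

```python
def carnot_bound(T1, T2):
    """Upper bound (T1 - T2) / T1 for the efficiency; temperatures in kelvin."""
    if not T1 > T2 > 0:
        raise ValueError("expected T1 > T2 > 0")
    return (T1 - T2) / T1

# Steam engine of Problem 8d): heater at 423 K, cooler at 291 K.
eta_max = carnot_bound(423.0, 291.0)   # about 0.31
```

So under these assumptions no such steam engine can convert more than roughly 31% of the heat it takes in into work.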

9. The differential equation $\frac{dy}{dx} = \frac{f(x)}{g(y)}$ is said to have variables separable. It is
usually rewritten in the form g(y) dy = f(x) dx, in which "the variables are separated,"
then "solved" by equating the primitives $\int g(y)\,dy = \int f(x)\,dx$. Using the language
of differential forms, now give a detailed mathematical explanation for this
algorithm.
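
One way to see what the algorithm asserts: if G and F are primitives of g and f, then G(y) - F(x) should stay constant along every solution curve. The sketch below checks this numerically for a hypothetical example that is not from the text ($f(x) = \cos x$, $g(y) = 1 + y^2$), using a classical Runge-Kutta integrator; all names are invented for the illustration.

```python
import math

def f(x): return math.cos(x)
def g(y): return 1.0 + y * y          # g > 0, so dy/dx = f/g is defined everywhere
def F(x): return math.sin(x)          # a primitive of f
def G(y): return y + y ** 3 / 3.0     # a primitive of g

def rk4(x, y, x_end, n=1000):
    """Classical fourth-order Runge-Kutta for dy/dx = f(x) / g(y)."""
    rhs = lambda x, y: f(x) / g(y)
    h = (x_end - x) / n
    points = [(x, y)]
    for _ in range(n):
        k1 = rhs(x, y)
        k2 = rhs(x + h / 2, y + h * k1 / 2)
        k3 = rhs(x + h / 2, y + h * k2 / 2)
        k4 = rhs(x + h, y + h * k3)
        y += h * (k1 + 2 * k2 + 2 * k3 + k4) / 6
        x += h
        points.append((x, y))
    return points

trajectory = rk4(0.0, 0.0, 3.0)
c0 = G(0.0) - F(0.0)
# Largest deviation of G(y) - F(x) from its initial value along the solution curve.
drift = max(abs(G(y) - F(x) - c0) for x, y in trajectory)
```

In the language of forms, the computed curve is (approximately) an integral curve of the 1-form g(y) dy - f(x) dx = d(G(y) - F(x)), which is why the function G(y) - F(x) does not change along it.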


13.2 The Volume Element. Integrals of First and Second Kind 
13.2.1 The Mass of a Lamina 

Let S be a lamina in Euclidean space $\mathbb{R}^n$. Assume that we know the density $\rho(x)$
(per unit area) of the mass distribution on S. We ask how one can determine the total
mass of S.

In order to solve this problem it is necessary first of all to take account of the fact
that the surface density $\rho(x)$ is the limit of the ratio of the quantity $\Delta m$ of mass on
a portion of the surface in a neighborhood of x to the area $\Delta\sigma$ of that same portion
of the surface, as the neighborhood is contracted to x.

By breaking S into small pieces $S_i$ and assuming that $\rho$ is continuous on S, we
can find the mass of $S_i$, neglecting the variation of $\rho$ within each small piece, from
the relation

$$\Delta m_i \approx \rho(x_i)\,\Delta\sigma_i,$$

in which $\Delta\sigma_i$ is the area of the surface $S_i$ and $x_i \in S_i$.

Summing these approximate equalities and passing to the limit as the partition is
refined, we find that

$$m = \int_S \rho\,d\sigma. \qquad (13.12)$$
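
The passage from the sums $\sum_i \rho(x_i)\,\Delta\sigma_i$ to (13.12) can be imitated on a computer. The sketch below is an illustration under assumptions that are not in the text: S is the unit sphere in $\mathbb{R}^3$, partitioned by a latitude-longitude grid, and the area of each small piece is taken to be the familiar $\sin\theta\,\Delta\theta\,\Delta\varphi$. The function name `lamina_mass` is invented.

```python
import math

def lamina_mass(rho, n_theta=200, n_phi=200):
    """Riemann sum  Σ ρ(x_i) Δσ_i  over a grid partition of the unit sphere."""
    d_theta, d_phi = math.pi / n_theta, 2 * math.pi / n_phi
    mass = 0.0
    for i in range(n_theta):
        theta = (i + 0.5) * d_theta
        d_sigma = math.sin(theta) * d_theta * d_phi   # area of one grid piece
        for j in range(n_phi):
            phi = (j + 0.5) * d_phi
            x = (math.sin(theta) * math.cos(phi),
                 math.sin(theta) * math.sin(phi),
                 math.cos(theta))
            mass += rho(x) * d_sigma
    return mass

area = lamina_mass(lambda x: 1.0)         # unit density: the area 4π of the sphere
mass = lamina_mass(lambda x: x[2] ** 2)   # density ρ = z²: exact value 4π/3
```

Nothing in this computation refers to an orientation of the sphere, in agreement with the remark that the mass cannot depend on it.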


The symbol for integration over the surface S here obviously requires some
clarification so that computational formulas can be derived from it.

We note that the statement of the problem itself shows that the left-hand side
of Eq. (13.12) is independent of the orientation of S, so that the integral on the
right-hand side must have the same property. At first glance this appears to contrast
with the concept of an integral over a surface, which was discussed in detail in
Sect. 13.1. The answer to the question that thus arises is concealed in the definition
of the surface element $d\sigma$, to whose analysis we now turn.


13.2.2 The Area of a Surface as the Integral of a Form

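Comparing Definition 1 of Sect. 13.1 for the integral of a form with the construction
that led us to the definition of the area of a surface (Sect. 12.4), we see that the
area of a smooth k-dimensional surface S embedded in the Euclidean space $\mathbb{R}^n$ and
given parametrically by $\varphi : D \to S$, is the integral of a form $\Omega$, which we shall
provisionally call the volume element on the surface S. It follows from relation
(12.10) of Sect. 12.4 that $\Omega$ (more precisely $\varphi^*\Omega$) has the form

$$\omega = \sqrt{\det(g_{ij})}(t)\,dt^1 \wedge \cdots \wedge dt^k, \qquad (13.13)$$

in the curvilinear coordinates $\varphi : D \to S$ (that is, when transferred to the domain D).
Here $g_{ij}(t) = \left\langle \frac{\partial\varphi}{\partial t^i}, \frac{\partial\varphi}{\partial t^j} \right\rangle$, $i, j = 1, \ldots, k$.

To compute the area of S over a domain $\tilde D$ in a second parametrization
$\tilde\varphi : \tilde D \to S$, one must correspondingly integrate the form

$$\tilde\omega = \sqrt{\det(\tilde g_{ij})}(\tilde t)\,d\tilde t^1 \wedge \cdots \wedge d\tilde t^k, \qquad (13.14)$$

where $\tilde g_{ij}(\tilde t) = \left\langle \frac{\partial\tilde\varphi}{\partial \tilde t^i}, \frac{\partial\tilde\varphi}{\partial \tilde t^j} \right\rangle$, $i, j = 1, \ldots, k$.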

We denote by $\psi$ the diffeomorphism $\tilde\varphi^{-1} \circ \varphi : D \to \tilde D$ that provides the change
from t coordinates to $\tilde t$ coordinates on S. Earlier we have computed (see Remark 5
of Sect. 12.4) that

$$\sqrt{\det(g_{ij})}(t) = \sqrt{\det(\tilde g_{ij})}(\tilde t) \cdot |\det \psi'(t)|. \qquad (13.15)$$

At the same time, it is obvious that

$$\psi^*\tilde\omega = \sqrt{\det(\tilde g_{ij})}(\psi(t))\,\det\psi'(t)\,dt^1 \wedge \cdots \wedge dt^k. \qquad (13.16)$$

Comparing the equalities (13.13)-(13.16), we see that $\psi^*\tilde\omega = \omega$ if $\det\psi'(t) > 0$
and $\psi^*\tilde\omega = -\omega$ if $\det\psi'(t) < 0$. If the forms $\omega$ and $\tilde\omega$ were obtained from the same
form $\Omega$ on S through the transfers $\varphi^*$ and $\tilde\varphi^*$, then we must always have the equality
$\psi^*(\tilde\varphi^*\Omega) = \varphi^*\Omega$ or, what is the same, $\psi^*\tilde\omega = \omega$.

We thus conclude that the forms on the parametrized surface S that one must 
integrate in order to obtain the areas of the surface are different: they differ in sign 
if the parametrizations define different orientations on S ; these forms are equal for 
parametrizations that belong to the same orientation class for the surface S. 

Thus the volume element $\Omega$ on S must be determined not only by the surface S
embedded in $\mathbb{R}^n$, but also by the orientation of S.

This might appear paradoxical: in our intuition, the area of a surface should not
depend on its orientation!

But after all, we arrived at the definition of the area of a parametrized surface via
an integral, the integral of a certain form. Hence, if the result of our computations is 
to be independent of the orientation of the surface, it follows that we must integrate 
different forms when the orientation is different. 
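
Relation (13.15) and the sign rule that follows from (13.16) can be observed on a concrete surface. The example below is hypothetical and not from the text: the piece of the paraboloid $z = x^2 + y^2$ over the unit square, parametrized once as $\varphi(u, v) = (u, v, u^2 + v^2)$ and once as $\tilde\varphi(p, q) = (q^2, p, q^4 + p^2)$, so that $\psi = \tilde\varphi^{-1} \circ \varphi$ is $(u, v) \mapsto (v, \sqrt{u})$ with $\det\psi' = -\frac{1}{2\sqrt{u}} < 0$.

```python
import math

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def sqrt_gram(xi1, xi2):
    """sqrt(det(<xi_i, xi_j>)) for a pair of tangent vectors."""
    return math.sqrt(dot(xi1, xi1) * dot(xi2, xi2) - dot(xi1, xi2) ** 2)

def tangents_phi(u, v):
    # partial derivatives of phi(u, v) = (u, v, u^2 + v^2)
    return (1.0, 0.0, 2 * u), (0.0, 1.0, 2 * v)

def tangents_phi_tilde(p, q):
    # partial derivatives of phi~(p, q) = (q^2, p, q^4 + p^2)
    return (0.0, 1.0, 2 * p), (2 * q, 0.0, 4 * q ** 3)

def psi(u, v):
    # change of parameters psi = phi~^{-1} o phi
    return v, math.sqrt(u)

def det_dpsi(u, v):
    # Jacobian determinant of psi; negative, so psi reverses orientation
    return -1.0 / (2.0 * math.sqrt(u))

samples = [(0.1 * i, 0.1 * j) for i in range(1, 10) for j in range(1, 10)]
max_error = max(
    abs(sqrt_gram(*tangents_phi(u, v))
        - sqrt_gram(*tangents_phi_tilde(*psi(u, v))) * abs(det_dpsi(u, v)))
    for u, v in samples
)
```

The Gram factors agree only after multiplication by $|\det\psi'|$; since $\det\psi'$ itself is negative here, the pulled-back form $\psi^*\tilde\omega$ equals $-\omega$, which is the sign change discussed above.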

Let us now turn these considerations into precise definitions. 


13.2.3 The Volume Element 


Definition 1 If $\mathbb{R}^k$ is an oriented Euclidean space with inner product $\langle\,,\rangle$, the volume
element on $\mathbb{R}^k$ corresponding to a particular orientation and the inner product $\langle\,,\rangle$
is the skew-symmetric k-form that assumes the value 1 on an orthonormal frame of
the given orientation class.


The value of the k-form on the frame $e_1, \ldots, e_k$ obviously determines this form.
We remark also that the form $\Omega$ is determined not by an individual orthonormal
frame, but only by its orientation class.


Proof In fact, if $e_1, \ldots, e_k$ and $\tilde e_1, \ldots, \tilde e_k$ are two such frames in the same
orientation class, then the transition matrix O from the second basis to the first is an
orthogonal matrix with det O = 1. Hence

$$\Omega(e_1, \ldots, e_k) = \det O \cdot \Omega(\tilde e_1, \ldots, \tilde e_k) = \Omega(\tilde e_1, \ldots, \tilde e_k). \qquad \Box$$


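If an orthonormal basis $e_1, \ldots, e_k$ is fixed in $\mathbb{R}^k$ and $\pi^1, \ldots, \pi^k$ are the
projections of $\mathbb{R}^k$ on the corresponding coordinate axes, obviously
$\pi^1 \wedge \cdots \wedge \pi^k(e_1, \ldots, e_k) = 1$ and

$$\Omega = \pi^1 \wedge \cdots \wedge \pi^k.$$

Thus,

$$\Omega(\xi_1, \ldots, \xi_k) = \begin{vmatrix} \xi_1^1 & \cdots & \xi_1^k \\ \vdots & & \vdots \\ \xi_k^1 & \cdots & \xi_k^k \end{vmatrix}.$$

This is the oriented volume of the parallelepiped spanned by the ordered set of
vectors $\xi_1, \ldots, \xi_k$.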
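
As a small illustration of the determinant formula, the sketch below evaluates $\Omega$ on a frame in $\mathbb{R}^3$. The vectors are hypothetical and the function names are invented; the determinant is computed by cofactor expansion, which is adequate for small k. The sign changes when two vectors are interchanged, and the square of the value equals the Gram determinant $\det(\langle\xi_i, \xi_j\rangle)$.

```python
def det(rows):
    """Determinant by cofactor expansion along the first row (fine for small k)."""
    n = len(rows)
    if n == 1:
        return rows[0][0]
    total = 0.0
    for j in range(n):
        minor = [row[:j] + row[j + 1:] for row in rows[1:]]
        total += (-1) ** j * rows[0][j] * det(minor)
    return total

def volume_element(*vectors):
    """Ω(ξ_1, ..., ξ_k): determinant of the matrix whose rows are the coordinates of ξ_i."""
    return det([list(v) for v in vectors])

def gram_det(*vectors):
    """det(<ξ_i, ξ_j>), which equals Ω(ξ_1, ..., ξ_k) squared."""
    return det([[sum(a * b for a, b in zip(u, v)) for v in vectors] for u in vectors])

xi = ([1.0, 2.0, 0.0], [0.0, 1.0, 3.0], [2.0, 0.0, 1.0])   # hypothetical frame
omega = volume_element(*xi)                                 # oriented volume
omega_swapped = volume_element(xi[1], xi[0], xi[2])         # opposite orientation class
```

The unsigned volume $|\Omega|$ is the same for both orderings; only the sign records the orientation class of the frame.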


Definition 2 If the smooth k-dimensional oriented surface S is embedded in a
Euclidean space $\mathbb{R}^n$, then each tangent plane $TS_x$ to S has an orientation consistent
with the orientation of S and an inner product induced by the inner product in $\mathbb{R}^n$;
hence there is a volume element $\Omega(x)$. The k-form $\Omega$ that arises on S in this way is
the volume element on S induced by the embedding of S in $\mathbb{R}^n$.

Definition 3 The area of an oriented smooth surface is the integral over the surface
of the volume element corresponding to the orientation chosen for the surface.

This definition of area, stated in the language of forms and made precise, is of
course in agreement with Definition 1 of Sect. 12.4, which we arrived at by
consideration of a smooth k-dimensional surface $S \subset \mathbb{R}^n$ defined in parametric form.

Proof Indeed, the parametrization orients the surface and all its tangent planes $TS_x$.
If $\xi_1, \ldots, \xi_k$ is a frame of a fixed orientation class in $TS_x$, it follows from
Definitions 2 and 3 for the volume element $\Omega$ that $\Omega(x)(\xi_1, \ldots, \xi_k) > 0$. But then (see
Eq. (12.7) of Sect. 12.4)

$$\Omega(x)(\xi_1, \ldots, \xi_k) = \sqrt{\det(\langle \xi_i, \xi_j \rangle)}. \qquad (13.17)$$

□


We note that the form $\Omega(x)$ itself is defined on any set $\xi_1, \ldots, \xi_k$ of vectors in
$TS_x$, but Eq. (13.17) holds only on frames of a given orientation class in $TS_x$.

We further note that the volume element is defined only on an oriented surface, so
that it makes no sense, for example, to talk about the volume element on a Möbius
band in $\mathbb{R}^3$, although it does make sense to talk about the volume element of each
orientable piece of this surface.

Definition 4 Let S be a k-dimensional piecewise-smooth surface (orientable or not)
in $\mathbb{R}^n$, and $S_1, \ldots, S_m, \ldots$ a finite or countable number of smooth parametrized
pieces of it intersecting at most in surfaces of dimension not larger than k - 1 and
such that $S = \bigcup_i S_i$.

The area (or k-dimensional volume) of S is the sum of the areas of the
surfaces $S_i$.

In this sense we can speak of the area of a Möbius band in $\mathbb{R}^3$ or, what is the
same, try to find its mass if it is a material surface with matter having unit density.

The fact that Definition 4 is unambiguous (that the area obtained is independent
of the partition $S_1, \ldots, S_m, \ldots$ of the surface) can be verified by traditional
reasoning.


13.2.4 Expression of the Volume Element in Cartesian Coordinates 

Let S be a smooth hypersurface (of dimension n - 1) in an oriented Euclidean
space $\mathbb{R}^n$ endowed with a continuous field of unit normal vectors $\eta(x)$, $x \in S$, which
orients it. Let V be the n-dimensional volume element in $\mathbb{R}^n$ and $\Omega$ the (n - 1)-dimensional
volume element on S.

If we take a frame $\xi_1, \ldots, \xi_{n-1}$ in the tangent space $TS_x$ from the orientation
class determined by the unit normal $\eta(x)$ to $TS_x$, we can obviously write the
following equality:

$$V(x)(\eta, \xi_1, \ldots, \xi_{n-1}) = \Omega(x)(\xi_1, \ldots, \xi_{n-1}). \qquad (13.18)$$

Proof This fact follows from the fact that under the given hypotheses both sides
are nonnegative and equal in magnitude because the volume of the parallelepiped
spanned by $\eta, \xi_1, \ldots, \xi_{n-1}$ is the area of the base $\Omega(x)(\xi_1, \ldots, \xi_{n-1})$ multiplied
by the height $|\eta| = 1$. □

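But,

$$V(x)(\eta, \xi_1, \ldots, \xi_{n-1}) = \begin{vmatrix} \eta^1 & \cdots & \eta^n \\ \xi_1^1 & \cdots & \xi_1^n \\ \vdots & & \vdots \\ \xi_{n-1}^1 & \cdots & \xi_{n-1}^n \end{vmatrix} = \sum_{i=1}^n (-1)^{i-1}\eta^i(x)\,dx^1 \wedge \cdots \wedge \widehat{dx^i} \wedge \cdots \wedge dx^n(\xi_1, \ldots, \xi_{n-1}).$$

Here the variables $x^1, \ldots, x^n$ are Cartesian coordinates in the orthonormal
basis $e_1, \ldots, e_n$ that defines the orientation, and the frown over the differential $dx^i$
indicates that it is to be omitted.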

Thus we obtain the following coordinate expression for the volume element on
the oriented hypersurface $S \subset \mathbb{R}^n$:

$$\Omega = \sum_{i=1}^n (-1)^{i-1}\eta^i(x)\,dx^1 \wedge \cdots \wedge \widehat{dx^i} \wedge \cdots \wedge dx^n(\xi_1, \ldots, \xi_{n-1}). \qquad (13.19)$$

At this point it is worthwhile to remark that when the orientation of the surface
is reversed, the direction of the normal $\eta(x)$ reverses, that is, the form $\Omega$ is replaced
by the new form $-\Omega$.

It follows from the same geometric considerations that for a fixed value of
$i \in \{1, \ldots, n\}$

$$\langle \eta(x), e_i \rangle\,\Omega(\xi_1, \ldots, \xi_{n-1}) = V(x)(e_i, \xi_1, \ldots, \xi_{n-1}). \qquad (13.20)$$

This last equality means that

$$\eta^i(x)\,\Omega(x) = (-1)^{i-1}\,dx^1 \wedge \cdots \wedge \widehat{dx^i} \wedge \cdots \wedge dx^n(\xi_1, \ldots, \xi_{n-1}). \qquad (13.21)$$

For a two-dimensional surface S in $\mathbb{R}^n$ the volume element is most often denoted
$d\sigma$ or dS. These symbols should not be interpreted as the differentials of some forms
$\sigma$ and S; they are only symbols. If x, y, z are Cartesian coordinates on $\mathbb{R}^3$, then in
this notation relations (13.19) and (13.21) can be written as follows:

$$d\sigma = \cos\alpha_1\,dy \wedge dz + \cos\alpha_2\,dz \wedge dx + \cos\alpha_3\,dx \wedge dy,$$

$$\cos\alpha_1\,d\sigma = dy \wedge dz, \qquad \cos\alpha_2\,d\sigma = dz \wedge dx, \qquad \cos\alpha_3\,d\sigma = dx \wedge dy$$

(oriented areas of the projections on the coordinate planes).

Here $(\cos\alpha_1, \cos\alpha_2, \cos\alpha_3)(x)$ are the direction cosines (coordinates) of the unit
normal vector $\eta(x)$ to S at the point $x \in S$. In these equalities (as also in (13.19) and
(13.21)) it would of course have been more correct to place the restriction sign $|_S$ on
the right-hand side so as to avoid misunderstanding. But, in order not to make the
formulas cumbersome, we confine ourselves to this remark.
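
These relations are easy to test on a single pair of tangent vectors. In the sketch below (hypothetical vectors, invented function names) $d\sigma$ is evaluated by the Gram determinant as in (13.17), the unit normal is obtained from the vector product, and the values of $dy \wedge dz$, $dz \wedge dx$, $dx \wedge dy$ on the pair are compared with $\cos\alpha_i\,d\sigma$.

```python
import math

def wedge(i, j, a, b):
    """Value of dx^i ∧ dx^j on the ordered pair of vectors (a, b); indices 0, 1, 2."""
    return a[i] * b[j] - a[j] * b[i]

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def direction_cosine_check(a, b):
    # dσ(a, b) from the Gram determinant, as in (13.17)
    d_sigma = math.sqrt(dot(a, a) * dot(b, b) - dot(a, b) ** 2)
    # values of dy∧dz, dz∧dx, dx∧dy on (a, b): the coordinates of the vector product
    projections = (wedge(1, 2, a, b), wedge(2, 0, a, b), wedge(0, 1, a, b))
    length = math.sqrt(dot(projections, projections))
    cosines = tuple(p / length for p in projections)   # unit normal orienting (a, b)
    return d_sigma, projections, cosines

a, b = (1.0, 0.0, 2.0), (0.0, 1.0, -1.0)   # hypothetical tangent frame at a point of S
d_sigma, projections, cosines = direction_cosine_check(a, b)
combined = sum(c * p for c, p in zip(cosines, projections))   # Σ cos α_i · (projection)_i
```

Here `combined` reproduces `d_sigma`, and each `projections[i]` equals `cosines[i] * d_sigma`, which are the three relations for the oriented areas of the projections.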


13.2.5 Integrals of First and Second Kind 

Integrals of type (13.12) arise in a number of problems, a typical representative of 
which is the problem considered above of determining the mass of a surface whose 
density is known. These integrals are often called integrals over a surface or integrals 
of first kind. 

Definition 5 The integral of a function $\rho$ over an oriented surface S is the integral

$$\int_S \rho\,\Omega \qquad (13.22)$$

of the differential form $\rho\,\Omega$, where $\Omega$ is the volume element on S (corresponding to
the orientation of S chosen in the computation of the integral).

It is clear that the integral (13.22) so defined is independent of the orientation 
of S, since a reversal of the orientation is accompanied by a corresponding replace- 
ment of the volume element. 

We emphasize that it is not really a matter of integrating a function, but rather
integrating a form $\rho\,\Omega$ of special type over the surface S with the volume element
defined on it.

Definition 6 If S is a piecewise-smooth (orientable or non-orientable) surface and
$\rho$ is a function on S, then the integral (13.22) of $\rho$ over the surface S is the sum
$\sum_i \int_{S_i} \rho\,\Omega$ of the integrals of $\rho$ over the parametrized pieces $S_1, \ldots, S_m, \ldots$ of the
partition of S described in Definition 4.

The integral (13.22) is usually called a surface integral of first kind.

For example, the integral (13.12), which expresses the mass of the surface S in
terms of the density $\rho$ of the mass distribution over the surface, is such an integral.

To distinguish integrals of first kind, which are independent of the orientation of
the surface, we often refer to integrals of forms over an oriented surface as surface
integrals of second kind.

We remark that, since all skew-symmetric forms on a vector space whose degrees
are equal to the dimension of the space are multiples of one another, there is a
connection $\omega = \rho\,\Omega$ between any k-form $\omega$ defined on a k-dimensional orientable
surface S and the volume element $\Omega$ on S. Here $\rho$ is some function on S depending
on $\omega$. Hence

$$\int_S \omega = \int_S \rho\,\Omega.$$

That is, every integral of second kind can be written as a suitable integral of first
kind.

Example 1 The integral (13.2') of Sect. 13.1, which expresses the work on the path
$\gamma : [a, b] \to \mathbb{R}^n$, can be written as the integral of first kind

$$\int_\gamma \langle F, e \rangle\,ds, \qquad (13.23)$$

where s is arc length on $\gamma$, ds is the element of length (a 1-form), and e is a unit
velocity vector containing all the information about the orientation of $\gamma$. From the
point of view of the physical meaning of the problem solved by the integral (13.23),
it is just as informative as the integral (13.1) of Sect. 13.1.

Example 2 The flux (13.3) of Sect. 13.1 of the velocity field V across a surface
$S \subset \mathbb{R}^n$ oriented by unit normals n(x) can be written as the surface integral of first
kind

$$\int_S \langle V, n \rangle\,d\sigma. \qquad (13.24)$$

The information about the orientation of S here is contained in the direction of the
field of normals n.

The geometric and physical content of the integrand in (13.24) is just as transpar- 
ent as the corresponding meaning of the integrand in the final computational formula 
(13.6) of Sect. 13.1. 
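
The agreement between the two ways of writing a flux can be seen in a computation. The sketch below is an illustration under assumptions that are not in the text: S is the unit sphere in $\mathbb{R}^3$ with its outward normal, and the field is the hypothetical V = (0, 0, z). The first sum uses $\langle V, n \rangle\,d\sigma$ with $d\sigma$ taken from the Gram determinant; the second evaluates the determinant of V and the two tangent vectors of the parametrization, as one would for an integral of second kind.

```python
import math

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def cross(a, b):
    return (a[1] * b[2] - a[2] * b[1], a[2] * b[0] - a[0] * b[2], a[0] * b[1] - a[1] * b[0])

def sphere(theta, phi):
    """Point of the unit sphere and the two tangent vectors of the parametrization."""
    st, ct, sp, cp = math.sin(theta), math.cos(theta), math.sin(phi), math.cos(phi)
    x = (st * cp, st * sp, ct)
    x_theta = (ct * cp, ct * sp, -st)
    x_phi = (-st * sp, st * cp, 0.0)
    return x, x_theta, x_phi

def flux_two_ways(V, n=200):
    d_theta, d_phi = math.pi / n, 2 * math.pi / n
    first_kind = second_kind = 0.0
    for i in range(n):
        for j in range(n):
            x, xt, xp = sphere((i + 0.5) * d_theta, (j + 0.5) * d_phi)
            # <V, n> dσ with dσ = sqrt(det g) dθ dφ; on the unit sphere n(x) = x
            d_sigma = math.sqrt(dot(xt, xt) * dot(xp, xp) - dot(xt, xp) ** 2)
            first_kind += dot(V(x), x) * d_sigma * d_theta * d_phi
            # det(V, x_θ, x_φ) dθ dφ: the flux form evaluated on the tangent frame
            second_kind += dot(V(x), cross(xt, xp)) * d_theta * d_phi
    return first_kind, second_kind

flux_first, flux_second = flux_two_ways(lambda x: (0.0, 0.0, x[2]))   # exact value 4π/3
```

Interchanging θ and φ in the parametrization would flip the sign of the second sum, while the first would also change sign only because the normal n would have to be reversed along with the orientation.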

For the reader's information we note that quite frequently one encounters the
notation $d\mathbf{s} := e\,ds$ and $d\boldsymbol{\sigma} := n\,d\sigma$, which introduce a vector element of length and
a vector element of area. In this notation the integrals (13.23) and (13.24) have the
form

$$\int_\gamma \langle F, d\mathbf{s} \rangle \quad \text{and} \quad \int_S \langle V, d\boldsymbol{\sigma} \rangle,$$

which are very convenient from the point of view of physical interpretation. For
brevity the inner product $\langle A, B \rangle$ of the vectors A and B is often written A · B.

Example 3 Faraday's law⁴ asserts that the electromotive force arising in a closed
conductor $\Gamma$ in a variable magnetic field B is proportional to the rate of variation of
the flux of the magnetic field across a surface S bounded by $\Gamma$. Let E be the electric
field intensity. A precise statement of Faraday's law can be given as the equality

$$\oint_\Gamma E \cdot d\mathbf{s} = -\frac{\partial}{\partial t} \int_S B \cdot d\boldsymbol{\sigma}.$$

The circle in the integration sign over $\Gamma$ is an additional reminder that the integral
is being taken over a closed curve. The work of the field over a closed curve is
often called the circulation of the field along this curve. Thus by Faraday's law
the circulation of the electric field intensity generated in a closed conductor by a
variable magnetic field equals the rate of variation of the flux of the magnetic field
across a surface S bounded by $\Gamma$, taken with a suitable sign.

Example 4 Ampère's law⁵

$$\oint_\Gamma B \cdot d\mathbf{s} = \frac{1}{\varepsilon_0 c^2} \int_S j \cdot d\boldsymbol{\sigma}$$

(where B is the magnetic field intensity, j is the current density vector, and $\varepsilon_0$ and
c are dimensioning constants) asserts that the circulation of the intensity of a
magnetic field generated by an electric current along a contour $\Gamma$ is proportional to the
strength of the current flowing across the surface S bounded by the contour.

We have studied integrals of first and second kind. The reader might have no- 
ticed that this terminological distinction is very artificial. In reality we know how to 
integrate, and we do integrate, only differential forms. No integral is ever taken of 
anything else (if the integral is to claim independence of the choice of the coordinate
system used to compute it).


13.2.6 Problems and Exercises 

1. Give a formal proof of Eqs. (13.18) and (13.20). 

2. Let $\gamma$ be a smooth curve and ds the element of arc length on $\gamma$.

a) Show that

$$\left| \int_\gamma f(s)\,ds \right| \le \int_\gamma |f(s)|\,ds$$

for any function f on $\gamma$ for which both integrals are defined.

b) Verify that if $|f(s)| \le M$ on $\gamma$ and l is the length of $\gamma$, then

$$\left| \int_\gamma f(s)\,ds \right| \le Ml.$$

c) State and prove assertions analogous to a) and b) in the general case for an
integral of first kind taken over a k-dimensional smooth surface.

⁴ M. Faraday (1791-1867) - outstanding British physicist, creator of the concept of an
electromagnetic field.

⁵ A.M. Ampère (1775-1836) - French physicist and mathematician, one of the founders of modern
electrodynamics.

3. a) Show that the coordinates $(x_0^1, x_0^2, x_0^3)$ of the center of masses distributed with
linear density $\rho(x)$ along the curve $\gamma$ should be given by the relations

$$x_0^i \int_\gamma \rho(x)\,ds = \int_\gamma x^i \rho(x)\,ds, \qquad i = 1, 2, 3.$$

b) Write the equation of a helix in $\mathbb{R}^3$ and find the coordinates of the center of
mass of a piece of this curve, assuming that the mass is distributed along the curve
with constant density equal to 1.

c) Exhibit formulas for the center of masses distributed over a surface S with 
surface density p and find the center of masses that are uniformly distributed over 
the surface of a hemisphere. 

d) Exhibit the formulas for the moment of inertia of a mass distributed with 
density p over the surface S. 

e) The tire on a wheel has mass 30 kg and the shape of a torus of outer diameter 
1 m and inner diameter 0.5 m. When the wheel is being balanced, it is placed on 
a balancing lathe and rotated to a velocity corresponding to a speed of the order 
of 100 km/hr, then stopped by brake pads rubbing against a steel disk of diameter 
40 cm and width 2 cm. Estimate the temperature to which the disk would be heated 
if all the kinetic energy of the spinning tire went into heating the disk when the 
wheel was stopped. Assume that the heat capacity of steel is c = 420 J/(kg·K).
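
The relations of part a) translate directly into a numerical recipe. As an illustration for part b), the sketch below uses a hypothetical helix $(a\cos t, a\sin t, bt)$, $0 \le t \le \pi$, with unit density; the radius, pitch, and function names are assumptions made here. The integrals are approximated by the midpoint rule with $ds = |\gamma'(t)|\,dt$ and compared with the closed-form values $x_0 = 0$, $y_0 = 2a/\pi$, $z_0 = b\pi/2$ for this half turn.

```python
import math

def center_of_mass(curve, speed, rho, t0, t1, n=20_000):
    """Midpoint-rule approximation of x_0^i = (∫ x^i ρ ds) / (∫ ρ ds), with ds = |γ'(t)| dt."""
    h = (t1 - t0) / n
    mass = 0.0
    moments = [0.0, 0.0, 0.0]
    for k in range(n):
        t = t0 + (k + 0.5) * h
        point = curve(t)
        dm = rho(point) * speed(t) * h
        mass += dm
        for i, coordinate in enumerate(point):
            moments[i] += coordinate * dm
    return [m / mass for m in moments]

a, b = 1.0, 0.5   # hypothetical radius and pitch parameter of the helix

def helix(t):
    return (a * math.cos(t), a * math.sin(t), b * t)

def helix_speed(t):
    return math.hypot(a, b)   # |γ'(t)| is constant along a helix

x0, y0, z0 = center_of_mass(helix, helix_speed, lambda p: 1.0, 0.0, math.pi)
```

Replacing the constant density by another function `rho` of the point gives the center of mass for a nonuniform mass distribution along the same curve.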

4. a) Show that the gravitational force acting on a point mass $m_0$ located at
$(x_0, y_0, z_0)$ due to a material curve $\gamma$ having linear density $\rho$ is given by the
formula

$$F = G m_0 \int_\gamma \frac{\rho\,r}{|r|^3}\,ds,$$

where G is the gravitational constant and r is the vector with coordinates
$(x - x_0, y - y_0, z - z_0)$.

b) Write the corresponding formula in the case when the mass is distributed over 
a surface S. 

c) Find the gravitational field of a homogeneous material line. 

d) Find the gravitational field of a homogeneous material sphere. (Exhibit the 
field both outside the ball bounded by the sphere and inside the ball.) 

e) Find the gravitational field created in space by a homogeneous material ball 
(consider both exterior and interior points of the ball). 

f) Regarding the Earth as a liquid ball, find the pressure in it as a function of
the distance from the center. (The radius of the Earth is 6400 km, and its average
density is 6 g/cm³.)

5. Let $\gamma_1$ and $\gamma_2$ be two closed conductors along which currents $J_1$ and $J_2$
respectively are flowing. Let $d\mathbf{s}_1$ and $d\mathbf{s}_2$ be the vector elements of these conductors
corresponding to the directions of current in them. Let the vector $R_{12}$ be directed
from $d\mathbf{s}_1$ to $d\mathbf{s}_2$, and $R_{21} = -R_{12}$.

According to the Biot-Savart law⁶ the force $dF_{12}$ with which the first element
acts on the second is

$$dF_{12} = \frac{J_1 J_2}{c_0^2\,|R_{12}|^3}\,[d\mathbf{s}_2, [d\mathbf{s}_1, R_{12}]],$$

where the brackets denote the vector product of the vectors and $c_0$ is a dimensioning
constant.

a) Show that, on the level of an abstract differential form, it could happen that
$dF_{12} \ne -dF_{21}$ in the differential Biot-Savart formula, that is, "the reaction is not
equal and opposite to the action."

b) Write the (integral) formulas for the total forces $F_{12}$ and $F_{21}$ for the
interaction of the conductors $\gamma_1$ and $\gamma_2$ and show that $F_{12} = -F_{21}$.

6. The co-area formula (the Kronrod-Federer formula).

Let $M^m$ and $N^n$ be smooth surfaces of dimensions m and n respectively, embedded
in a Euclidean space of high dimension ($M^m$ and $N^n$ may also be abstract
Riemannian manifolds, but that is not important at the moment). Suppose that $m \ge n$.

Let $f : M^m \to N^n$ be a smooth mapping. When m > n, the mapping
$df(x) : T_xM^m \to T_{f(x)}N^n$ has a nontrivial kernel $\ker df(x)$. Let us denote by $T_x^\perp M^m$ the
orthogonal complement of $\ker df(x)$, and by J(f, x) the Jacobian of the mapping
$df(x)|_{T_x^\perp M^m} : T_x^\perp M^m \to T_{f(x)}N^n$. If m = n, then J(f, x) is the usual Jacobian.

Let $dv_k(p)$ denote the volume element on a k-dimensional surface at the point p.
We shall assume that $v_0(E) = \operatorname{card} E$, where $v_k(E)$ is the k-volume of E.

a) Using Fubini's theorem and the rank theorem (on the local canonical form
of a smooth mapping) if necessary, prove the following formula of Kronrod and
Federer:

$$\int_{M^m} J(f, x)\,dv_m(x) = \int_{N^n} v_{m-n}(f^{-1}(y))\,dv_n(y).$$

b) Show that if A is a measurable subset of $M^m$, then

$$\int_A J(f, x)\,dv_m(x) = \int_{N^n} v_{m-n}(A \cap f^{-1}(y))\,dv_n(y).$$

This is the general Kronrod-Federer formula.

c) Prove the following strengthening of Sard's theorem (which in its simplest
version asserts that the image of the set of critical points of a smooth mapping has
measure zero). (See Problem 8 of Sect. 11.5.)

Suppose as before that $f : M^m \to N^n$ is a smooth mapping and K is a compact
set in $M^m$ on which rank df(x) < n for all $x \in K$.

Then $\int_{N^n} v_{m-n}(K \cap f^{-1}(y))\,dv_n(y) = 0$. Use this result to obtain in addition the
simplest version of Sard's theorem stated above.


6 Biot (1774-1862), Savart (1791-1841) - French physicists. 
d) Verify that if $f : D \to \mathbb{R}$ and $u : D \to \mathbb{R}$ are smooth functions in a regular domain $D \subset \mathbb{R}^n$ and $u$ has no critical points in $D$, then



e) Let $V_f(t)$ be the measure (volume) of the set $\{x \in D \mid f(x) > t\}$, and let the function $f$ be nonnegative and bounded in the domain $D$.

Show that $\int_D f\,dv = -\int_0^{\infty} t\,dV_f(t) = \int_0^{\infty} V_f(t)\,dt$.

f) Let $\varphi \in C^{(1)}(\mathbb{R}, \mathbb{R}_+)$ and $\varphi(0) = 0$, while $f \in C^{(1)}(D, \mathbb{R})$ and $V_{|f|}(t)$ is the measure of the set $\{x \in D \mid |f(x)| > t\}$. Verify that $\int_D \varphi \circ f\,dv = \int_0^{\infty} \varphi'(t)\,V_{|f|}(t)\,dt$.
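The identity in part e) can be checked numerically before attempting a proof. The following is only a sketch under illustrative assumptions that are not part of the problem: the domain is $D = [0, 1]$, the function is $f(x) = x^2$, and the helper name `layer_cake` is hypothetical. It compares a direct Riemann sum for $\int_D f\,dv$ with a Riemann sum of the measures $V_f(t)$ of the superlevel sets.

```python
def layer_cake(f, n_x=1000, n_t=1000):
    """Compare the integral of f over [0, 1] with the integral of V_f(t) over t > 0.

    Assumes f is nonnegative and bounded on [0, 1] (an illustrative setting).
    Returns the pair (direct, layered).
    """
    # Midpoints of a uniform grid on D = [0, 1]; each cell has measure 1 / n_x.
    xs = [(k + 0.5) / n_x for k in range(n_x)]
    values = [f(x) for x in xs]

    # Direct midpoint-rule approximation of the integral of f over D.
    direct = sum(values) / n_x

    # V_f(t) = measure of {x : f(x) > t}, approximated by counting grid cells.
    t_max = max(values)
    dt = t_max / n_t
    layered = 0.0
    for j in range(n_t):
        t = (j + 0.5) * dt
        measure = sum(1 for v in values if v > t) / n_x
        layered += measure * dt
    return direct, layered


direct, layered = layer_cake(lambda x: x * x)
print(direct, layered)  # both should be close to 1/3
```

For this choice $V_f(t) = 1 - \sqrt{t}$ on $[0, 1]$, so both numbers approximate $1/3$; the agreement is only up to the discretization error of the two grids.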


13.3 The Fundamental Integral Formulas of Analysis 

The most important formula of analysis is the Newton-Leibniz formula (funda- 
mental theorem of calculus). In the present section we shall obtain the formulas of 
Green, Gauss-Ostrogradskii, and Stokes, which on the one hand are an extension of 
the Newton-Leibniz formula, and on the other hand, taken together, constitute the 
most-used part of the machinery of integral calculus. 

In the first three subsections of this section, without striving for generality in our 
statements, we shall obtain the three classical integral formulas of analysis using 
visualizable material. They will be reduced to one general Stokes formula in the 
fourth subsection, which can be read formally independently of the others. 


13.3.1 Green’s Theorem 

Green’s 7 theorem is the following. 

Proposition 1 Let $\mathbb{R}^2$ be the plane with a fixed coordinate grid $x, y$, and let $D$ be a compact domain in this plane bounded by piecewise-smooth curves. Let $P$ and $Q$ be smooth functions in the closed domain $D$. Then the following relation holds:

$$\iint_D \Bigl(\frac{\partial Q}{\partial x} - \frac{\partial P}{\partial y}\Bigr)\,dx\,dy = \int_{\partial D} P\,dx + Q\,dy, \tag{13.25}$$

in which the right-hand side contains the integral over the boundary $\partial D$ of the domain $D$ oriented consistently with the orientation of the domain $D$ itself.


7 G. Green (1793-1841) - British mathematician and mathematical physicist. Newton’s grave in 
Westminster Abbey is framed by five smaller gravestones with brilliant names: Faraday, Thomson 
(Lord Kelvin), Green, Maxwell, and Dirac. 
Fig. 13.5

We shall first consider the simplest version of (13.25) in which $D$ is the square $I = \{(x, y) \in \mathbb{R}^2 \mid 0 \le x \le 1,\ 0 \le y \le 1\}$ and $Q \equiv 0$ in $I$. Then Green’s theorem reduces to the equality

$$\iint_I \frac{\partial P}{\partial y}\,dx\,dy = -\int_{\partial I} P\,dx, \tag{13.26}$$

which we shall prove.


Proof Reducing the double integral to an iterated integral and applying the fundamental theorem of calculus, we obtain

$$\iint_I \frac{\partial P}{\partial y}\,dx\,dy = \int_0^1 dx \int_0^1 \frac{\partial P}{\partial y}\,dy = \int_0^1 \bigl(P(x, 1) - P(x, 0)\bigr)\,dx = -\int_0^1 P(x, 0)\,dx + \int_0^1 P(x, 1)\,dx.$$

The proof is now finished. What remains is a matter of definitions and interpretation of the relation just obtained. The point is that the difference of the last two integrals is precisely the right-hand side of relation (13.26).

Indeed, the piecewise-smooth curve $\partial I$ breaks into four pieces (Fig. 13.5), which can be regarded as parametrized curves

$$\gamma_1 : [0, 1] \to \mathbb{R}^2, \quad \text{where } x \mapsto (x, 0),$$
$$\gamma_2 : [0, 1] \to \mathbb{R}^2, \quad \text{where } y \mapsto (1, y),$$
$$\gamma_3 : [0, 1] \to \mathbb{R}^2, \quad \text{where } x \mapsto (x, 1),$$
$$\gamma_4 : [0, 1] \to \mathbb{R}^2, \quad \text{where } y \mapsto (0, y).$$


By definition of the integral of the 1-form $\omega = P\,dx$ over a curve,

$$\int_{\gamma_1} P(x, y)\,dx := \int_{[0,1]} \gamma_1^*\bigl(P(x, y)\,dx\bigr) := \int_0^1 P(x, 0)\,dx,$$
$$\int_{\gamma_2} P(x, y)\,dx := \int_{[0,1]} \gamma_2^*\bigl(P(x, y)\,dx\bigr) := \int_0^1 0\,dy = 0,$$
$$\int_{\gamma_3} P(x, y)\,dx := \int_{[0,1]} \gamma_3^*\bigl(P(x, y)\,dx\bigr) := \int_0^1 P(x, 1)\,dx,$$
$$\int_{\gamma_4} P(x, y)\,dx := \int_{[0,1]} \gamma_4^*\bigl(P(x, y)\,dx\bigr) := \int_0^1 0\,dy = 0,$$

and, in addition, by the choice of the orientation of the boundary of the domain, taking account of the orientations of $\gamma_1, \gamma_2, \gamma_3, \gamma_4$, it is obvious that

$$\int_{\partial I} \omega = \int_{\gamma_1} \omega + \int_{\gamma_2} \omega + \int_{-\gamma_3} \omega + \int_{-\gamma_4} \omega = \int_{\gamma_1} \omega + \int_{\gamma_2} \omega - \int_{\gamma_3} \omega - \int_{\gamma_4} \omega,$$

where $-\gamma_i$ is the curve $\gamma_i$ taken with the orientation opposite to the one defined by $\gamma_i$.

Thus Eq. (13.26) is now verified. □ 


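It can be verified similarly that

$$\iint_I \frac{\partial Q}{\partial x}\,dx\,dy = \int_{\partial I} Q\,dy. \tag{13.27}$$

Subtracting (13.26) from (13.27), we obtain Green’s formula

$$\iint_I \Bigl(\frac{\partial Q}{\partial x} - \frac{\partial P}{\partial y}\Bigr)\,dx\,dy = \int_{\partial I} P\,dx + Q\,dy \tag{13.25'}$$

for the square $I$.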
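Green’s formula for the unit square is easy to test numerically. The sketch below is an illustration only: the functions $P = xy^2$ and $Q = \sin x + xy$, the midpoint-rule helper `midpoint`, and the grid size are all assumptions chosen for the example, not part of the text. The boundary is traversed counterclockwise, so the bottom and right sides enter with a plus sign and the top and left sides with a minus sign.

```python
import math


def midpoint(f, a, b, n=400):
    """Midpoint-rule approximation of the integral of f over [a, b]."""
    h = (b - a) / n
    return h * sum(f(a + (i + 0.5) * h) for i in range(n))


# Illustrative choice of P and Q, with their partial derivatives written by hand.
P = lambda x, y: x * y**2
Q = lambda x, y: math.sin(x) + x * y
dQdx = lambda x, y: math.cos(x) + y   # partial derivative of Q with respect to x
dPdy = lambda x, y: 2 * x * y         # partial derivative of P with respect to y


def double_integral():
    """Integral of dQ/dx - dP/dy over the unit square, as an iterated integral."""
    inner = lambda x: midpoint(lambda y: dQdx(x, y) - dPdy(x, y), 0.0, 1.0)
    return midpoint(inner, 0.0, 1.0)


def boundary_integral():
    """Integral of P dx + Q dy over the counterclockwise boundary of the square."""
    bottom = midpoint(lambda x: P(x, 0.0), 0.0, 1.0)   # y = 0, x increasing
    right = midpoint(lambda y: Q(1.0, y), 0.0, 1.0)    # x = 1, y increasing
    top = -midpoint(lambda x: P(x, 1.0), 0.0, 1.0)     # y = 1, x decreasing
    left = -midpoint(lambda y: Q(0.0, y), 0.0, 1.0)    # x = 0, y decreasing
    return bottom + right + top + left


lhs = double_integral()
rhs = boundary_integral()
print(lhs, rhs)  # both should be close to sin(1)
```

For this particular pair both sides equal $\sin 1$ exactly, which gives an independent check on the quadrature.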

We remark that the asymmetry of P and Q in Green’s formula (13.25) and in 
Eqs. (13.26) and (13.27) comes from the asymmetry of x and y: after all, x and y 
are ordered, and it is that ordering that gives the orientation in R 2 and in I . 

In the language of forms, the relation (13.25') just proved can be rewritten as

$$\int_I d\omega = \int_{\partial I} \omega, \tag{13.25''}$$

where $\omega$ is an arbitrary smooth form on $I$. The integrand on the right-hand side here is the restriction of the form $\omega$ to the boundary $\partial I$ of the square $I$.

The proof of relation (13.26) just given admits an obvious generalization: If $D_y$ is not a square, but a “curvilinear quadrilateral” whose lateral sides are vertical closed intervals (possibly degenerating to a point) and whose other two sides are the graphs of piecewise-smooth functions $\varphi_1(x) \le \varphi_2(x)$ over the closed interval $[a, b]$ of the $x$-axis, then

$$\iint_{D_y} \frac{\partial P}{\partial y}\,dx\,dy = -\int_{\partial D_y} P\,dx. \tag{13.26'}$$

Fig. 13.6

Similarly, if there is such a “quadrilateral” $D_x$ with respect to the $y$-axis, that is, having two horizontal sides, then for it we have the equality

$$\iint_{D_x} \frac{\partial Q}{\partial x}\,dx\,dy = \int_{\partial D_x} Q\,dy. \tag{13.27'}$$
Now let us assume that the domain $D$ can be cut into a finite number of domains of type $D_y$ (Fig. 13.6). Then a formula of the form (13.26') also holds for that region $D$.

Proof In fact, by additivity, the double integral over the domain $D$ is the sum of the integrals over the pieces of type $D_y$ into which $D$ is divided. Formula (13.26') holds for each such piece, that is, the double integral over that piece equals, up to sign, the integral of $P\,dx$ over the oriented boundary of the piece. But adjacent pieces induce opposite orientations on their common boundary, so that when the integrals over the boundaries are added, all that remains after cancellation is the integral over the boundary $\partial D$ of the domain $D$ itself. □

Similarly, if $D$ admits a partition into domains of type $D_x$, an equality of type (13.27') holds for it.

We agree to call domains that can be cut both into pieces of type $D_x$ and into pieces of type $D_y$ simple domains. In fact, this class is sufficiently rich for all practical applications.

By writing both relations (13.26') and (13.27') for a simple domain, we obtain (13.25) by subtracting the first from the second.

Thus, Green’s theorem is proved for simple domains.

We shall not undertake any further sharpenings of Green’s formula at this point 
(on this account see Problem 2 below), but rather demonstrate a second, very fruitful 
line of reasoning that one may pursue after establishing Eqs. (13.25') and (13.25"). 

Suppose the domain $C$ has been obtained by a smooth mapping $\varphi : I \to C$ of the square $I$. If $\omega$ is a smooth 1-form on $C$, then

$$\int_C d\omega := \int_I \varphi^* d\omega = \int_I d\varphi^*\omega \overset{!}{=} \int_{\partial I} \varphi^*\omega =: \int_{\partial C} \omega. \tag{13.28}$$

The exclamation point here distinguishes the equality we have already proved (see (13.25'')); the extreme terms in these equalities are definitions or direct consequences of them; the remaining equality, the second from the left, results from the fact that exterior differentiation is independent of the coordinate system.

Hence Green’s formula also holds for the domain $C$.

Finally, if it is possible to cut an oriented domain $D$ into a finite number of domains $C_i$ of the same type as $C$, the considerations already described involving the mutual cancellation of the integrals over the portions of the boundaries of the $C_i$ inside $D$ imply that

$$\int_D d\omega = \sum_i \int_{C_i} d\omega = \sum_i \int_{\partial C_i} \omega = \int_{\partial D} \omega, \tag{13.29}$$

that is, Green’s formula also holds for $D$.

It can be shown that every domain with a piecewise-smooth boundary belongs 
to this last class of domains, but we shall not do so, since we shall describe below 
(Chap. 15) a useful technical device that makes it possible to avoid such geometric 
complications, replacing them by an analytic problem that is comparatively easy to 
solve. 

Let us consider some examples of the use of Green’s formula. 

Example 1 Let us set $P = -y$, $Q = x$ in (13.25). We then obtain

$$\int_{\partial D} -y\,dx + x\,dy = \iint_D 2\,dx\,dy = 2\sigma(D),$$

where $\sigma(D)$ is the area of $D$. Using Green’s formula one can thus obtain the following expression for the area of a domain on the plane in terms of line integrals over the oriented boundary of the domain:

$$\sigma(D) = \frac{1}{2}\int_{\partial D} -y\,dx + x\,dy = -\int_{\partial D} y\,dx = \int_{\partial D} x\,dy.$$

It follows in particular from this that the work $A = \int_{\gamma} P\,dV$ performed by a heat engine in changing the state of its working substance over a closed cycle $\gamma$ equals the area of the domain bounded by the curve $\gamma$ in the $PV$-plane of states (see Problem 5 of Sect. 13.1).
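When the boundary is a polygon, the line integral $\frac{1}{2}\int_{\partial D} -y\,dx + x\,dy$ can be evaluated edge by edge in closed form, which gives the familiar "shoelace" rule. The function name `polygon_area` and the sample polygons below are illustrative assumptions, not taken from the text.

```python
def polygon_area(vertices):
    """Signed area of a closed polygon, computed as (1/2) * integral of (-y dx + x dy).

    The result is positive when the vertices are listed counterclockwise
    and negative when they are listed clockwise.
    """
    twice_area = 0.0
    n = len(vertices)
    for k in range(n):
        x0, y0 = vertices[k]
        x1, y1 = vertices[(k + 1) % n]
        # Along the straight edge from (x0, y0) to (x1, y1),
        # the integral of (-y dx + x dy) equals x0*y1 - x1*y0.
        twice_area += x0 * y1 - x1 * y0
    return 0.5 * twice_area


unit_square = [(0, 0), (1, 0), (1, 1), (0, 1)]
triangle = [(0, 0), (4, 0), (0, 3)]

print(polygon_area(unit_square))     # 1.0
print(polygon_area(triangle))        # 6.0
print(polygon_area(triangle[::-1]))  # -6.0 (clockwise traversal)
```

The sign change under reversal of the vertex order reflects the dependence of the line integral on the orientation of the boundary.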


Example 2 Let $B = \{(x, y) \in \mathbb{R}^2 \mid x^2 + y^2 \le 1\}$ be the closed disk in the plane. We shall show that any smooth mapping $f : B \to B$ of the closed disk into itself has at least one fixed point (that is, a point $p \in B$ such that $f(p) = p$).

Proof Assume that the mapping $f$ has no fixed points. Then for every point $p \in B$ the ray with initial point $f(p)$ passing through the point $p$ and the point $\varphi(p) \in \partial B$ where this ray intersects the circle bounding $B$ are uniquely determined. Thus a mapping $\varphi : B \to \partial B$ would arise, and it is obvious that the restriction of this mapping to the boundary would be the identity mapping. Moreover, it would have the same smoothness as the mapping $f$ itself. We shall show that no such mapping $\varphi$ can exist.

In the domain $\mathbb{R}^2 \setminus 0$ (the plane with the origin omitted) let us consider the form $\omega = \frac{-y\,dx + x\,dy}{x^2 + y^2}$ that we encountered in Sect. 13.1. It can be verified immediately that $d\omega = 0$. Since $\partial B \subset \mathbb{R}^2 \setminus 0$, given the mapping $\varphi : B \to \partial B$, one could obtain a form $\varphi^*\omega$ on $B$, and $d\varphi^*\omega = \varphi^*(d\omega) = \varphi^* 0 = 0$. Hence by Green’s formula

$$\int_{\partial B} \varphi^*\omega = \int_B d\varphi^*\omega = 0.$$

But the restriction of $\varphi$ to $\partial B$ is the identity mapping, and so

$$\int_{\partial B} \varphi^*\omega = \int_{\partial B} \omega.$$

This last integral, as was verified in Example 1 of Sect. 13.1, is nonzero. This con- 
tradiction completes the proof of the assertion. □ 
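The nonvanishing of this boundary integral can be seen numerically. The sketch below assumes the form in question is $\frac{-y\,dx + x\,dy}{x^2 + y^2}$ (the angular form of Sect. 13.1) and approximates its integral over a closed curve by summing the small angles swept by the radius vector; the function name `winding_integral` and the two test curves are illustrative assumptions.

```python
import math


def winding_integral(curve, n=2000):
    """Approximate the integral of (-y dx + x dy) / (x^2 + y^2) over a closed curve.

    `curve` maps t in [0, 1] to a point (x, y) different from the origin,
    with curve(0) equal to curve(1). The curve is replaced by a polygon with
    n chords; each chord contributes the signed angle between the radius
    vectors of its endpoints.
    """
    total = 0.0
    x0, y0 = curve(0.0)
    for k in range(1, n + 1):
        x1, y1 = curve(k / n)
        # Signed angle from (x0, y0) to (x1, y1): atan2(cross product, dot product).
        total += math.atan2(x0 * y1 - x1 * y0, x0 * x1 + y0 * y1)
        x0, y0 = x1, y1
    return total


unit_circle = lambda t: (math.cos(2 * math.pi * t), math.sin(2 * math.pi * t))
shifted_circle = lambda t: (3 + math.cos(2 * math.pi * t), math.sin(2 * math.pi * t))

around_origin = winding_integral(unit_circle)
away_from_origin = winding_integral(shifted_circle)
print(around_origin, away_from_origin)  # close to 2*pi and to 0 respectively
```

The value $2\pi$ for the unit circle is the nonzero number used in the proof, while the circle that does not enclose the origin bounds a disk on which the form is closed, so Green’s formula forces the integral to vanish.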


This assertion is of course valid for a ball of any dimension (see Example 5 below). It also holds not only for smooth mappings, but for all continuous mappings $f : B \to B$. In this general form it is called the Brouwer fixed-point theorem.⁸


13.3.2 The Gauss-Ostrogradskii Formula 

Just as Green’s formula connects the integral over the boundary of a plane domain 
with a corresponding integral over the domain itself, the Gauss-Ostrogradskii for- 
mula given below connects the integral over the boundary of a three-dimensional 
domain with an integral over the domain itself. 

Proposition 2 Let $\mathbb{R}^3$ be three-dimensional space with a fixed coordinate system $x, y, z$ and $D$ a compact domain in $\mathbb{R}^3$ bounded by piecewise-smooth surfaces. Let $P$, $Q$, and $R$ be smooth functions in the closed domain $D$.

Then the following relation holds:

$$\iiint_D \Bigl(\frac{\partial P}{\partial x} + \frac{\partial Q}{\partial y} + \frac{\partial R}{\partial z}\Bigr)\,dx\,dy\,dz = \iint_{\partial D} P\,dy \wedge dz + Q\,dz \wedge dx + R\,dx \wedge dy. \tag{13.30}$$


8 L.E.J. Brouwer (1881-1966) - well-known Dutch mathematician. A number of fundamental theorems of topology are associated with his name, as well as an analysis of the foundations of mathematics that leads to the philosophico-mathematical concepts called intuitionism.
The Gauss-Ostrogradskii formula (13.30) can be derived by repeating the derivation of Green’s formula step by step with obvious modifications. So as not to do a verbatim repetition, let us begin by considering not a cube in $\mathbb{R}^3$, but the domain $D_z$ shown in Fig. 13.7, which is bounded by a lateral cylindrical surface $S$ with generator parallel to the $z$-axis and two caps $S_1$ and $S_2$ which are the graphs of piecewise-smooth functions $\varphi_1$ and $\varphi_2$ defined in the same domain $G \subset \mathbb{R}^2_{xy}$. We shall verify that the relation

$$\iiint_{D_z} \frac{\partial R}{\partial z}\,dx\,dy\,dz = \iint_{\partial D_z} R\,dx \wedge dy \tag{13.31}$$

holds for $D_z$.


Proof

$$\iiint_{D_z} \frac{\partial R}{\partial z}\,dx\,dy\,dz = \iint_G dx\,dy \int_{\varphi_1(x,y)}^{\varphi_2(x,y)} \frac{\partial R}{\partial z}\,dz =$$
$$= \iint_G \bigl(R(x, y, \varphi_2(x, y)) - R(x, y, \varphi_1(x, y))\bigr)\,dx\,dy =$$
$$= -\iint_G R(x, y, \varphi_1(x, y))\,dx\,dy + \iint_G R(x, y, \varphi_2(x, y))\,dx\,dy.$$

The surfaces $S_1$ and $S_2$ have the following parametrizations:

$$S_1 : (x, y) \mapsto (x, y, \varphi_1(x, y)),$$
$$S_2 : (x, y) \mapsto (x, y, \varphi_2(x, y)).$$

The curvilinear coordinates $(x, y)$ define the same orientation on $S_2$ that is induced by the orientation of the domain $D_z$, and the opposite orientation on $S_1$. Hence if $S_1$ and $S_2$ are regarded as pieces of the boundary of $D_z$ oriented as indicated in Proposition 2, these last two integrals can be interpreted as integrals of the form $R\,dx \wedge dy$ over $S_1$ and $S_2$.

The cylindrical surface $S$ has a parametric representation $(t, z) \mapsto (x(t), y(t), z)$, so that the restriction of the form $R\,dx \wedge dy$ to $S$ equals zero, and consequently its integral over $S$ is also zero.

Thus relation (13.31) does indeed hold for the domain D z . □ 


If the oriented domain $D$ can be cut into a finite number of domains of the type $D_z$, then, since adjacent pieces induce opposite orientations on their common boundary, the integrals over these common portions of the boundaries cancel out, leaving only the integral over the boundary $\partial D$.

Consequently, formula (13.31) also holds for domains that admit this kind of partition into domains of type $D_z$.

Similarly, one can introduce domains $D_y$ and $D_x$ whose cylindrical surfaces have generators parallel to the $y$-axis or $x$-axis respectively and show that if a domain $D$ can be divided into domains of type $D_y$ or $D_x$, then the respective relations

$$\iiint_D \frac{\partial Q}{\partial y}\,dx\,dy\,dz = \iint_{\partial D} Q\,dz \wedge dx, \tag{13.32}$$

$$\iiint_D \frac{\partial P}{\partial x}\,dx\,dy\,dz = \iint_{\partial D} P\,dy \wedge dz \tag{13.33}$$

hold for it.

Thus, if D is a simple domain, that is, a domain that admits each of the three 
types of partitions just described into domains of types D x , D y , and D z , then, by 
adding (13.31), (13.32), and (13.33), we obtain (13.30) for D. 

For the reasons given in the derivation of Green’s theorem, we shall not undertake 
the description of the conditions for a domain to be simple or any further sharpening 
of what has been proved (in this connection see Problem 8 below or Example 12 in 
Sect. 17.5). 

We note, however, that in the language of forms, the Gauss-Ostrogradskii formula can be written in coordinate-free form as follows:

$$\int_D d\omega = \int_{\partial D} \omega, \tag{13.30'}$$

where $\omega$ is a smooth 2-form in $D$.

Since formula (13.30') holds for the cube $I = I^3 = \{(x, y, z) \in \mathbb{R}^3 \mid 0 \le x \le 1,\ 0 \le y \le 1,\ 0 \le z \le 1\}$, as we have shown, its extension to more general classes of domains can of course be carried out using the standard computations (13.28) and (13.29).

Example 3 (The law of Archimedes) Let us compute the buoyant force of a homogeneous liquid on a body $D$ immersed in it. We choose the Cartesian coordinates $x, y, z$ in $\mathbb{R}^3$ so that the $xy$-plane is the surface of the liquid and the $z$-axis is directed out of the liquid. A force $\rho g z\,\mathbf{n}\,d\sigma$ is acting on an element $d\sigma$ of the surface $S$ of $D$ located at depth $z$, where $\rho$ is the density of the liquid, $g$ is the acceleration of gravity, and $\mathbf{n}$ is a unit outward normal to the surface at the point of the surface corresponding to $d\sigma$. Hence the resultant force can be expressed by the integral

$$\mathbf{F} = \iint_S \rho g z\,\mathbf{n}\,d\sigma.$$

If $\mathbf{n} = \mathbf{e}_x \cos\alpha_x + \mathbf{e}_y \cos\alpha_y + \mathbf{e}_z \cos\alpha_z$, then $\mathbf{n}\,d\sigma = \mathbf{e}_x\,dy \wedge dz + \mathbf{e}_y\,dz \wedge dx + \mathbf{e}_z\,dx \wedge dy$ (see Sect. 13.2.4). Using the Gauss-Ostrogradskii formula (13.30), we thus find that

$$\mathbf{F} = \mathbf{e}_x \rho g \iint_S z\,dy \wedge dz + \mathbf{e}_y \rho g \iint_S z\,dz \wedge dx + \mathbf{e}_z \rho g \iint_S z\,dx \wedge dy =$$
$$= \mathbf{e}_x \rho g \iiint_D 0\,dx\,dy\,dz + \mathbf{e}_y \rho g \iiint_D 0\,dx\,dy\,dz + \mathbf{e}_z \rho g \iiint_D dx\,dy\,dz = \rho g V \mathbf{e}_z,$$

where $V$ is the volume of the body $D$. Hence $P = \rho g V$ is the weight of a volume of the liquid equal to the volume occupied by the body. We have arrived at Archimedes’ law: $\mathbf{F} = P\mathbf{e}_z$.

Example 4 Using the Gauss-Ostrogradskii formula (13.30), one can give the following formulas for the volume $V(D)$ of a body $D$ bounded by a surface $\partial D$:

$$V(D) = \frac{1}{3}\iint_{\partial D} x\,dy \wedge dz + y\,dz \wedge dx + z\,dx \wedge dy = \iint_{\partial D} x\,dy \wedge dz = \iint_{\partial D} y\,dz \wedge dx = \iint_{\partial D} z\,dx \wedge dy.$$
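For a body bounded by flat triangular faces, the first of these volume formulas reduces to a finite sum. On a triangle with vertices $a, b, c$, listed so that $(b - a) \times (c - a)$ points along the outward normal, the integrand $x\,dy \wedge dz + y\,dz \wedge dx + z\,dx \wedge dy$ integrates to $\frac{1}{2}\,a \cdot (b \times c)$, so the volume is $\frac{1}{6}\sum a \cdot (b \times c)$ over the faces. The sketch below applies this to a tetrahedron; the function names and the sample data are illustrative assumptions.

```python
def triple(a, b, c):
    """Scalar triple product a . (b x c) of three vectors in R^3."""
    return (a[0] * (b[1] * c[2] - b[2] * c[1])
            + a[1] * (b[2] * c[0] - b[0] * c[2])
            + a[2] * (b[0] * c[1] - b[1] * c[0]))


def mesh_volume(triangles):
    """Volume enclosed by a closed triangulated surface with outward-oriented faces.

    Each face is a triple of vertices (a, b, c) ordered so that
    (b - a) x (c - a) points out of the body.
    """
    return sum(triple(a, b, c) for a, b, c in triangles) / 6.0


def shift(point, s):
    """Translate a point by the vector s."""
    return tuple(p + q for p, q in zip(point, s))


# Tetrahedron with vertices O, A, B, C, translated by an arbitrary vector
# to make the check less trivial; its volume is 1/6 regardless of the shift.
s = (2, 3, 5)
O, A, B, C = (shift(v, s) for v in [(0, 0, 0), (1, 0, 0), (0, 1, 0), (0, 0, 1)])
faces = [
    (A, B, C),  # slanted face, outward normal along (1, 1, 1)
    (O, B, A),  # face in the plane z = const, outward normal along -z
    (O, A, C),  # face in the plane y = const, outward normal along -y
    (O, C, B),  # face in the plane x = const, outward normal along -x
]

volume = mesh_volume(faces)
print(volume)  # close to 1/6
```

Reversing the orientation of every face changes the sign of the result, in agreement with the role of the orientation of $\partial D$ in formula (13.30).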


13.3.3 Stokes’ Formula in R 3 


Proposition 3 Let $S$ be an oriented piecewise-smooth compact two-dimensional surface with boundary $\partial S$ embedded in a domain $G \subset \mathbb{R}^3$, in which a smooth 1-form $\omega = P\,dx + Q\,dy + R\,dz$ is defined. Then the following relation holds:

$$\int_{\partial S} P\,dx + Q\,dy + R\,dz = \iint_S \Bigl(\frac{\partial R}{\partial y} - \frac{\partial Q}{\partial z}\Bigr)\,dy \wedge dz + \Bigl(\frac{\partial P}{\partial z} - \frac{\partial R}{\partial x}\Bigr)\,dz \wedge dx + \Bigl(\frac{\partial Q}{\partial x} - \frac{\partial P}{\partial y}\Bigr)\,dx \wedge dy, \tag{13.34}$$

where the orientation of the boundary $\partial S$ is chosen consistently with the orientation of the surface $S$.

In other notation, this means that

$$\int_S d\omega = \int_{\partial S} \omega. \tag{13.34'}$$

Proof If $C$ is a standard parametrized surface $\varphi : I \to C$ in $\mathbb{R}^3$, where $I$ is a square in $\mathbb{R}^2$, relation (13.34) follows from Eqs. (13.28) taking account of what has been proved for the square and Green’s formula.

If the orientable surface $S$ can be cut into elementary surfaces of this type, then relation (13.34) is also valid for it, as follows from Eqs. (13.29) with $D$ replaced by $S$. □

As in the preceding cases, we shall not prove at this point that, for example, a 
piecewise-smooth surface admits such a partition. 

Let us show what this proof of formula (13.34) would look like in coordinate notation. To avoid expressions that are really too cumbersome, we shall write out only the first, main part of its two expressions, and with some simplifications even in that. To be specific, let us introduce the notation $x^1, x^2, x^3$ for the coordinates of a point $x \in \mathbb{R}^3$ and verify only that

$$\int_{\partial S} P(x)\,dx^1 = \iint_S \frac{\partial P}{\partial x^2}\,dx^2 \wedge dx^1 + \frac{\partial P}{\partial x^3}\,dx^3 \wedge dx^1,$$

since the other two terms on the left-hand side of (13.34) can be studied similarly. For simplicity we shall assume that $S$ can be obtained by a smooth mapping $x = x(t)$ of a domain $D$ in the plane $\mathbb{R}^2$ of the variables $t^1, t^2$ and bounded by a smooth curve $\gamma = \partial D$ parametrized via a mapping $t = t(\tau)$ by the points of the closed interval $\alpha \le \tau \le \beta$ (Fig. 13.8). Then the boundary $\Gamma = \partial S$ of the surface $S$ can be written as $x = x(t(\tau))$, where $\tau$ ranges over the closed interval $[\alpha, \beta]$. Using the definition of the integral over a curve, Green’s formula for a plane domain $D$, and the definition
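of the integral over a parametrized surface, we find successively

$$\int_{\Gamma} P(x)\,dx^1 := \int_{\alpha}^{\beta} P\bigl(x(t(\tau))\bigr)\Bigl(\frac{\partial x^1}{\partial t^1}\frac{dt^1}{d\tau} + \frac{\partial x^1}{\partial t^2}\frac{dt^2}{d\tau}\Bigr)\,d\tau =$$
$$= \int_{\gamma} \Bigl(P(x(t))\frac{\partial x^1}{\partial t^1}\Bigr)\,dt^1 + \Bigl(P(x(t))\frac{\partial x^1}{\partial t^2}\Bigr)\,dt^2 \overset{!}{=}$$
$$\overset{!}{=} \iint_D \Bigl[\frac{\partial}{\partial t^1}\Bigl(P\frac{\partial x^1}{\partial t^2}\Bigr) - \frac{\partial}{\partial t^2}\Bigl(P\frac{\partial x^1}{\partial t^1}\Bigr)\Bigr]\,dt^1 \wedge dt^2 =$$
$$= \iint_D \Bigl(\frac{\partial P}{\partial t^1}\frac{\partial x^1}{\partial t^2} - \frac{\partial P}{\partial t^2}\frac{\partial x^1}{\partial t^1}\Bigr)\,dt^1 \wedge dt^2 =$$
$$= \iint_D \sum_{i=1}^{3} \Bigl(\frac{\partial P}{\partial x^i}\frac{\partial x^i}{\partial t^1}\frac{\partial x^1}{\partial t^2} - \frac{\partial P}{\partial x^i}\frac{\partial x^i}{\partial t^2}\frac{\partial x^1}{\partial t^1}\Bigr)\,dt^1 \wedge dt^2 =$$
$$= \iint_D \Bigl[\Bigl(\frac{\partial P}{\partial x^2}\frac{\partial x^2}{\partial t^1} + \frac{\partial P}{\partial x^3}\frac{\partial x^3}{\partial t^1}\Bigr)\frac{\partial x^1}{\partial t^2} - \Bigl(\frac{\partial P}{\partial x^2}\frac{\partial x^2}{\partial t^2} + \frac{\partial P}{\partial x^3}\frac{\partial x^3}{\partial t^2}\Bigr)\frac{\partial x^1}{\partial t^1}\Bigr]\,dt^1 \wedge dt^2 =$$
$$= \iint_D \left(\frac{\partial P}{\partial x^2}\begin{vmatrix} \dfrac{\partial x^2}{\partial t^1} & \dfrac{\partial x^2}{\partial t^2} \\[2mm] \dfrac{\partial x^1}{\partial t^1} & \dfrac{\partial x^1}{\partial t^2} \end{vmatrix} + \frac{\partial P}{\partial x^3}\begin{vmatrix} \dfrac{\partial x^3}{\partial t^1} & \dfrac{\partial x^3}{\partial t^2} \\[2mm] \dfrac{\partial x^1}{\partial t^1} & \dfrac{\partial x^1}{\partial t^2} \end{vmatrix}\right)\,dt^1 \wedge dt^2 =:$$
$$=: \iint_S \frac{\partial P}{\partial x^2}\,dx^2 \wedge dx^1 + \frac{\partial P}{\partial x^3}\,dx^3 \wedge dx^1.$$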

The colon here denotes equality by definition, and the exclamation point denotes 
a transition that uses the Green’s formula already proved. The rest consists of iden- 
tities. 

Using the basic idea of the proof of formula (13.34'), we have thus verified directly (without invoking the relation $\varphi^* d = d\varphi^*$, but essentially proving it for the case under consideration) that formula (13.34) does indeed hold for a simple parametrized surface. We have carried out the reasoning formally only for the term $P\,dx$, but it is clear that the same thing could also be done for the other two terms in the 1-form in the integrand on the left-hand side of (13.34).
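The coordinate computation for the term $P\,dx^1$ can be mirrored numerically: both sides are pulled back to the parameter square and compared. Everything specific in the sketch below is an assumption made for illustration: the parametrization $x(t) = (t^1 + (t^2)^2,\ t^2,\ t^1 t^2)$ of a surface over the unit square, the function $P = x^2 + x^3$, the helper name `stokes_sides`, and the use of central differences for the partial derivatives $\partial x^i/\partial t^j$.

```python
def stokes_sides(x, P, grad_P, n=100, h=1e-5):
    """Return (boundary, surface) for the P dx^1 term of Stokes' formula.

    x(t1, t2) parametrizes the surface over the unit square, P is a function
    of (x1, x2, x3), and grad_P returns its three partial derivatives.
    """
    def d(i, j, t1, t2):
        # Central-difference approximation of d x^i / d t^j (0-based i and j).
        if j == 0:
            return (x(t1 + h, t2)[i] - x(t1 - h, t2)[i]) / (2 * h)
        return (x(t1, t2 + h)[i] - x(t1, t2 - h)[i]) / (2 * h)

    step = 1.0 / n
    mids = [(k + 0.5) * step for k in range(n)]

    # Surface side: dx^2 ^ dx^1 and dx^3 ^ dx^1 pulled back to dt^1 ^ dt^2.
    surface = 0.0
    for t1 in mids:
        for t2 in mids:
            g = grad_P(*x(t1, t2))
            det21 = d(1, 0, t1, t2) * d(0, 1, t1, t2) - d(1, 1, t1, t2) * d(0, 0, t1, t2)
            det31 = d(2, 0, t1, t2) * d(0, 1, t1, t2) - d(2, 1, t1, t2) * d(0, 0, t1, t2)
            surface += (g[1] * det21 + g[2] * det31) * step * step

    # Boundary side: P(x(t)) dx^1 around the square, counterclockwise.
    boundary = 0.0
    for s in mids:
        boundary += P(*x(s, 0.0)) * d(0, 0, s, 0.0) * step  # bottom, t^1 increasing
        boundary += P(*x(1.0, s)) * d(0, 1, 1.0, s) * step  # right, t^2 increasing
        boundary -= P(*x(s, 1.0)) * d(0, 0, s, 1.0) * step  # top, t^1 decreasing
        boundary -= P(*x(0.0, s)) * d(0, 1, 0.0, s) * step  # left, t^2 decreasing
    return boundary, surface


# Illustrative surface and function (assumptions, not from the text).
x = lambda t1, t2: (t1 + t2**2, t2, t1 * t2)
P = lambda x1, x2, x3: x2 + x3
grad_P = lambda x1, x2, x3: (0.0, 1.0, 1.0)

boundary, surface = stokes_sides(x, P, grad_P)
print(boundary, surface)  # both should be close to -5/6
```

For this example a hand computation gives $-5/6$ on both sides, so the two printed numbers should agree with each other and with that value up to quadrature error.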


13.3.4 The General Stokes Formula 

Despite the differences in the external appearance of formulas (13.25), (13.30), and 
(13.34), their coordinate-free expressions (13.25"), (13.29), (13.30'), and (13.34') 
turn out to be identical. This gives grounds for supposing that we have been dealing 
with particular manifestations of a general rule, which one can now easily guess. 
Proposition 4 Let $S$ be an oriented piecewise smooth $k$-dimensional compact surface with boundary $\partial S$ in the domain $G \subset \mathbb{R}^n$, in which a smooth $(k-1)$-form $\omega$ is defined.

Then the following relation holds:

$$\int_S d\omega = \int_{\partial S} \omega, \tag{13.35}$$

in which the orientation of the boundary $\partial S$ is that induced by the orientation of $S$.


Proof Formula (13.35) can obviously be proved by the same general computations (13.28) and (13.29) as Stokes’ formula (13.34') provided it holds for a standard $k$-dimensional interval $I^k = \{x = (x^1, \ldots, x^k) \in \mathbb{R}^k \mid 0 \le x^i \le 1,\ i = 1, \ldots, k\}$. Let us verify that (13.35) does indeed hold for $I^k$.

Since a $(k-1)$-form on $I^k$ has the form $\omega = \sum_i a_i(x)\,dx^1 \wedge \cdots \wedge \widehat{dx^i} \wedge \cdots \wedge dx^k$ (summation over $i = 1, \ldots, k$, with the differential $dx^i$ omitted), it suffices to prove (13.35) for each individual term. Let $\omega = a(x)\,dx^1 \wedge \cdots \wedge \widehat{dx^i} \wedge \cdots \wedge dx^k$. Then $d\omega = (-1)^{i-1}\frac{\partial a}{\partial x^i}(x)\,dx^1 \wedge \cdots \wedge dx^i \wedge \cdots \wedge dx^k$. We now carry out the computation:

$$\int_{I^k} d\omega = \int_{I^k} (-1)^{i-1}\frac{\partial a}{\partial x^i}(x)\,dx^1 \wedge \cdots \wedge dx^k =$$
$$= (-1)^{i-1}\int_{I^{k-1}} dx^1 \cdots \widehat{dx^i} \cdots dx^k \int_0^1 \frac{\partial a}{\partial x^i}(x)\,dx^i =$$
$$= (-1)^{i-1}\int_{I^{k-1}} \bigl(a(x^1, \ldots, x^{i-1}, 1, x^{i+1}, \ldots, x^k) - a(x^1, \ldots, x^{i-1}, 0, x^{i+1}, \ldots, x^k)\bigr)\,dx^1 \cdots \widehat{dx^i} \cdots dx^k =$$
$$= (-1)^{i-1}\int_{I^{k-1}} a(t^1, \ldots, t^{i-1}, 1, t^i, \ldots, t^{k-1})\,dt^1 \cdots dt^{k-1} + (-1)^{i}\int_{I^{k-1}} a(t^1, \ldots, t^{i-1}, 0, t^i, \ldots, t^{k-1})\,dt^1 \cdots dt^{k-1}.$$

Here $I^{k-1}$ is the same as $I^k$ in $\mathbb{R}^k$, only it is a $(k-1)$-dimensional interval in $\mathbb{R}^{k-1}$. In addition, we have relabeled the variables $x^1 = t^1, \ldots, x^{i-1} = t^{i-1}$, $x^{i+1} = t^i, \ldots, x^k = t^{k-1}$.

The mappings

$$I^{k-1} \ni t = (t^1, \ldots, t^{k-1}) \mapsto (t^1, \ldots, t^{i-1}, 1, t^i, \ldots, t^{k-1}) \in I^k,$$
$$I^{k-1} \ni t = (t^1, \ldots, t^{k-1}) \mapsto (t^1, \ldots, t^{i-1}, 0, t^i, \ldots, t^{k-1}) \in I^k$$

are parametrizations of the upper and lower faces $\Gamma_{i1}$ and $\Gamma_{i0}$ of the interval $I^k$ respectively orthogonal to the $x^i$ axis. These coordinates define the same frame $\mathbf{e}_1, \ldots, \mathbf{e}_{i-1}, \mathbf{e}_{i+1}, \ldots, \mathbf{e}_k$ orienting the faces and differing from the frame $\mathbf{e}_1, \ldots, \mathbf{e}_k$ of $\mathbb{R}^k$ in the absence of $\mathbf{e}_i$. On $\Gamma_{i1}$ the vector $\mathbf{e}_i$ is the exterior normal to $I^k$, as the vector $-\mathbf{e}_i$ is for the face $\Gamma_{i0}$. The frame $\mathbf{e}_i, \mathbf{e}_1, \ldots, \mathbf{e}_{i-1}, \mathbf{e}_{i+1}, \ldots, \mathbf{e}_k$ becomes the frame $\mathbf{e}_1, \ldots, \mathbf{e}_k$ after $i - 1$ interchanges of adjacent vectors, that is, the agreement or disagreement of the orientations of these frames is determined by the sign of $(-1)^{i-1}$. Thus, this parametrization defines an orientation on $\Gamma_{i1}$ consistent with the orientation of $I^k$ if taken with the corrective coefficient $(-1)^{i-1}$ (that is, not changing the orientation when $i$ is odd, but changing it when $i$ is even).

Analogous reasoning shows that for the face $\Gamma_{i0}$ it is necessary to take a corrective coefficient $(-1)^{i}$ to the orientation defined by this parametrization of the face $\Gamma_{i0}$.

Thus, the last two integrals (together with the coefficients in front of them) can be interpreted respectively as the integrals of the form $\omega$ over the faces $\Gamma_{i1}$ and $\Gamma_{i0}$ of $I^k$ with the orientation induced by the orientation of $I^k$.
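The sign rule for the faces can be checked mechanically: the frame $\mathbf{e}_i, \mathbf{e}_1, \ldots, \mathbf{e}_{i-1}, \mathbf{e}_{i+1}, \ldots, \mathbf{e}_k$ has the same orientation as $\mathbf{e}_1, \ldots, \mathbf{e}_k$ exactly when the determinant of its coordinate matrix is $+1$. The short sketch below computes that determinant for small $k$; the function names `det`, `basis_vector`, and `face_sign` are illustrative assumptions.

```python
def det(matrix):
    """Determinant by cofactor expansion along the first row (adequate for small sizes)."""
    if len(matrix) == 1:
        return matrix[0][0]
    total = 0
    for j, entry in enumerate(matrix[0]):
        if entry == 0:
            continue
        minor = [row[:j] + row[j + 1:] for row in matrix[1:]]
        total += (-1) ** j * entry * det(minor)
    return total


def basis_vector(i, k):
    """The i-th standard basis vector of R^k (0-based index i)."""
    return [1 if j == i else 0 for j in range(k)]


def face_sign(i, k):
    """Orientation sign of the frame (e_i, e_1, ..., e_{i-1}, e_{i+1}, ..., e_k).

    Here i is 1-based, as in the text. The value is +1 if the frame is
    oriented like (e_1, ..., e_k) and -1 otherwise.
    """
    order = [i - 1] + [j for j in range(k) if j != i - 1]
    frame = [basis_vector(j, k) for j in order]
    return det(frame)


for k in range(1, 5):
    print(k, [face_sign(i, k) for i in range(1, k + 1)])
```

Each printed list alternates $+1, -1, +1, \ldots$, that is, `face_sign(i, k)` equals $(-1)^{i-1}$, which is the corrective coefficient for the upper face $\Gamma_{i1}$; the lower face $\Gamma_{i0}$ has exterior normal $-\mathbf{e}_i$ and therefore receives the opposite sign $(-1)^{i}$.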

We now remark that on each of the remaining faces of $I^k$ one of the coordinates $x^1, \ldots, x^{i-1}, x^{i+1}, \ldots, x^k$ is constant. Hence the differential corresponding to it is equal to zero on such a face. Thus, the form $\omega$ is identically equal to zero and its integral equals zero over all faces except $\Gamma_{i0}$ and $\Gamma_{i1}$.

Hence we can interpret the sum of the integrals over these two faces as the integral of the form $\omega$ over the entire boundary $\partial I^k$ of the interval $I^k$ oriented in consistency with the orientation of the interval $I^k$ itself.

The formula

$$\int_{I^k} d\omega = \int_{\partial I^k} \omega,$$

and along with it formula (13.35), is now proved. □

As one can see, formula (13.35) is a corollary of the Newton-Leibniz formula 
(fundamental theorem of calculus), Fubini’s theorem, and a series of definitions 
of such concepts as surface, boundary of a surface, orientation, differential form, 
differentiation of a differential form, and transference of forms. 

Formulas (13.25), (13.30), and (13.34), the formulas of Green, Gauss-Ostrogradskii, and Stokes respectively, are special cases of the general formula (13.35). Moreover, if we interpret a function $f$ defined on a closed interval $[a, b] \subset \mathbb{R}$ as a 0-form $\omega$, and the integral of a 0-form over an oriented point as the value of the function at that point taken with the sign of the orientation of the point, then the Newton-Leibniz formula itself can be regarded as an elementary (but independent) version of (13.35). Consequently, the fundamental relation (13.35) holds in all dimensions $k \ge 1$.

Formula (13.35) is often called the general Stokes formula. As historical infor- 
mation, we quote here some lines from the preface of M. Spivak to his book (cited 
in the bibliography below): 

The first statement of the Theorem 9 appears as a postscript to a letter, dated July 2, 1850, 

from Sir William Thomson (Lord Kelvin) to Stokes. It appeared publicly as question 8 on 


9 The classical Stokes theorem (13.34) is meant. 
the Smith’s Prize Examination for 1854. This competitive examination, which was taken annually
by the best mathematics students at Cambridge University, was set from 1849 to 1882
by Professor Stokes; by the time of his death the result was known universally as Stokes’ 
theorem. At least three proofs were given by his contemporaries: Thomson published one, 
another appeared in Thomson and Tait’s Treatise on Natural Philosophy, and Maxwell pro- 
vided another in Electricity and Magnetism. Since this time the name of Stokes has been 
applied to much more general results, which have figured so prominently in the develop- 
ment of certain parts of mathematics that Stokes’ theorem may be considered a case study 
in the value of generalization. 

We note that the modern language of differential forms originates with Élie Cartan,¹⁰ but the form (13.35) for the general Stokes’ formula for surfaces in $\mathbb{R}^n$ seems to have been first proposed by Poincaré. For domains in $n$-dimensional space $\mathbb{R}^n$ Ostrogradskii already knew the formula, and Leibniz wrote down the first differential forms.

Thus it is not an accident that the general Stokes formula (13.35) is sometimes called the Newton-Leibniz-Green-Gauss-Ostrogradskii-Stokes-Poincaré formula. One can conclude from what has been said that this is by no means its full name.
Let us use this formula to generalize the result of Example 2. 

Example 5 Let us show that every smooth mapping $f : B \to B$ of a closed ball $B \subset \mathbb{R}^m$ into itself has at least one fixed point.

Proof If the mapping $f$ had no fixed points, then, as in Example 2, one could construct a smooth mapping $\varphi : B \to \partial B$ that is the identity on the sphere $\partial B$. In the domain $\mathbb{R}^m \setminus 0$, we consider the vector field $\frac{\mathbf{r}}{|\mathbf{r}|^m}$, where $\mathbf{r}$ is the radius-vector of the point $x = (x^1, \ldots, x^m) \in \mathbb{R}^m \setminus 0$, and the flux form

$$\omega = \sum_{i=1}^{m} \frac{(-1)^{i-1} x^i\,dx^1 \wedge \cdots \wedge \widehat{dx^i} \wedge \cdots \wedge dx^m}{\bigl((x^1)^2 + \cdots + (x^m)^2\bigr)^{m/2}}$$

corresponding to this field (see formula (13.19) of Sect. 13.2). The flux of such a field across the boundary of the ball $B = \{x \in \mathbb{R}^m \mid |x| \le 1\}$ in the direction of the outward normal to the sphere $\partial B$ is obviously equal to the area of the sphere $\partial B$, that is, $\int_{\partial B} \omega \ne 0$. But, as one can easily verify by direct computation, $d\omega = 0$ in $\mathbb{R}^m \setminus 0$, from which, by using the general Stokes formula, as in Example 2, we find that

$$\int_{\partial B} \omega = \int_{\partial B} \varphi^*\omega = \int_B d\varphi^*\omega = \int_B \varphi^* d\omega = \int_B \varphi^* 0 = 0.$$

This contradiction finishes the proof. □


10 Elie Cartan (1869-1951) - outstanding French geometer. 
13.3.5 Problems and Exercises

1. a) Does Green’s formula (13.25) change if we pass from the coordinate system 
x, y to the system y, x? 

b) Does formula (13.25") change in this case? 

2. a) Prove that formula (13.25) remains valid if the functions $P$ and $Q$ are continuous in a closed square $I$, their partial derivatives $\frac{\partial P}{\partial y}$ and $\frac{\partial Q}{\partial x}$ are continuous at interior points of $I$, and the double integral in (13.25') exists, even if only as an improper integral.

b) Verify that if the boundary of a compact domain $D$ consists of piecewise-smooth curves, then under assumptions analogous to those in a), formula (13.25) remains valid.

3. a) Verify the proof of (13.26') in detail.

b) Show that if the boundary of a compact domain D C R 2 consists of a finite 
number of smooth curves having only a finite number of points of inflection, then 
D is a simple domain with respect to any pair of coordinate axes. 

c) Is it true that if the boundary of a plane domain consists of smooth curves, 
then one can choose the coordinate axes in R 2 such that it is a simple domain relative 
to them? 

4. a) Show that if the functions $P$ and $Q$ in Green’s formula are such that $\frac{\partial Q}{\partial x} - \frac{\partial P}{\partial y} \equiv 1$, then the area $\sigma(D)$ of the domain $D$ can be found using the formula $\sigma(D) = \int_{\partial D} P\,dx + Q\,dy$.

b) Explain the geometric meaning of the integral $\int_{\gamma} y\,dx$ over some (possibly nonclosed) curve in the plane with Cartesian coordinates $x, y$. Starting from this, give a new interpretation of the formula $\sigma(D) = -\int_{\partial D} y\,dx$.

c) As a check on the preceding formula, use it to find the area of the domain 



D = (x, y) e K 2 


5. a) Let $x = x(t)$ be a diffeomorphism of the domain $D_t \subset \mathbb{R}^2$ onto the domain $D_x \subset \mathbb{R}^2$. Using the results of Problem 4 and the fact that a line integral is independent of the admissible change in the parametrization of the path, prove that

$$\int_{D_x} dx = \int_{D_t} |x'(t)|\,dt,$$

where $dx = dx^1\,dx^2$, $dt = dt^1\,dt^2$, $|x'(t)| = \det x'(t)$.

b) From a) derive the formula

$$\int_{D_x} f(x)\,dx = \int_{D_t} f\bigl(x(t)\bigr)\,\bigl|\det x'(t)\bigr|\,dt$$

for change of variable in a double integral.
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6 . Let fix, y, t) be a smooth function satisfying the condition (|^) 2 + ijy) 2 7 ^ 0 
in its domain of definition. Then for each fixed value of the parameter t the equation 
f(x,y,t ) = 0 defines a curve y t in the plane R 2 . Then a family of curves {y f } 
depending on the parameter t arises in the plane. A smooth curve r C R 2 defined 
by parametric equations x = x(t), y — y(t ), is the envelope of the family of curves 
{y,} if the point x(1f), y(to) lies on the corresponding curve y tQ and the curves r 
and y tQ are tangent at that point, for every value of to in the common domain of 
definition of {y t } and the functions xit), y(t). 

a) Assuming that x, y are Cartesian coordinates in the plane, show that the func- 
tions xit), y(t) that define the envelope must satisfy the system of equations 

f(x, y, t) — 0 , 

3 / 

f{-(x,y,t) = 0, 

at 

and from the geometric point of view the envelope itself is the boundary of the 
projection (shadow) of the surface f(x, y, t) — 0 of R 2 t t> on the plane R 2 r y) . 

b) A family of lines $x\cos\alpha + y\sin\alpha - p(\alpha) = 0$ is given in the plane with
Cartesian coordinates $x$ and $y$. The role of the parameter is played here by the polar
angle $\alpha$. Give the geometric meaning of the quantity $p(\alpha)$, and find the envelope of
this family if $p(\alpha) = c + a\cos\alpha + b\sin\alpha$, where $a$, $b$, and $c$ are constants.

c) Describe the accessible zone of a shell that can be fired from an adjustable
cannon making any angle $\alpha \in [0, \pi/2]$ to the horizon.

d) Show that if the function $p(\alpha)$ of b) is $2\pi$-periodic, then the corresponding
envelope $\Gamma$ is a closed curve.

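e) Using Problem 4, show that the length $L$ of the closed curve $\Gamma$ obtained in d)
can be found by the formula

$$L = \int_0^{2\pi} p(\alpha)\,d\alpha.$$

(Assume that $p \in C^{(2)}$.)

f) Show also that the area $\sigma$ of the region bounded by the closed curve $\Gamma$ obtained
in d) can be computed as

$$\sigma = \frac{1}{2}\int_0^{2\pi} \bigl(p^2 - \dot{p}^2\bigr)(\alpha)\,d\alpha, \qquad \dot{p}(\alpha) = \frac{dp}{d\alpha}(\alpha).$$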

7. Consider the integral $\int_\gamma \frac{\cos(\mathbf{r}, \mathbf{n})}{r}\,ds$, in which $\gamma$ is a smooth curve in $\mathbb{R}^2$,
$\mathbf{r}$ is the radius-vector of the point $(x, y) \in \gamma$, $r = |\mathbf{r}| = \sqrt{x^2 + y^2}$, $\mathbf{n}$ is the
unit normal vector to $\gamma$ at $(x, y)$ varying continuously along $\gamma$, and $ds$ is arc
length on the curve. This integral is called Gauss' integral.

a) Write Gauss' integral in the form of a flux $\int_\gamma \langle \mathbf{V}, \mathbf{n}\rangle\,ds$ of the plane vector
field $\mathbf{V}$ across the curve $\gamma$.

b) Show that in Cartesian coordinates $x$ and $y$ Gauss' integral has the form

$$\pm\int_\gamma \frac{-y\,dx + x\,dy}{x^2 + y^2}$$

familiar to us from Example 1 of Sect. 13.1, where the choice of sign is determined
by the choice of the field of normals $\mathbf{n}$.

c) Compute Gauss' integral for a closed curve $\gamma$ that encircles the origin once
and for a curve $\gamma$ bounding a domain that does not contain the origin.

d) Show that $\frac{\cos(\mathbf{r}, \mathbf{n})}{r}\,ds = d\varphi$, where $\varphi$ is the polar angle of the radius-vector $\mathbf{r}$,
and give the geometric meaning of the value of Gauss' integral for a closed curve
and for an arbitrary curve $\gamma \subset \mathbb{R}^2$.

8. In deriving the Gauss-Ostrogradskii formula we assumed that $D$ is a simple domain
and the functions $P$, $Q$, $R$ belong to $C^{(1)}(\overline{D}, \mathbb{R})$. Show by improving the reasoning
that formula (13.30) holds if $D$ is a compact domain with piecewise smooth
boundary, $P, Q, R \in C(D, \mathbb{R})$, and the triple integral converges, even if it is an
improper integral.

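9. a) If the functions $P$, $Q$, and $R$ in formula (13.30) are such that
$\frac{\partial P}{\partial x} + \frac{\partial Q}{\partial y} + \frac{\partial R}{\partial z} = 1$, then the volume $V(D)$ of the domain $D$ can be found by the
formula

$$V(D) = \iint_{\partial D} P\,dy \wedge dz + Q\,dz \wedge dx + R\,dx \wedge dy.$$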


b) Let $f(x, t)$ be a smooth function of the variables $x \in D_x \subset \mathbb{R}^n$, $t \in D_t \subset \mathbb{R}^m$
and $\frac{\partial f}{\partial x} = \bigl(\frac{\partial f}{\partial x^1}, \ldots, \frac{\partial f}{\partial x^n}\bigr) \neq 0$. Write the system of equations that must be satisfied by
the $(n-1)$-dimensional surface in $\mathbb{R}^n$ that is the envelope of the family of surfaces
$\{S_t\}$ defined by the conditions $f(x, t) = 0$, $t \in D_t$ (see Problem 6).

c) Choosing a point on the unit sphere as the parameter $t$, exhibit a family of
planes in $\mathbb{R}^3$ depending on the parameter $t$ whose envelope is the ellipsoid
$\frac{x^2}{a^2} + \frac{y^2}{b^2} + \frac{z^2}{c^2} = 1$.

d) Show that if a closed surface $S$ is the envelope of a family of planes

$$\cos\alpha_1(t)\,x + \cos\alpha_2(t)\,y + \cos\alpha_3(t)\,z - p(t) = 0,$$

where $\alpha_1$, $\alpha_2$, $\alpha_3$ are the angles formed by the normal to the plane and the coordinate
axes and the parameter $t$ is a variable point of the unit sphere $S^2 \subset \mathbb{R}^3$, then the area
$\sigma$ of the surface $S$ can be found by the formula $\sigma = \int_{S^2} p(t)\,d\sigma$.

e) Show that the volume of the body bounded by the surface $S$ considered in d)
can be found by the formula $V = \frac{1}{3}\int_S p(t)\,d\sigma$.

f) Test the formula given in e) by finding the volume of the ellipsoid
$\frac{x^2}{a^2} + \frac{y^2}{b^2} + \frac{z^2}{c^2} \le 1$.

g) What does the $n$-dimensional analogue of the formulas in d) and e) look like?


10. a) Using the Gauss-Ostrogradskii formula, verify that the flux of the field $\mathbf{r}/r^3$
(where $\mathbf{r}$ is the radius-vector of the point $x \in \mathbb{R}^3$ and $r = |\mathbf{r}|$) across a smooth surface
$S$ enclosing the origin and homeomorphic to a sphere equals the flux of the same
field across an arbitrarily small sphere $|x| = \varepsilon$.

b) Show that the flux in a) is $4\pi$.

c) Interpret Gauss' integral $\int_S \frac{\cos(\mathbf{r}, \mathbf{n})}{r^2}\,ds$ in $\mathbb{R}^3$ as the flux of the field $\mathbf{r}/r^3$
across the surface $S$.

d) Compute Gauss' integral over the boundary of a compact domain $D \subset \mathbb{R}^3$,
considering both the case when $D$ contains the origin in its interior and the case
when the origin lies outside $D$.

e) Comparing Problems 7 and 10a)-d), give an $n$-dimensional version of
Gauss' integral and the corresponding vector field. Give an $n$-dimensional statement
of problems a)-d) and verify it.

11. a) Show that a closed rigid surface $S \subset \mathbb{R}^3$ remains in equilibrium under the
action of a uniformly distributed pressure on it. (By the principles of statics the
problem reduces to verifying the equalities $\iint_S \mathbf{n}\,d\sigma = 0$, $\iint_S [\mathbf{r}, \mathbf{n}]\,d\sigma = 0$, where $\mathbf{n}$
is a unit normal vector, $\mathbf{r}$ is the radius-vector, and $[\mathbf{r}, \mathbf{n}]$ is the vector product of $\mathbf{r}$
and $\mathbf{n}$.)

b) A solid body of volume V is completely immersed in a liquid having spe- 
cific gravity 1 . Show that the complete static effect of the pressure of the liquid on 
the body reduces to a single force F of magnitude V directed vertically upward and 
attached to the center of mass C of the solid domain occupied by the body. 

12. Let $\Gamma : I^k \to D$ be a smooth (not necessarily homeomorphic) mapping of an
interval $I^k \subset \mathbb{R}^k$ into a domain $D$ of $\mathbb{R}^n$, in which a $k$-form $\omega$ is defined. By analogy
with the one-dimensional case, we shall call a mapping $\Gamma$ a $k$-cell or $k$-path and by
definition set $\int_\Gamma \omega = \int_{I^k} \Gamma^*\omega$. Study the proof of the general Stokes formula and
verify that it holds not only for $k$-dimensional surfaces but also for $k$-cells.

13. Using the generalized Stokes formula, prove by induction the formula for 
change of variable in a multiple integral (the idea of the proof is shown in Prob- 
lem 5a)). 

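14. Integration by parts in a multiple integral.

Let $D$ be a bounded domain in $\mathbb{R}^m$ with a regular (smooth or piecewise smooth)
boundary $\partial D$ oriented by the outward unit normal $\mathbf{n} = (n^1, \ldots, n^m)$.

Let $f$, $g$ be smooth functions in $D$.

a) Show that

$$\int_D \partial_i f\,dv = \int_{\partial D} f\,n^i\,d\sigma.$$

b) Prove the following formula for integration by parts:

$$\int_D (\partial_i f)\,g\,dv = \int_{\partial D} f g\,n^i\,d\sigma - \int_D f\,(\partial_i g)\,dv.$$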

Chapter 14 

Elements of Vector Analysis and Field Theory 


14.1 The Differential Operations of Vector Analysis 

14.1.1 Scalar and Vector Fields 

In field theory we consider functions $x \mapsto T(x)$ that assign to each point $x$ of a
given domain $D$ a special object $T(x)$ called a tensor. If such a function is defined
in a domain $D$, we say that a tensor field is defined in $D$. We do not intend to give
the definition of a tensor at this point: that concept will be studied in algebra and
differential geometry. We shall say only that numerical functions $D \ni x \mapsto f(x) \in \mathbb{R}$
and vector-valued functions $\mathbb{R}^n \supset D \ni x \mapsto \mathbf{V}(x) \in T\mathbb{R}^n_x \approx \mathbb{R}^n$ are special cases
of tensor fields and are called scalar fields and vector fields respectively in $D$ (we
have used this terminology earlier).

A differential $p$-form $\omega$ in $D$ is a function $\mathbb{R}^n \supset D \ni x \mapsto \omega(x) \in \mathcal{L}((\mathbb{R}^n)^p, \mathbb{R})$
which can be called a field of forms of degree $p$ in $D$. This also is a special case of
a tensor field.

At present we are primarily interested in scalar and vector fields in domains of
the oriented Euclidean space $\mathbb{R}^n$. These fields play a major role in many applications
of analysis in natural science.


14.1.2 Vector Fields and Forms in $\mathbb{R}^3$

We recall that in the Euclidean vector space $\mathbb{R}^3$ with inner product $\langle\,,\rangle$ there is a
correspondence between linear functionals $A : \mathbb{R}^3 \to \mathbb{R}$ and vectors $\mathbf{A} \in \mathbb{R}^3$ consisting
of the following: Each such functional has the form $A(\xi) = \langle \mathbf{A}, \xi\rangle$, where $\mathbf{A}$ is
a completely definite vector in $\mathbb{R}^3$.

If the space is also oriented, each skew-symmetric bilinear functional
$B : \mathbb{R}^3 \times \mathbb{R}^3 \to \mathbb{R}$ can be uniquely written in the form $B(\xi_1, \xi_2) = (\mathbf{B}, \xi_1, \xi_2)$, where $\mathbf{B}$ is
a completely definite vector in $\mathbb{R}^3$ and $(\mathbf{B}, \xi_1, \xi_2)$, as always, is the scalar triple
product of the vectors $\mathbf{B}$, $\xi_1$, and $\xi_2$, or what is the same, the value of the volume
element on these vectors. Thus, in the oriented Euclidean vector space $\mathbb{R}^3$ one can
associate with each vector a linear or bilinear form, and defining the linear or bilinear
form is equivalent to defining the corresponding vector in $\mathbb{R}^3$.

If there is an inner product in $\mathbb{R}^3$, it also arises naturally in each tangent space
$T\mathbb{R}^3_x$ consisting of the vectors attached to the point $x \in \mathbb{R}^3$, and the orientation of $\mathbb{R}^3$
orients each space $T\mathbb{R}^3_x$.

Hence defining a 1-form $\omega^1(x)$ or a 2-form $\omega^2(x)$ in $T\mathbb{R}^3_x$ under the conditions
just listed is equivalent to defining some vector $\mathbf{A}(x) \in T\mathbb{R}^3_x$ corresponding to the
form $\omega^1(x)$ or a vector $\mathbf{B}(x) \in T\mathbb{R}^3_x$ corresponding to the form $\omega^2(x)$.

Consequently, defining a 1-form $\omega^1$ or a 2-form $\omega^2$ in a domain $D$ of the oriented
Euclidean space $\mathbb{R}^3$ is equivalent to defining the vector field $\mathbf{A}$ or $\mathbf{B}$ in $D$
corresponding to the form.

In explicit form, this correspondence amounts to the following:

$$\omega^1_{\mathbf{A}}(x)(\xi) = \langle \mathbf{A}(x), \xi\rangle,$$ (14.1)

$$\omega^2_{\mathbf{B}}(x)(\xi_1, \xi_2) = (\mathbf{B}(x), \xi_1, \xi_2),$$ (14.2)

where $\mathbf{A}(x)$, $\mathbf{B}(x)$, $\xi$, $\xi_1$, and $\xi_2$ belong to $TD_x$.

Here we see the work form $\omega^1 = \omega^1_{\mathbf{A}}$ of the vector field $\mathbf{A}$ and the flux form
$\omega^2 = \omega^2_{\mathbf{B}}$ of the vector field $\mathbf{B}$, which are already familiar to us.

To a scalar field $f : D \to \mathbb{R}$, we can assign a 0-form and a 3-form in $D$ as follows:

$$\omega^0_f = f,$$ (14.3)

$$\omega^3_f = f\,dV,$$ (14.4)

where $dV$ is the volume element in the oriented Euclidean space $\mathbb{R}^3$.

In view of the correspondences (14.1)-(14.4), definite operations on vector and
scalar fields correspond to operations on forms. This observation, as we shall soon
verify, is very useful technically.

Proposition 1 To a linear combination of forms of the same degree there corre- 
sponds a linear combination of the vector and scalar fields corresponding to them. 

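Proof Proposition 1 is of course obvious. However, let us write out the full proof,
as an example, for 1-forms:

$$\alpha_1\omega^1_{\mathbf{A}_1} + \alpha_2\omega^1_{\mathbf{A}_2} = \alpha_1\langle \mathbf{A}_1, \cdot\rangle + \alpha_2\langle \mathbf{A}_2, \cdot\rangle = \langle \alpha_1\mathbf{A}_1 + \alpha_2\mathbf{A}_2, \cdot\rangle = \omega^1_{\alpha_1\mathbf{A}_1 + \alpha_2\mathbf{A}_2}.$$ □

It is clear from the proof that $\alpha_1$ and $\alpha_2$ can be regarded as functions (not
necessarily constant) in the domain $D$ in which the forms and fields are defined.

As an abbreviation, let us agree to use, along with the symbols $\langle\,,\rangle$ and $[\,,]$ for
the inner product and the vector product of vectors $\mathbf{A}$ and $\mathbf{B}$ in $\mathbb{R}^3$, the alternative
notation $\mathbf{A} \cdot \mathbf{B}$ and $\mathbf{A} \times \mathbf{B}$ wherever convenient.

Proposition 2 If $\mathbf{A}$, $\mathbf{B}$, $\mathbf{A}_1$, and $\mathbf{A}_2$ are vector fields in the oriented Euclidean
space $\mathbb{R}^3$, then

$$\omega^1_{\mathbf{A}_1} \wedge \omega^1_{\mathbf{A}_2} = \omega^2_{\mathbf{A}_1 \times \mathbf{A}_2},$$ (14.5)

$$\omega^1_{\mathbf{A}} \wedge \omega^2_{\mathbf{B}} = \omega^3_{\mathbf{A} \cdot \mathbf{B}}.$$ (14.6)

In other words, the vector product $\mathbf{A}_1 \times \mathbf{A}_2$ of fields $\mathbf{A}_1$ and $\mathbf{A}_2$ that generate
1-forms corresponds to the exterior product of the 1-forms they generate, since it
generates the 2-form that results from the product.

In the same sense the inner product of the vector fields $\mathbf{A}$ and $\mathbf{B}$ that generate a
1-form $\omega^1_{\mathbf{A}}$ and a 2-form $\omega^2_{\mathbf{B}}$ corresponds to the exterior product of these forms.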

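Proof To prove these assertions, fix an orthonormal basis in $\mathbb{R}^3$ and the Cartesian
coordinates $x^1, x^2, x^3$ corresponding to it.

In Cartesian coordinates

$$\omega^1_{\mathbf{A}}(x)(\xi) = \mathbf{A}(x) \cdot \xi = \sum_{i=1}^{3} A^i(x)\,\xi^i = \sum_{i=1}^{3} A^i(x)\,dx^i(\xi),$$

that is,

$$\omega^1_{\mathbf{A}} = A^1\,dx^1 + A^2\,dx^2 + A^3\,dx^3,$$ (14.7)

and

$$\omega^2_{\mathbf{B}}(x)(\xi_1, \xi_2) = \begin{vmatrix} B^1(x) & B^2(x) & B^3(x) \\ \xi_1^1 & \xi_1^2 & \xi_1^3 \\ \xi_2^1 & \xi_2^2 & \xi_2^3 \end{vmatrix} = \bigl(B^1(x)\,dx^2 \wedge dx^3 + B^2(x)\,dx^3 \wedge dx^1 + B^3(x)\,dx^1 \wedge dx^2\bigr)(\xi_1, \xi_2),$$

that is,

$$\omega^2_{\mathbf{B}} = B^1\,dx^2 \wedge dx^3 + B^2\,dx^3 \wedge dx^1 + B^3\,dx^1 \wedge dx^2.$$ (14.8)

Therefore in Cartesian coordinates, taking account of expressions (14.7) and
(14.8), we obtain

$$\omega^1_{\mathbf{A}_1} \wedge \omega^1_{\mathbf{A}_2} = (A_1^1\,dx^1 + A_1^2\,dx^2 + A_1^3\,dx^3) \wedge (A_2^1\,dx^1 + A_2^2\,dx^2 + A_2^3\,dx^3) =$$
$$= (A_1^2 A_2^3 - A_1^3 A_2^2)\,dx^2 \wedge dx^3 + (A_1^3 A_2^1 - A_1^1 A_2^3)\,dx^3 \wedge dx^1 + (A_1^1 A_2^2 - A_1^2 A_2^1)\,dx^1 \wedge dx^2 = \omega^2_{\mathbf{B}},$$
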
where $\mathbf{B} = \mathbf{A}_1 \times \mathbf{A}_2$.

Coordinates were used in this proof only to make it easier to find the vector $\mathbf{B}$
of the corresponding 2-form. The equality (14.5) itself, of course, is independent of
the coordinate system.

Similarly, multiplying Eqs. (14.7) and (14.8), we obtain

$$\omega^1_{\mathbf{A}} \wedge \omega^2_{\mathbf{B}} = (A^1 B^1 + A^2 B^2 + A^3 B^3)\,dx^1 \wedge dx^2 \wedge dx^3 = \omega^3_\rho.$$

In Cartesian coordinates $dx^1 \wedge dx^2 \wedge dx^3$ is the volume element in $\mathbb{R}^3$, and the
sum of the pairwise products of the coordinates of the vectors $\mathbf{A}$ and $\mathbf{B}$, which
appears in parentheses just before the 3-form, is the inner product of these vectors
at the corresponding points of the domain, from which it follows that
$\rho(x) = \mathbf{A}(x) \cdot \mathbf{B}(x)$. □
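
Relations (14.5) and (14.6) can also be spot-checked numerically at a single point. The following is only an illustrative sketch: the sample vectors and the helper names (`work_form`, `flux_form`, `wedge_1_1`, `wedge_1_2`) are assumptions introduced here and are not part of the text. It evaluates both sides of each relation on arbitrary tangent vectors, using the definitions (14.1) and (14.2) and the usual alternation formulas for the exterior product.

```python
def dot(a, b):
    return sum(p * q for p, q in zip(a, b))

def cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def work_form(a):
    """1-form of (14.1) at a point: xi -> <a, xi>."""
    return lambda xi: dot(a, xi)

def flux_form(b):
    """2-form of (14.2) at a point: (xi1, xi2) -> scalar triple product (b, xi1, xi2)."""
    return lambda xi1, xi2: dot(b, cross(xi1, xi2))

def wedge_1_1(w1, w2):
    """Exterior product of two 1-forms, evaluated on two vectors."""
    return lambda xi1, xi2: w1(xi1) * w2(xi2) - w1(xi2) * w2(xi1)

def wedge_1_2(w1, w2):
    """Exterior product of a 1-form and a 2-form, evaluated on three vectors."""
    return lambda xi1, xi2, xi3: (w1(xi1) * w2(xi2, xi3)
                                  - w1(xi2) * w2(xi1, xi3)
                                  + w1(xi3) * w2(xi1, xi2))

# Arbitrary sample data (assumed for illustration only).
A1, A2 = (1.0, 2.0, 3.0), (-2.0, 0.5, 4.0)
B = (0.5, -1.0, 2.0)
xi1, xi2, xi3 = (0.25, -1.0, 2.0), (1.5, 0.25, -0.75), (-1.0, 3.0, 0.5)

# (14.5): the wedge of the work forms equals the flux form of A1 x A2.
lhs_5 = wedge_1_1(work_form(A1), work_form(A2))(xi1, xi2)
rhs_5 = flux_form(cross(A1, A2))(xi1, xi2)

# (14.6): the wedge of a work form and a flux form equals (A . B) dV.
volume = dot(xi1, cross(xi2, xi3))          # dV(xi1, xi2, xi3)
lhs_6 = wedge_1_2(work_form(A1), flux_form(B))(xi1, xi2, xi3)
rhs_6 = dot(A1, B) * volume

print(lhs_5, rhs_5)
print(lhs_6, rhs_6)
```

Agreement at one point is of course not a proof; it only illustrates what the two equalities assert.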


14.1.3 The Differential Operators grad, curl, div, and V 

Definition 1 To the exterior differentiation of 0-forms (functions), 1-forms, and
2-forms in oriented Euclidean space $\mathbb{R}^3$ there correspond respectively the operations
of finding the gradient (grad) of a scalar field and the curl and divergence (div) of
a vector field. These operations are defined by the relations

$$d\omega^0_f =: \omega^1_{\operatorname{grad} f},$$ (14.9)

$$d\omega^1_{\mathbf{A}} =: \omega^2_{\operatorname{curl} \mathbf{A}},$$ (14.10)

$$d\omega^2_{\mathbf{B}} =: \omega^3_{\operatorname{div} \mathbf{B}}.$$ (14.11)

By virtue of the correspondence between forms and scalar and vector fields in $\mathbb{R}^3$
established by Eqs. (14.1)-(14.4), relations (14.9)-(14.11) are unambiguous
definitions of the operations grad, curl, and div, performed on scalar and vector fields
respectively. These operations, the operators of field theory as they are called,
correspond to the single operation of exterior differentiation of forms, but applied to
forms of different degree.

Let us give right away the explicit form of these operators in Cartesian coordinates
$x^1, x^2, x^3$ in $\mathbb{R}^3$.

As we have explained, in this case

$$\omega^0_f = f,$$ (14.3')

$$\omega^1_{\mathbf{A}} = A^1\,dx^1 + A^2\,dx^2 + A^3\,dx^3,$$ (14.7')

$$\omega^2_{\mathbf{B}} = B^1\,dx^2 \wedge dx^3 + B^2\,dx^3 \wedge dx^1 + B^3\,dx^1 \wedge dx^2,$$ (14.8')

$$\omega^3_\rho = \rho\,dx^1 \wedge dx^2 \wedge dx^3.$$ (14.4')


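Since

$$\omega^1_{\operatorname{grad} f} := d\omega^0_f = df = \frac{\partial f}{\partial x^1}\,dx^1 + \frac{\partial f}{\partial x^2}\,dx^2 + \frac{\partial f}{\partial x^3}\,dx^3,$$

it follows from (14.7') that in these coordinates

$$\operatorname{grad} f = \mathbf{e}_1\frac{\partial f}{\partial x^1} + \mathbf{e}_2\frac{\partial f}{\partial x^2} + \mathbf{e}_3\frac{\partial f}{\partial x^3},$$ (14.9')

where $\mathbf{e}_1, \mathbf{e}_2, \mathbf{e}_3$ is a fixed orthonormal basis of $\mathbb{R}^3$.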
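Since

$$\omega^2_{\operatorname{curl} \mathbf{A}} := d\omega^1_{\mathbf{A}} = d(A^1\,dx^1 + A^2\,dx^2 + A^3\,dx^3) =$$
$$= \Bigl(\frac{\partial A^3}{\partial x^2} - \frac{\partial A^2}{\partial x^3}\Bigr)dx^2 \wedge dx^3 + \Bigl(\frac{\partial A^1}{\partial x^3} - \frac{\partial A^3}{\partial x^1}\Bigr)dx^3 \wedge dx^1 + \Bigl(\frac{\partial A^2}{\partial x^1} - \frac{\partial A^1}{\partial x^2}\Bigr)dx^1 \wedge dx^2,$$

it follows from (14.8') that in Cartesian coordinates

$$\operatorname{curl} \mathbf{A} = \mathbf{e}_1\Bigl(\frac{\partial A^3}{\partial x^2} - \frac{\partial A^2}{\partial x^3}\Bigr) + \mathbf{e}_2\Bigl(\frac{\partial A^1}{\partial x^3} - \frac{\partial A^3}{\partial x^1}\Bigr) + \mathbf{e}_3\Bigl(\frac{\partial A^2}{\partial x^1} - \frac{\partial A^1}{\partial x^2}\Bigr).$$ (14.10')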

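As an aid to memory this last relation is often written in symbolic form as

$$\operatorname{curl} \mathbf{A} = \begin{vmatrix} \mathbf{e}_1 & \mathbf{e}_2 & \mathbf{e}_3 \\ \dfrac{\partial}{\partial x^1} & \dfrac{\partial}{\partial x^2} & \dfrac{\partial}{\partial x^3} \\ A^1 & A^2 & A^3 \end{vmatrix}.$$ (14.10'')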


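Next, since

$$\omega^3_{\operatorname{div} \mathbf{B}} := d\omega^2_{\mathbf{B}} = d(B^1\,dx^2 \wedge dx^3 + B^2\,dx^3 \wedge dx^1 + B^3\,dx^1 \wedge dx^2) =$$
$$= \Bigl(\frac{\partial B^1}{\partial x^1} + \frac{\partial B^2}{\partial x^2} + \frac{\partial B^3}{\partial x^3}\Bigr)dx^1 \wedge dx^2 \wedge dx^3,$$

it follows from (14.4') that in Cartesian coordinates

$$\operatorname{div} \mathbf{B} = \frac{\partial B^1}{\partial x^1} + \frac{\partial B^2}{\partial x^2} + \frac{\partial B^3}{\partial x^3}.$$ (14.11')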


One can see from the formulas (14.9'), (14.10'), and (14.11') just obtained that
grad, curl, and div are linear differential operations (operators). The grad operator
is defined on differentiable scalar fields and assigns vector fields to the scalar fields.
The curl operator is also vector-valued, but is defined on differentiable vector fields.
The div operator is defined on differentiable vector fields and assigns scalar fields
to them.
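
The Cartesian formulas (14.9'), (14.10'), and (14.11') translate directly into a small numerical routine. The sketch below is an illustration under stated assumptions, not a method taken from the text: partial derivatives are replaced by central differences with an arbitrarily chosen step `H`, indices run from 0 to 2 rather than from 1 to 3, and the sample fields are made up for the example.

```python
H = 1e-5  # finite-difference step (an assumed, illustrative value)

def partial(g, i, x, h=H):
    """Central-difference approximation of dg/dx^i at the point x."""
    xp, xm = list(x), list(x)
    xp[i] += h
    xm[i] -= h
    return (g(xp) - g(xm)) / (2 * h)

def grad(f, x):
    """Components of grad f as in (14.9')."""
    return [partial(f, i, x) for i in range(3)]

def curl(A, x):
    """Components of curl A as in (14.10')."""
    d = lambda k, i: partial(lambda y: A(y)[k], i, x)   # dA^k/dx^i
    return [d(2, 1) - d(1, 2), d(0, 2) - d(2, 0), d(1, 0) - d(0, 1)]

def div(B, x):
    """div B as in (14.11')."""
    return sum(partial(lambda y, k=i: B(y)[k], i, x) for i in range(3))

# Sample fields (assumptions for the example).
f = lambda x: x[0] * x[1] + x[2] ** 2          # grad f = (x2, x1, 2 x3)
A = lambda x: [-x[1], x[0], 0.0]               # rigid rotation about the x3-axis
B = lambda x: [x[0], x[1], x[2]]               # radius-vector field

x0 = [1.0, 2.0, 3.0]
grad_f = grad(f, x0)    # approximately (2, 1, 6)
curl_A = curl(A, x0)    # approximately (0, 0, 2)
div_B = div(B, x0)      # approximately 3
print(grad_f, curl_A, div_B)
```

The three routines make the types visible: `grad` takes a scalar field to a vector, `curl` takes a vector field to a vector, and `div` takes a vector field to a number.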

We note that in other coordinates these operators will have expressions that are
in general different from those obtained above in Cartesian coordinates. We shall
discuss this point in Sect. 14.1.5 below.

We remark also that the vector field curl $\mathbf{A}$ is sometimes called the rotation of $\mathbf{A}$
and written rot $\mathbf{A}$.

As an example of the use of these operators we write out the famous¹ system of
equations of Maxwell,² which describe the state of the components of an
electromagnetic field as functions of a point $x = (x^1, x^2, x^3)$ in space and time $t$.


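Example 1 (The Maxwell equations for an electromagnetic field in a vacuum)

$$1.\ \operatorname{div} \mathbf{E} = \frac{\rho}{\varepsilon_0}. \qquad 2.\ \operatorname{div} \mathbf{B} = 0.$$
$$3.\ \operatorname{curl} \mathbf{E} = -\frac{\partial \mathbf{B}}{\partial t}. \qquad 4.\ \operatorname{curl} \mathbf{B} = \frac{\mathbf{j}}{\varepsilon_0 c^2} + \frac{1}{c^2}\frac{\partial \mathbf{E}}{\partial t}.$$ (14.12)

Here $\rho(x, t)$ is the electric charge density (the quantity of charge per unit volume),
$\mathbf{j}(x, t)$ is the electrical current density vector (the rate at which charge is
flowing across a unit area), $\mathbf{E}(x, t)$ and $\mathbf{B}(x, t)$ are the electric and magnetic field
intensities respectively, and $\varepsilon_0$ and $c$ are dimensioning constants (and in fact $c$ is the
speed of light in a vacuum).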


In mathematical and especially in physical literature, along with the operators
grad, curl, and div, wide use is made of the symbolic differential operator nabla
proposed by Hamilton (the Hamilton operator)³

$$\nabla = \mathbf{e}_1\frac{\partial}{\partial x^1} + \mathbf{e}_2\frac{\partial}{\partial x^2} + \mathbf{e}_3\frac{\partial}{\partial x^3},$$ (14.13)

where $\{\mathbf{e}_1, \mathbf{e}_2, \mathbf{e}_3\}$ is an orthonormal basis of $\mathbb{R}^3$ and $x^1, x^2, x^3$ are the corresponding
Cartesian coordinates.

By definition, applying the operator $\nabla$ to a scalar field $f$ (that is, to a function)
gives the vector field

$$\nabla f = \mathbf{e}_1\frac{\partial f}{\partial x^1} + \mathbf{e}_2\frac{\partial f}{\partial x^2} + \mathbf{e}_3\frac{\partial f}{\partial x^3},$$

which coincides with the field (14.9'), that is, the nabla operator is simply the grad
operator written in a different notation.


1 On this subject the famous American physicist and mathematician R. Feynman (1918-1988)
writes, with his characteristic acerbity, “From a long view of the history of mankind - seen from,
say, ten thousand years from now - there can be little doubt that the most significant event of the
19th century will be judged as Maxwell's discovery of the laws of electrodynamics. The American
Civil War will pale into provincial insignificance in comparison with this important scientific event
of the same decade.” Richard P. Feynman, Robert B. Leighton, and Matthew Sands, The Feynman
Lectures on Physics: Mainly Electromagnetism and Matter, Addison-Wesley, Reading, MA, 1964.

2 J.C. Maxwell (1831-1879) - outstanding Scottish physicist; he created the mathematical theory 
of the electromagnetic field and is also famous for his research in the kinetic theory of gases, optics 
and mechanics. 

3 W.R. Hamilton (1805-1865) - famous Irish mathematician and specialist in mechanics; he stated 
the variational principle (Hamilton’s principle) and constructed a phenomenological theory of optic 
phenomena; he was the creator of the theory of quaternions and the founder of vector analysis (in 
fact, the term “vector” is due to him).

Using, however, the vector form in which $\nabla$ is written, Hamilton proposed a system
of formal operations with it that imitates the corresponding algebraic operations
with vectors.

Before we illustrate these operations, we note that in dealing with $\nabla$ one must
adhere to the same principles and cautionary rules as in dealing with the usual
differentiation operator $D = \frac{d}{dx}$. For example, $\varphi D f$ equals $\varphi\frac{df}{dx}$ and not $\frac{d}{dx}(\varphi f)$
or $f\frac{d\varphi}{dx}$. Thus, the operator operates on whatever is placed to the right of it; left
multiplication in this case plays the role of a coefficient, that is, $\varphi D$ is the new
differential operator $\varphi\frac{d}{dx}$, not the function $\frac{d\varphi}{dx}$. Moreover, $D^2 = D \cdot D$, that is,

$$D^2 f = D(Df) = \frac{d}{dx}\Bigl(\frac{d}{dx}f\Bigr) = \frac{d^2 f}{dx^2}.$$

If we now, following Hamilton, deal with $\nabla$ as if it were a vector field defined
in Cartesian coordinates, then, comparing relations (14.13), (14.9'), (14.10''), and
(14.11'), we obtain

$$\operatorname{grad} f = \nabla f,$$ (14.14)

$$\operatorname{curl} \mathbf{A} = \nabla \times \mathbf{A},$$ (14.15)

$$\operatorname{div} \mathbf{B} = \nabla \cdot \mathbf{B}.$$ (14.16)

In this way the operators grad, curl, and div can be written in terms of the Hamilton
operator and the vector operations in $\mathbb{R}^3$.


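Example 2 Only the curl and div operators occurred in writing out the Maxwell
equations (14.12). Using the principles for dealing with $\nabla = \operatorname{grad}$, we rewrite the
Maxwell equations as follows, to compensate for the absence of grad in them:

$$1.\ \nabla \cdot \mathbf{E} = \frac{\rho}{\varepsilon_0}. \qquad 2.\ \nabla \cdot \mathbf{B} = 0.$$
$$3.\ \nabla \times \mathbf{E} = -\frac{\partial \mathbf{B}}{\partial t}. \qquad 4.\ \nabla \times \mathbf{B} = \frac{\mathbf{j}}{\varepsilon_0 c^2} + \frac{1}{c^2}\frac{\partial \mathbf{E}}{\partial t}.$$ (14.12')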


14.1.4 Some Differential Formulas of Vector Analysis 

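In the oriented Euclidean space $\mathbb{R}^3$ we have established the connection (14.1)-(14.4)
between forms on the one hand and vector and scalar fields on the other. This
connection enabled us to associate corresponding operators on fields with exterior
differentiation (see formulas (14.5), (14.6), and (14.9)-(14.11)).

This correspondence can be used to obtain a number of basic differential formulas
of vector analysis.

For example, the following relations hold:

$$\operatorname{curl}(f\mathbf{A}) = f\operatorname{curl} \mathbf{A} - \mathbf{A} \times \operatorname{grad} f,$$ (14.17)

$$\operatorname{div}(f\mathbf{A}) = \mathbf{A} \cdot \operatorname{grad} f + f\operatorname{div} \mathbf{A},$$ (14.18)

$$\operatorname{div}(\mathbf{A} \times \mathbf{B}) = \mathbf{B} \cdot \operatorname{curl} \mathbf{A} - \mathbf{A} \cdot \operatorname{curl} \mathbf{B}.$$ (14.19)

Proof We shall verify this last equality:

$$\omega^3_{\operatorname{div}(\mathbf{A} \times \mathbf{B})} = d\omega^2_{\mathbf{A} \times \mathbf{B}} = d(\omega^1_{\mathbf{A}} \wedge \omega^1_{\mathbf{B}}) = d\omega^1_{\mathbf{A}} \wedge \omega^1_{\mathbf{B}} - \omega^1_{\mathbf{A}} \wedge d\omega^1_{\mathbf{B}} =$$
$$= \omega^2_{\operatorname{curl} \mathbf{A}} \wedge \omega^1_{\mathbf{B}} - \omega^1_{\mathbf{A}} \wedge \omega^2_{\operatorname{curl} \mathbf{B}} = \omega^3_{\mathbf{B} \cdot \operatorname{curl} \mathbf{A}} - \omega^3_{\mathbf{A} \cdot \operatorname{curl} \mathbf{B}} = \omega^3_{\mathbf{B} \cdot \operatorname{curl} \mathbf{A} - \mathbf{A} \cdot \operatorname{curl} \mathbf{B}}.$$

The first two relations are verified similarly. Of course, the verification of all
these equalities can also be carried out by direct differentiation in coordinates. □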
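
The "direct differentiation in coordinates" mentioned at the end of the proof can be imitated numerically. The following is a rough sketch, assuming central differences in place of exact derivatives and using made-up polynomial sample fields `f`, `A`, `B`; it computes the difference between the two sides of each of (14.17), (14.18), and (14.19) at one point.

```python
def d(g, i, x, h=1e-4):
    """Central difference of the scalar function g with respect to x^i."""
    xp, xm = list(x), list(x)
    xp[i] += h
    xm[i] -= h
    return (g(xp) - g(xm)) / (2 * h)

def grad(f, x):
    return [d(f, i, x) for i in range(3)]

def curl(A, x):
    c = lambda k, i: d(lambda y: A(y)[k], i, x)      # dA^k/dx^i
    return [c(2, 1) - c(1, 2), c(0, 2) - c(2, 0), c(1, 0) - c(0, 1)]

def div(A, x):
    return sum(d(lambda y, k=i: A(y)[k], i, x) for i in range(3))

def dot(a, b):
    return sum(p * q for p, q in zip(a, b))

def cross(a, b):
    return [a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0]]

# Sample fields (assumptions for the example).
f = lambda x: x[0] * x[1] + x[2] ** 2
A = lambda x: [x[1] * x[2], x[0] ** 2, x[0] + x[2]]
B = lambda x: [x[2], x[0] * x[1], x[1] ** 2]

x0 = [0.7, -1.2, 0.4]
fA = lambda y: [f(y) * a for a in A(y)]
AxB = lambda y: cross(A(y), B(y))

# (14.17): curl(fA) = f curl A - A x grad f
lhs17 = curl(fA, x0)
rhs17 = [f(x0) * c - w for c, w in zip(curl(A, x0), cross(A(x0), grad(f, x0)))]
res17 = max(abs(l - r) for l, r in zip(lhs17, rhs17))

# (14.18): div(fA) = A . grad f + f div A
res18 = abs(div(fA, x0) - (dot(A(x0), grad(f, x0)) + f(x0) * div(A, x0)))

# (14.19): div(A x B) = B . curl A - A . curl B
res19 = abs(div(AxB, x0) - (dot(B(x0), curl(A, x0)) - dot(A(x0), curl(B, x0))))

print(res17, res18, res19)   # all three residuals should be close to zero
```

The residuals are not exactly zero because of discretization and rounding; they only indicate that the signs and the order of the factors in the three formulas are as stated.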


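If we take account of the relation $d^2\omega = 0$ for any form $\omega$, we can also assert that
the following equalities hold:

$$\operatorname{curl}\operatorname{grad} f = 0,$$ (14.20)

$$\operatorname{div}\operatorname{curl} \mathbf{A} = 0.$$ (14.21)

Proof Indeed:

$$\omega^2_{\operatorname{curl}\operatorname{grad} f} = d\omega^1_{\operatorname{grad} f} = d(d\omega^0_f) = d^2\omega^0_f = 0,$$
$$\omega^3_{\operatorname{div}\operatorname{curl} \mathbf{A}} = d\omega^2_{\operatorname{curl} \mathbf{A}} = d(d\omega^1_{\mathbf{A}}) = d^2\omega^1_{\mathbf{A}} = 0.$$ □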


In formulas (14.17)-(14.19) the operators grad, curl, and div are applied once,
while (14.20) and (14.21) involve the second-order operators obtained by successive
execution of two of the three original operations. Besides the rules given in (14.20)
and (14.21), one can also consider other combinations of these operators:

$$\operatorname{grad}\operatorname{div} \mathbf{A}, \qquad \operatorname{curl}\operatorname{curl} \mathbf{A}, \qquad \operatorname{div}\operatorname{grad} f.$$ (14.22)

The operator div grad is applied, as one can see, to a scalar field. This operator is
denoted $\Delta$ (Delta) and is called the Laplace operator⁴ or Laplacian.

It follows from (14.9') and (14.11') that in Cartesian coordinates

$$\Delta f = \frac{\partial^2 f}{\partial (x^1)^2} + \frac{\partial^2 f}{\partial (x^2)^2} + \frac{\partial^2 f}{\partial (x^3)^2}.$$ (14.23)

Since the operator $\Delta$ acts on numerical functions, it can be applied componentwise
to the coordinates of vector fields $\mathbf{A} = \mathbf{e}_1 A^1 + \mathbf{e}_2 A^2 + \mathbf{e}_3 A^3$, where $\mathbf{e}_1$, $\mathbf{e}_2$, and
$\mathbf{e}_3$ are an orthonormal basis in $\mathbb{R}^3$. In that case

$$\Delta \mathbf{A} = \mathbf{e}_1\,\Delta A^1 + \mathbf{e}_2\,\Delta A^2 + \mathbf{e}_3\,\Delta A^3.$$

Taking account of this last equality, we can write the following relation for the
triple of second-order operators (14.22):

$$\operatorname{curl}\operatorname{curl} \mathbf{A} = \operatorname{grad}\operatorname{div} \mathbf{A} - \Delta \mathbf{A},$$ (14.24)


4 P.S. Laplace (1749-1827) - famous French astronomer, mathematician, and physicist; he made 
fundamental contributions to the development of celestial mechanics, the mathematical theory of 
probability, and experimental and mathematical physics. 



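whose proof we shall not take the time to present (see Problem 2 below). The equality
(14.24) can serve as the definition of $\Delta \mathbf{A}$ in any coordinate system, not
necessarily orthogonal.

Using the language of vector algebra and formulas (14.14)-(14.16), we can write
all the second-order operators (14.20)-(14.22) in terms of the Hamilton operator $\nabla$:

$$\operatorname{curl}\operatorname{grad} f = \nabla \times \nabla f = 0,$$
$$\operatorname{div}\operatorname{curl} \mathbf{A} = \nabla \cdot (\nabla \times \mathbf{A}) = 0,$$
$$\operatorname{grad}\operatorname{div} \mathbf{A} = \nabla(\nabla \cdot \mathbf{A}),$$
$$\operatorname{curl}\operatorname{curl} \mathbf{A} = \nabla \times (\nabla \times \mathbf{A}),$$
$$\operatorname{div}\operatorname{grad} f = \nabla \cdot \nabla f.$$

From the point of view of vector algebra the vanishing of the first two of these
operators seems completely natural.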
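
The vanishing of curl grad and div curl, relations (14.20) and (14.21), can be observed numerically as well. The sketch below is an illustration only: it nests central differences (an assumed discretization, with an arbitrarily chosen step `H`) and applies them to made-up, non-polynomial sample fields. With this particular discretization the mixed second differences cancel in pairs, so the results differ from zero only by rounding error.

```python
import math

H = 1e-3  # finite-difference step (an assumed, illustrative value)

def d(g, i, x, h=H):
    """Central difference of the scalar function g with respect to x^i."""
    xp, xm = list(x), list(x)
    xp[i] += h
    xm[i] -= h
    return (g(xp) - g(xm)) / (2 * h)

def grad(f, x):
    return [d(f, i, x) for i in range(3)]

def curl(A, x):
    c = lambda k, i: d(lambda y: A(y)[k], i, x)      # dA^k/dx^i
    return [c(2, 1) - c(1, 2), c(0, 2) - c(2, 0), c(1, 0) - c(0, 1)]

def div(A, x):
    return sum(d(lambda y, k=i: A(y)[k], i, x) for i in range(3))

# Sample fields (assumptions for the example).
def f(x):
    return math.sin(x[0] * x[1]) + x[2] ** 3 * x[0]

def A(x):
    return [x[1] * x[2] ** 2, math.cos(x[0]) * x[2], x[0] * x[1] * x[2]]

x0 = [0.7, -1.2, 0.4]
curl_grad = curl(lambda y: grad(f, y), x0)   # discrete analogue of curl grad f
div_curl = div(lambda y: curl(A, y), x0)     # discrete analogue of div curl A

print(curl_grad, div_curl)   # all entries should be close to zero
```

The cancellation seen here mirrors the symmetry of mixed partial derivatives, which is what the relation $d^2\omega = 0$ expresses in coordinates.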

The last equality means that the following relation holds between the Hamilton
operator $\nabla$ and the Laplacian $\Delta$:

$$\Delta = \nabla^2.$$


14.1.5 *Vector Operations in Curvilinear Coordinates 


a. 

Just as, for example, the sphere $x^2 + y^2 + z^2 = a^2$ has a particularly simple equation
$R = a$ in spherical coordinates, vector fields $x \mapsto \mathbf{A}(x)$ in $\mathbb{R}^3$ (or $\mathbb{R}^n$) often assume
a simpler expression in a coordinate system that is not Cartesian. For that reason we
now wish to find explicit formulas from which one can find grad, curl, and div in a
rather extensive class of curvilinear coordinates.

But first it is necessary to be precise as to what is meant by the coordinate ex- 
pression for a field A in a curvilinear coordinate system. 

We begin with two introductory examples of a descriptive character. 

Example 3 Suppose we have a fixed Cartesian coordinate system $x^1, x^2$ in the
Euclidean plane $\mathbb{R}^2$. When we say that a vector field $(A^1, A^2)(x)$ is defined in $\mathbb{R}^2$, we
mean that some vector $\mathbf{A}(x) \in T\mathbb{R}^2_x$ is connected with each point $x = (x^1, x^2) \in \mathbb{R}^2$,
and in the basis of $T\mathbb{R}^2_x$ consisting of the unit vectors $\mathbf{e}_1(x)$, $\mathbf{e}_2(x)$ in the coordinate
directions we have the expansion $\mathbf{A}(x) = A^1(x)\mathbf{e}_1(x) + A^2(x)\mathbf{e}_2(x)$ (see Fig. 14.1).
In this case the basis $\{\mathbf{e}_1(x), \mathbf{e}_2(x)\}$ of $T\mathbb{R}^2_x$ is essentially independent of $x$.

Fig. 14.1

Example 4 In the case when polar coordinates $(r, \varphi)$ are defined in the same
plane $\mathbb{R}^2$, at each point $x \in \mathbb{R}^2 \setminus 0$ one can also attach unit vectors $\mathbf{e}_1(x) = \mathbf{e}_r(x)$,
$\mathbf{e}_2(x) = \mathbf{e}_\varphi(x)$ (Fig. 14.2) in the coordinate directions. They also form a basis in $T\mathbb{R}^2_x$
with respect to which one can expand the vector $\mathbf{A}(x)$ of the field $\mathbf{A}$ attached to
$x$: $\mathbf{A}(x) = A^1(x)\mathbf{e}_1(x) + A^2(x)\mathbf{e}_2(x)$. It is then natural to regard the ordered pair of
functions $(A^1, A^2)(x)$ as the expression for the field $\mathbf{A}$ in polar coordinates.

Thus, if $(A^1, A^2)(x) = (1, 0)$, this is a field of unit vectors in $\mathbb{R}^2$ pointing radially
away from the center 0.

The field $(A^1, A^2)(x) = (0, 1)$ can be obtained from the preceding field by rotating
each vector in it counterclockwise by the angle $\pi/2$.

These are not constant fields in $\mathbb{R}^2$, although the components of their coordinate
representation are constant. The point is that the basis in which the expansion is
taken varies synchronously with the vector of the field in a transition from one point
to another.

It is clear that the components of the coordinate representation of these fields in
Cartesian coordinates would not be constant at all. On the other hand, a truly constant
field (consisting of a vector translated parallel to itself to all points of the plane),
which does have constant components in a Cartesian coordinate system, would have
variable components in polar coordinates.
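
The point of Example 4 is easy to see numerically. The sketch below is an illustration only (the function names are hypothetical and introduced here): it converts components between the polar basis $\mathbf{e}_r$, $\mathbf{e}_\varphi$ and the Cartesian basis at a given point, so that one can watch constant polar components produce varying Cartesian components, and conversely.

```python
import math

def polar_to_cartesian_components(a_r, a_phi, x, y):
    """Cartesian components of a_r*e_r + a_phi*e_phi at the point (x, y) != (0, 0)."""
    r = math.hypot(x, y)
    e_r = (x / r, y / r)        # unit vector pointing away from the origin
    e_phi = (-y / r, x / r)     # e_r rotated counterclockwise by pi/2
    return (a_r * e_r[0] + a_phi * e_phi[0],
            a_r * e_r[1] + a_phi * e_phi[1])

def cartesian_to_polar_components(a_x, a_y, x, y):
    """Components of the vector (a_x, a_y) in the basis e_r, e_phi at (x, y) != (0, 0)."""
    r = math.hypot(x, y)
    return ((a_x * x + a_y * y) / r, (-a_x * y + a_y * x) / r)

points = [(1.0, 0.0), (0.0, 2.0), (-3.0, 0.0)]

# The field with constant polar components (1, 0) has varying Cartesian components.
radial = [polar_to_cartesian_components(1.0, 0.0, x, y) for x, y in points]

# The truly constant field with Cartesian components (1, 0) has varying polar components.
constant = [cartesian_to_polar_components(1.0, 0.0, x, y) for x, y in points]

print(radial)
print(constant)
```

At the three sample points the radial field points along the positive $x^1$-axis, the positive $x^2$-axis, and the negative $x^1$-axis respectively, while the constant field is purely radial at the first point and purely tangential at the second.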


b. 

After these introductory considerations, let us consider more formally the problem 
of defining vector fields in curvilinear coordinate systems. 

We recall first of all that a curvilinear coordinate system $t^1, t^2, t^3$ in a domain
$D \subset \mathbb{R}^3$ is a diffeomorphism $\varphi : D_t \to D$ of a domain $D_t$ in the Euclidean parameter
space $\mathbb{R}^3_t$ onto the domain $D$, as a result of which each point $x = \varphi(t) \in D$ acquires
the Cartesian coordinates $t^1, t^2, t^3$ of the corresponding point $t \in D_t$.

Since $\varphi$ is a diffeomorphism, the tangent mapping $\varphi'(t) : T\mathbb{R}^3_t \to T\mathbb{R}^3_{x=\varphi(t)}$
is a vector-space isomorphism. To the canonical basis $\xi_1(t) = (1, 0, 0)$, $\xi_2(t) = (0, 1, 0)$,
$\xi_3(t) = (0, 0, 1)$ of $T\mathbb{R}^3_t$ corresponds the basis of $T\mathbb{R}^3_{x=\varphi(t)}$ consisting of
the vectors $\xi_i(x) = \varphi'(t)\xi_i(t) = \frac{\partial\varphi(t)}{\partial t^i}$, $i = 1, 2, 3$, giving the coordinate directions.
To the expansion $\mathbf{A}(x) = \alpha_1\xi_1(x) + \alpha_2\xi_2(x) + \alpha_3\xi_3(x)$ of any vector $\mathbf{A}(x) \in T\mathbb{R}^3_x$ in
this basis there corresponds the same expansion $\mathbf{A}(t) = \alpha_1\xi_1(t) + \alpha_2\xi_2(t) + \alpha_3\xi_3(t)$
(with the same components $\alpha_1, \alpha_2, \alpha_3$!) of the vector $\mathbf{A}(t) = (\varphi')^{-1}\mathbf{A}(x)$ in the
canonical basis $\xi_1(t), \xi_2(t), \xi_3(t)$ in $T\mathbb{R}^3_t$. In the absence of a Euclidean structure
in $\mathbb{R}^3$, the numbers $\alpha_1, \alpha_2, \alpha_3$ would be the most natural coordinate expression
for the vector $\mathbf{A}(x)$ connected with this curvilinear coordinate system.


c. 


However, adopting such a coordinate representation would not be quite consistent
with what we agreed to in Example 4. The point is that the basis $\xi_1(x), \xi_2(x), \xi_3(x)$
in $T\mathbb{R}^3_x$ corresponding to the canonical basis $\xi_1(t), \xi_2(t), \xi_3(t)$ in $T\mathbb{R}^3_t$, although
it consists of vectors in the coordinate directions, is not at all required to consist of
unit vectors in those directions, that is, in general $\langle \xi_i, \xi_i\rangle(x) \neq 1$.

We shall now take account of this circumstance, which results from the presence
of a Euclidean structure in $\mathbb{R}^3$ and consequently in each vector space $T\mathbb{R}^3_x$ also.

Because of the isomorphism $\varphi'(t) : T\mathbb{R}^3_t \to T\mathbb{R}^3_{x=\varphi(t)}$, we can transfer the Euclidean
structure of $T\mathbb{R}^3_x$ into $T\mathbb{R}^3_t$ by setting $\langle \tau_1, \tau_2\rangle := \langle \varphi'\tau_1, \varphi'\tau_2\rangle$ for every pair
of vectors $\tau_1, \tau_2 \in T\mathbb{R}^3_t$. In particular, we obtain from this the following expression
for the square of the length of a vector:

$$\langle \tau, \tau\rangle = \langle \varphi'(t)\tau, \varphi'(t)\tau\rangle = \Bigl\langle \frac{\partial\varphi}{\partial t^i}\tau^i, \frac{\partial\varphi}{\partial t^j}\tau^j\Bigr\rangle = \Bigl\langle \frac{\partial\varphi}{\partial t^i}, \frac{\partial\varphi}{\partial t^j}\Bigr\rangle(t)\,\tau^i\tau^j = \langle \xi_i, \xi_j\rangle(t)\,\tau^i\tau^j = g_{ij}(t)\,dt^i(\tau)\,dt^j(\tau).$$

The quadratic form

$$ds^2 = g_{ij}(t)\,dt^i\,dt^j,$$ (14.25)

whose coefficients are the pairwise inner products of the vectors in the canonical
basis, determines the inner product on $T\mathbb{R}^3_t$ completely. If such a form is defined
at each point of a domain $D_t \subset \mathbb{R}^3_t$, then, as is known from geometry, one says
that a Riemannian metric is defined in this domain. A Riemannian metric makes
it possible to introduce a Euclidean structure in each tangent space $T\mathbb{R}^3_t$ ($t \in D_t$)
within the context of rectilinear coordinates $t^1, t^2, t^3$ in $\mathbb{R}^3_t$, corresponding to the
“curved” embedding $\varphi : D_t \to D$ of the domain $D_t$ in the Euclidean space $\mathbb{R}^3$.

If the vectors $\xi_i(x) = \varphi'(t)\xi_i(t) = \frac{\partial\varphi}{\partial t^i}(t) \in T\mathbb{R}^3_x$, $i = 1, 2, 3$, are orthogonal in $T\mathbb{R}^3_x$, then
$g_{ij}(t) = 0$ for $i \neq j$. This means that we are dealing with a triorthogonal coordinate
grid. In terms of the space $T\mathbb{R}^3_t$ it means that the vectors $\xi_i(t)$, $i = 1, 2, 3$, in the
canonical basis are mutually orthogonal in the sense of the inner product in $T\mathbb{R}^3_t$
defined by the quadratic form (14.25). In what follows, for the sake of simplicity,
we shall consider only triorthogonal curvilinear coordinate systems. For them, as
has been noted, the quadratic form (14.25) has the following special form:

$$ds^2 = E_1(t)(dt^1)^2 + E_2(t)(dt^2)^2 + E_3(t)(dt^3)^2,$$ (14.26)

where $E_i(t) = g_{ii}(t)$, $i = 1, 2, 3$.


Example 5 In Cartesian coordinates $(x, y, z)$, cylindrical coordinates $(r, \varphi, z)$, and
spherical coordinates $(R, \varphi, \theta)$ on Euclidean space $\mathbb{R}^3$ the quadratic form (14.25)
has the respective forms

$$ds^2 = dx^2 + dy^2 + dz^2 =$$ (14.26')
$$= dr^2 + r^2\,d\varphi^2 + dz^2 =$$ (14.26'')
$$= dR^2 + R^2\cos^2\theta\,d\varphi^2 + R^2\,d\theta^2.$$ (14.26''')

Thus, each of these coordinate systems is a triorthogonal system in its domain of
definition.
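
The coefficients in (14.26'') and (14.26''') can be recovered numerically from the definition $g_{ij} = \langle \xi_i, \xi_j\rangle$ with $\xi_i = \partial\varphi/\partial t^i$. The following sketch rests on assumptions that should be kept in mind: the coordinate maps are written out explicitly here, with $\theta$ taken as the latitude (which is what the factor $\cos^2\theta$ in (14.26''') indicates), and the derivatives are approximated by central differences.

```python
import math

def cylindrical(t):
    r, phi, z = t
    return (r * math.cos(phi), r * math.sin(phi), z)

def spherical(t):
    # theta is measured from the equatorial plane (latitude), matching (14.26''').
    R, phi, theta = t
    return (R * math.cos(theta) * math.cos(phi),
            R * math.cos(theta) * math.sin(phi),
            R * math.sin(theta))

def coordinate_vectors(phi_map, t, h=1e-6):
    """Vectors xi_i = d(phi_map)/dt^i at t, approximated by central differences."""
    vectors = []
    for i in range(3):
        tp, tm = list(t), list(t)
        tp[i] += h
        tm[i] -= h
        xp, xm = phi_map(tp), phi_map(tm)
        vectors.append([(a - b) / (2 * h) for a, b in zip(xp, xm)])
    return vectors

def metric(phi_map, t):
    """Matrix g_ij = <xi_i, xi_j> of the quadratic form (14.25) at t."""
    xi = coordinate_vectors(phi_map, t)
    return [[sum(a * b for a, b in zip(xi[i], xi[j])) for j in range(3)]
            for i in range(3)]

g_cyl = metric(cylindrical, (2.0, 0.7, -1.0))   # expect diag(1, r^2, 1)
g_sph = metric(spherical, (2.0, 0.7, 0.3))      # expect diag(1, R^2 cos^2(theta), R^2)

for row in g_sph:
    print([round(v, 6) for v in row])
```

The off-diagonal entries come out close to zero, which is the numerical counterpart of the statement that both systems are triorthogonal.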

The vectors $\xi_1(t), \xi_2(t), \xi_3(t)$ of the canonical basis $(1, 0, 0)$, $(0, 1, 0)$, $(0, 0, 1)$
in $T\mathbb{R}^3_t$, like the vectors $\xi_i(x) \in T\mathbb{R}^3_x$ corresponding to them, have the following
norm:⁵ $|\xi_i| = \sqrt{g_{ii}}$. Hence the unit vectors (in the sense of the square-norm of a
vector) in the coordinate directions have the following coordinate representation for
the triorthogonal system (14.26):

$$\mathbf{e}_1(t) = \Bigl(\frac{1}{\sqrt{E_1}}, 0, 0\Bigr), \qquad \mathbf{e}_2(t) = \Bigl(0, \frac{1}{\sqrt{E_2}}, 0\Bigr), \qquad \mathbf{e}_3(t) = \Bigl(0, 0, \frac{1}{\sqrt{E_3}}\Bigr).$$ (14.27)

Example 6 It follows from formulas (14.27) and the results of Example 5 that for
Cartesian, cylindrical, and spherical coordinates, the triples of unit vectors along the
coordinate directions have respectively the following forms:

$$\mathbf{e}_x = (1, 0, 0), \qquad \mathbf{e}_y = (0, 1, 0), \qquad \mathbf{e}_z = (0, 0, 1);$$ (14.27')

$$\mathbf{e}_r = (1, 0, 0), \qquad \mathbf{e}_\varphi = \Bigl(0, \frac{1}{r}, 0\Bigr), \qquad \mathbf{e}_z = (0, 0, 1);$$ (14.27'')

$$\mathbf{e}_R = (1, 0, 0), \qquad \mathbf{e}_\varphi = \Bigl(0, \frac{1}{R\cos\theta}, 0\Bigr), \qquad \mathbf{e}_\theta = \Bigl(0, 0, \frac{1}{R}\Bigr).$$ (14.27''')

5 In the triorthogonal system (14.26) we have $|\xi_i| = \sqrt{E_i} = H_i$, $i = 1, 2, 3$. The quantities $H_1$, $H_2$,
$H_3$ are usually called the Lamé coefficients or Lamé parameters. G. Lamé (1795-1870) - French
engineer, mathematician, and physicist.

Examples 3 and 4 considered above assumed that the vector of the field was expanded
in a basis consisting of unit vectors along the coordinate directions. Hence
the vector $\mathbf{A}(t) \in T\mathbb{R}^3_t$ corresponding to the vector $\mathbf{A}(x) \in T\mathbb{R}^3_x$ of the field should
be expanded in the basis $\mathbf{e}_1(t), \mathbf{e}_2(t), \mathbf{e}_3(t)$ consisting of unit vectors in the coordinate
directions, rather than in the canonical basis $\xi_1(t), \xi_2(t), \xi_3(t)$.

Thus, abstracting from the original space $\mathbb{R}^3$, one can assume that a Riemannian
metric (14.25) or (14.26) and a vector field $t \mapsto \mathbf{A}(t)$ are defined in the domain
$D_t \subset \mathbb{R}^3_t$ and that the coordinate representation $(A^1, A^2, A^3)(t)$ of $\mathbf{A}(t)$ at each point
$t \in D_t$ is obtained from the expansion $\mathbf{A}(t) = A^i(t)\mathbf{e}_i(t)$ of the vector of the field
corresponding to this point with respect to unit vectors along the coordinate axes.


d. 

Let us now investigate forms. Under the diffeomorphism $\varphi : D_t \to D$ every form
in $D$ automatically transfers to the domain $D_t$. This transfer, as we know, occurs
at each point $x \in D$ from the space $T\mathbb{R}^3_x$ into the corresponding space $T\mathbb{R}^3_t$. Since
we have transferred the Euclidean structure into $T\mathbb{R}^3_t$ from $T\mathbb{R}^3_x$, it follows from
the definition of the transfer of vectors and forms that, for example, to a given form
$\omega^1_{\mathbf{A}}(x) = \langle \mathbf{A}(x), \cdot\rangle$ defined in $T\mathbb{R}^3_x$ there corresponds exactly the same kind of form
$\omega^1_{\mathbf{A}}(t) = \langle \mathbf{A}(t), \cdot\rangle$ in $T\mathbb{R}^3_t$, where $\mathbf{A}(x) = \varphi'(t)\mathbf{A}(t)$. The same can be said of forms
of the type $\omega^2_{\mathbf{B}}$ and $\omega^3_\rho$, to say nothing of forms $\omega^0_f$, that is, functions.

After these clarifications, the rest of our study can be confined to the domain
$D_t \subset \mathbb{R}^3_t$, abstracting from the original space $\mathbb{R}^3$ and assuming that a Riemannian
metric (14.25) is defined in $D_t$ and that scalar fields $f$, $\rho$ and vector fields $\mathbf{A}$, $\mathbf{B}$
are defined in $D_t$ along with the forms $\omega^0_f$, $\omega^1_{\mathbf{A}}$, $\omega^2_{\mathbf{B}}$, $\omega^3_\rho$, which are defined at each
point $t \in D_t$ in accordance with the Euclidean structure on $T\mathbb{R}^3_t$ defined by the
Riemannian metric.

Example 7 The volume element $dV$ in curvilinear coordinates $t^1, t^2, t^3$, as we
know, has the form

$$dV = \sqrt{\det g_{ij}}(t)\, dt^1 \wedge dt^2 \wedge dt^3.$$

For a triorthogonal system

$$dV = \sqrt{E_1 E_2 E_3}(t)\, dt^1 \wedge dt^2 \wedge dt^3. \tag{14.28}$$

In particular, in Cartesian, cylindrical, and spherical coordinates, respectively, we
obtain

\begin{align}
dV &= dx \wedge dy \wedge dz = \tag{14.28'} \\
   &= r\, dr \wedge d\varphi \wedge dz = \tag{14.28''} \\
   &= R^2 \cos\theta\, dR \wedge d\varphi \wedge d\theta. \tag{14.28'''}
\end{align}

What has just been said enables us to write the form $\omega^3_\rho = \rho\, dV$ in different
curvilinear coordinate systems.


e. 


Our main problem (now easily solvable) is, knowing the expansion $\mathbf{A}(t) =
A^i(t)\mathbf{e}_i(t)$ for a vector $\mathbf{A}(t) \in T\mathbb{R}^3_t$ with respect to the unit vectors $\mathbf{e}_i(t) \in T\mathbb{R}^3_t$,
$i = 1, 2, 3$, of the triorthogonal coordinate system determined by the Riemannian
metric (14.26), to find the expansion of the forms $\omega^1_{\mathbf{A}}(t)$ and $\omega^2_{\mathbf{B}}(t)$ in terms of the
canonical 1-forms $dt^i$ and the canonical 2-forms $dt^i \wedge dt^j$ respectively.

Since all the reasoning applies at every given point $t$, we shall abbreviate the
notation by suppressing the letter $t$ that shows that the vectors and forms are attached
to the tangent space at $t$.

Thus, $\mathbf{e}_1, \mathbf{e}_2, \mathbf{e}_3$ is a basis in $T\mathbb{R}^3_t$ consisting of the unit vectors (14.27) along the
coordinate directions, and $\mathbf{A} = A^1\mathbf{e}_1 + A^2\mathbf{e}_2 + A^3\mathbf{e}_3$ is the expansion of $\mathbf{A} \in T\mathbb{R}^3_t$
in that basis.

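We remark first of all that formula (14.27) implies that

$$dt^i(\mathbf{e}_j) = \frac{\delta^i_j}{\sqrt{E_i}}, \quad \text{where } \delta^i_j = \begin{cases} 0, & \text{if } i \ne j, \\ 1, & \text{if } i = j, \end{cases} \tag{14.29}$$

$$dt^i \wedge dt^j(\mathbf{e}_k, \mathbf{e}_l) = \frac{\delta^{ij}_{kl}}{\sqrt{E_i E_j}}, \quad \text{where } \delta^{ij}_{kl} = \begin{vmatrix} \delta^i_k & \delta^i_l \\ \delta^j_k & \delta^j_l \end{vmatrix}. \tag{14.30}$$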


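Thus, if $\omega^1_{\mathbf{A}} := \langle \mathbf{A}, \cdot\rangle = a_1\, dt^1 + a_2\, dt^2 + a_3\, dt^3$, then on the one hand

$$\omega^1_{\mathbf{A}}(\mathbf{e}_i) = \langle \mathbf{A}, \mathbf{e}_i\rangle = A^i,$$

and on the other hand, as one can see from (14.29),

$$\omega^1_{\mathbf{A}}(\mathbf{e}_i) = (a_1\, dt^1 + a_2\, dt^2 + a_3\, dt^3)(\mathbf{e}_i) = a_i \cdot \frac{1}{\sqrt{E_i}}.$$

Consequently, $a_i = A^i\sqrt{E_i}$, and we have found the expansion

$$\omega^1_{\mathbf{A}} = A^1\sqrt{E_1}\, dt^1 + A^2\sqrt{E_2}\, dt^2 + A^3\sqrt{E_3}\, dt^3 \tag{14.31}$$

for the form $\omega^1_{\mathbf{A}}$ corresponding to the expansion $\mathbf{A} = A^1\mathbf{e}_1 + A^2\mathbf{e}_2 + A^3\mathbf{e}_3$ of the
vector $\mathbf{A}$.

Example 8 In Cartesian, cylindrical, and spherical coordinates we have respectively

\begin{align*}
\mathbf{A} &= A_x\mathbf{e}_x + A_y\mathbf{e}_y + A_z\mathbf{e}_z = \\
  &= A_r\mathbf{e}_r + A_\varphi\mathbf{e}_\varphi + A_z\mathbf{e}_z = \\
  &= A_R\mathbf{e}_R + A_\varphi\mathbf{e}_\varphi + A_\theta\mathbf{e}_\theta,
\end{align*}

so that, as follows from formula (14.31) and the results of Example 6,

\begin{align}
\omega^1_{\mathbf{A}} &= A_x\, dx + A_y\, dy + A_z\, dz = \tag{14.31'} \\
  &= A_r\, dr + A_\varphi r\, d\varphi + A_z\, dz = \tag{14.31''} \\
  &= A_R\, dR + A_\varphi R\cos\theta\, d\varphi + A_\theta R\, d\theta. \tag{14.31'''}
\end{align}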


g- 


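Now let $\mathbf{B} = B^1\mathbf{e}_1 + B^2\mathbf{e}_2 + B^3\mathbf{e}_3$ and $\omega^2_{\mathbf{B}} = b_1\, dt^2 \wedge dt^3 + b_2\, dt^3 \wedge dt^1 + b_3\, dt^1 \wedge dt^2$.

Then, on the one hand,

$$\omega^2_{\mathbf{B}}(\mathbf{e}_2, \mathbf{e}_3) := dV(\mathbf{B}, \mathbf{e}_2, \mathbf{e}_3) = \sum_{i=1}^{3} B^i\, dV(\mathbf{e}_i, \mathbf{e}_2, \mathbf{e}_3) = B^1 \cdot (\mathbf{e}_1, \mathbf{e}_2, \mathbf{e}_3) = B^1,$$

where $dV$ is the volume element in $T\mathbb{R}^3_t$ (see (14.28) and (14.27)).

On the other hand, by (14.30) we obtain

$$\omega^2_{\mathbf{B}}(\mathbf{e}_2, \mathbf{e}_3) = (b_1\, dt^2 \wedge dt^3 + b_2\, dt^3 \wedge dt^1 + b_3\, dt^1 \wedge dt^2)(\mathbf{e}_2, \mathbf{e}_3) = b_1\, dt^2 \wedge dt^3(\mathbf{e}_2, \mathbf{e}_3) = \frac{b_1}{\sqrt{E_2 E_3}}.$$

Comparing these results, we conclude that $b_1 = B^1\sqrt{E_2 E_3}$. Similarly, we verify
that $b_2 = B^2\sqrt{E_1 E_3}$ and $b_3 = B^3\sqrt{E_1 E_2}$.

Thus we have found the representation

\begin{align}
\omega^2_{\mathbf{B}} &= B^1\sqrt{E_2 E_3}\, dt^2 \wedge dt^3 + B^2\sqrt{E_3 E_1}\, dt^3 \wedge dt^1 + B^3\sqrt{E_1 E_2}\, dt^1 \wedge dt^2 = \notag \\
  &= \sqrt{E_1 E_2 E_3}\Bigl(\frac{B^1}{\sqrt{E_1}}\, dt^2 \wedge dt^3 + \frac{B^2}{\sqrt{E_2}}\, dt^3 \wedge dt^1 + \frac{B^3}{\sqrt{E_3}}\, dt^1 \wedge dt^2\Bigr) \tag{14.32}
\end{align}

of the form $\omega^2_{\mathbf{B}}$ corresponding to the vector $\mathbf{B} = B^1\mathbf{e}_1 + B^2\mathbf{e}_2 + B^3\mathbf{e}_3$.

Example 9 Using the notation introduced in Example 8 and formulas (14.26'),
(14.26'') and (14.26'''), we obtain in Cartesian, cylindrical, and spherical coordinates
respectively

\begin{align}
\omega^2_{\mathbf{B}} &= B_x\, dy \wedge dz + B_y\, dz \wedge dx + B_z\, dx \wedge dy = \tag{14.32'} \\
  &= B_r r\, d\varphi \wedge dz + B_\varphi\, dz \wedge dr + B_z r\, dr \wedge d\varphi = \tag{14.32''} \\
  &= B_R R^2\cos\theta\, d\varphi \wedge d\theta + B_\varphi R\, d\theta \wedge dR + B_\theta R\cos\theta\, dR \wedge d\varphi. \tag{14.32'''}
\end{align}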


h. 


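We add further that on the basis of (14.28) we can write

$$\omega^3_\rho = \rho\sqrt{E_1 E_2 E_3}\, dt^1 \wedge dt^2 \wedge dt^3. \tag{14.33}$$

Example 10 In particular, for Cartesian, cylindrical, and spherical coordinates
respectively, formula (14.33) has the following forms:

\begin{align}
\omega^3_\rho &= \rho\, dx \wedge dy \wedge dz = \tag{14.33'} \\
  &= \rho r\, dr \wedge d\varphi \wedge dz = \tag{14.33''} \\
  &= \rho R^2\cos\theta\, dR \wedge d\varphi \wedge d\theta. \tag{14.33'''}
\end{align}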


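Now that we have obtained formulas (14.31)-(14.33), it is easy to find the coordinate
representation of the operators grad, curl, and div in a triorthogonal curvilinear
coordinate system using Definitions (14.9)-(14.11).

Let $\operatorname{grad} f = A^1\mathbf{e}_1 + A^2\mathbf{e}_2 + A^3\mathbf{e}_3$. Using the definitions, we write

$$\omega^1_{\operatorname{grad} f} := d\omega^0_f := df := \frac{\partial f}{\partial t^1}\, dt^1 + \frac{\partial f}{\partial t^2}\, dt^2 + \frac{\partial f}{\partial t^3}\, dt^3.$$

From this, using formula (14.31), we conclude that

$$\operatorname{grad} f = \frac{1}{\sqrt{E_1}}\frac{\partial f}{\partial t^1}\mathbf{e}_1 + \frac{1}{\sqrt{E_2}}\frac{\partial f}{\partial t^2}\mathbf{e}_2 + \frac{1}{\sqrt{E_3}}\frac{\partial f}{\partial t^3}\mathbf{e}_3. \tag{14.34}$$

Example 11 In Cartesian, cylindrical, and spherical coordinates respectively,

\begin{align}
\operatorname{grad} f &= \frac{\partial f}{\partial x}\mathbf{e}_x + \frac{\partial f}{\partial y}\mathbf{e}_y + \frac{\partial f}{\partial z}\mathbf{e}_z = \tag{14.34'} \\
  &= \frac{\partial f}{\partial r}\mathbf{e}_r + \frac{1}{r}\frac{\partial f}{\partial \varphi}\mathbf{e}_\varphi + \frac{\partial f}{\partial z}\mathbf{e}_z = \tag{14.34''} \\
  &= \frac{\partial f}{\partial R}\mathbf{e}_R + \frac{1}{R\cos\theta}\frac{\partial f}{\partial \varphi}\mathbf{e}_\varphi + \frac{1}{R}\frac{\partial f}{\partial \theta}\mathbf{e}_\theta. \tag{14.34'''}
\end{align}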
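Formula (14.34''') can be sanity-checked by comparing it with the Cartesian gradient projected onto the unit vectors $\mathbf{e}_R, \mathbf{e}_\varphi, \mathbf{e}_\theta$. The sketch below assumes the latitude-type chart $x = R\cos\theta\cos\varphi$, $y = R\cos\theta\sin\varphi$, $z = R\sin\theta$; the test function `f` and all helper names are hypothetical choices made only for illustration.

```python
import math

def chart(R, phi, theta):
    # Assumed latitude-type spherical chart, consistent with (14.27''').
    return (R * math.cos(theta) * math.cos(phi),
            R * math.cos(theta) * math.sin(phi),
            R * math.sin(theta))

def f(x, y, z):
    # Hypothetical test function.
    return x * y + z * z

def grad_f_cartesian(x, y, z):
    return (y, x, 2 * z)

def grad_spherical(R, phi, theta, h=1e-6):
    """Components of grad f in the basis e_R, e_phi, e_theta by formula (14.34''')."""
    g = lambda a, b, c: f(*chart(a, b, c))
    dR = (g(R + h, phi, theta) - g(R - h, phi, theta)) / (2 * h)
    dphi = (g(R, phi + h, theta) - g(R, phi - h, theta)) / (2 * h)
    dtheta = (g(R, phi, theta + h) - g(R, phi, theta - h)) / (2 * h)
    return (dR, dphi / (R * math.cos(theta)), dtheta / R)

def unit_vectors(phi, theta):
    """Cartesian components of the unit vectors e_R, e_phi, e_theta."""
    e_R = (math.cos(theta) * math.cos(phi),
           math.cos(theta) * math.sin(phi),
           math.sin(theta))
    e_phi = (-math.sin(phi), math.cos(phi), 0.0)
    e_theta = (-math.sin(theta) * math.cos(phi),
               -math.sin(theta) * math.sin(phi),
               math.cos(theta))
    return e_R, e_phi, e_theta

R, phi, theta = 1.5, 0.3, 0.5
from_formula = grad_spherical(R, phi, theta)
g_cart = grad_f_cartesian(*chart(R, phi, theta))
by_projection = tuple(sum(a * b for a, b in zip(g_cart, e))
                      for e in unit_vectors(phi, theta))
print(from_formula)
print(by_projection)
```

The two printed triples should agree up to the finite-difference error, which illustrates that (14.34''') describes the same vector as (14.34').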
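Suppose given a field $\mathbf{A}(t) = (A^1\mathbf{e}_1 + A^2\mathbf{e}_2 + A^3\mathbf{e}_3)(t)$. Let us find the coordinates
$B^1, B^2, B^3$ of the field $\operatorname{curl}\mathbf{A}(t) = \mathbf{B}(t) = (B^1\mathbf{e}_1 + B^2\mathbf{e}_2 + B^3\mathbf{e}_3)(t)$.

Based on the definition (14.10) and formula (14.31), we obtain

\begin{align*}
\omega^2_{\operatorname{curl}\mathbf{A}} := d\omega^1_{\mathbf{A}} &= d\bigl(A^1\sqrt{E_1}\, dt^1 + A^2\sqrt{E_2}\, dt^2 + A^3\sqrt{E_3}\, dt^3\bigr) = \\
  &= \Bigl(\frac{\partial (A^3\sqrt{E_3})}{\partial t^2} - \frac{\partial (A^2\sqrt{E_2})}{\partial t^3}\Bigr) dt^2 \wedge dt^3
   + \Bigl(\frac{\partial (A^1\sqrt{E_1})}{\partial t^3} - \frac{\partial (A^3\sqrt{E_3})}{\partial t^1}\Bigr) dt^3 \wedge dt^1 + \\
  &\quad + \Bigl(\frac{\partial (A^2\sqrt{E_2})}{\partial t^1} - \frac{\partial (A^1\sqrt{E_1})}{\partial t^2}\Bigr) dt^1 \wedge dt^2.
\end{align*}

On the basis of (14.32) we now conclude that

\begin{align*}
B^1 &= \frac{1}{\sqrt{E_2 E_3}}\Bigl(\frac{\partial (A^3\sqrt{E_3})}{\partial t^2} - \frac{\partial (A^2\sqrt{E_2})}{\partial t^3}\Bigr), \\
B^2 &= \frac{1}{\sqrt{E_3 E_1}}\Bigl(\frac{\partial (A^1\sqrt{E_1})}{\partial t^3} - \frac{\partial (A^3\sqrt{E_3})}{\partial t^1}\Bigr), \\
B^3 &= \frac{1}{\sqrt{E_1 E_2}}\Bigl(\frac{\partial (A^2\sqrt{E_2})}{\partial t^1} - \frac{\partial (A^1\sqrt{E_1})}{\partial t^2}\Bigr),
\end{align*}

that is,

$$\operatorname{curl}\mathbf{A} = \frac{1}{\sqrt{E_1 E_2 E_3}}
\begin{vmatrix}
\sqrt{E_1}\,\mathbf{e}_1 & \sqrt{E_2}\,\mathbf{e}_2 & \sqrt{E_3}\,\mathbf{e}_3 \\
\dfrac{\partial}{\partial t^1} & \dfrac{\partial}{\partial t^2} & \dfrac{\partial}{\partial t^3} \\
\sqrt{E_1}A^1 & \sqrt{E_2}A^2 & \sqrt{E_3}A^3
\end{vmatrix}. \tag{14.35}$$

Example 12 In Cartesian, cylindrical, and spherical coordinates respectively

\begin{align}
\operatorname{curl}\mathbf{A} &= \Bigl(\frac{\partial A_z}{\partial y} - \frac{\partial A_y}{\partial z}\Bigr)\mathbf{e}_x
  + \Bigl(\frac{\partial A_x}{\partial z} - \frac{\partial A_z}{\partial x}\Bigr)\mathbf{e}_y
  + \Bigl(\frac{\partial A_y}{\partial x} - \frac{\partial A_x}{\partial y}\Bigr)\mathbf{e}_z = \tag{14.35'} \\
 &= \frac{1}{r}\Bigl(\frac{\partial A_z}{\partial \varphi} - \frac{\partial (rA_\varphi)}{\partial z}\Bigr)\mathbf{e}_r
  + \Bigl(\frac{\partial A_r}{\partial z} - \frac{\partial A_z}{\partial r}\Bigr)\mathbf{e}_\varphi
  + \frac{1}{r}\Bigl(\frac{\partial (rA_\varphi)}{\partial r} - \frac{\partial A_r}{\partial \varphi}\Bigr)\mathbf{e}_z = \tag{14.35''} \\
 &= \frac{1}{R\cos\theta}\Bigl(\frac{\partial A_\theta}{\partial \varphi} - \frac{\partial (A_\varphi\cos\theta)}{\partial \theta}\Bigr)\mathbf{e}_R
  + \frac{1}{R}\Bigl(\frac{\partial A_R}{\partial \theta} - \frac{\partial (RA_\theta)}{\partial R}\Bigr)\mathbf{e}_\varphi + \notag \\
 &\quad + \frac{1}{R}\Bigl(\frac{\partial (RA_\varphi)}{\partial R} - \frac{1}{\cos\theta}\frac{\partial A_R}{\partial \varphi}\Bigr)\mathbf{e}_\theta. \tag{14.35'''}
\end{align}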
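Now suppose given a field $\mathbf{B}(t) = (B^1\mathbf{e}_1 + B^2\mathbf{e}_2 + B^3\mathbf{e}_3)(t)$. Let us find an expression
for $\operatorname{div}\mathbf{B}$.

Starting from the definition (14.11) and formula (14.32), we obtain

\begin{align*}
\omega^3_{\operatorname{div}\mathbf{B}} := d\omega^2_{\mathbf{B}} &= d\bigl(B^1\sqrt{E_2 E_3}\, dt^2 \wedge dt^3 + B^2\sqrt{E_3 E_1}\, dt^3 \wedge dt^1 + B^3\sqrt{E_1 E_2}\, dt^1 \wedge dt^2\bigr) = \\
 &= \Bigl(\frac{\partial (\sqrt{E_2 E_3}\,B^1)}{\partial t^1} + \frac{\partial (\sqrt{E_3 E_1}\,B^2)}{\partial t^2} + \frac{\partial (\sqrt{E_1 E_2}\,B^3)}{\partial t^3}\Bigr) dt^1 \wedge dt^2 \wedge dt^3.
\end{align*}

On the basis of formula (14.33) we now conclude that

$$\operatorname{div}\mathbf{B} = \frac{1}{\sqrt{E_1 E_2 E_3}}\Bigl(\frac{\partial (\sqrt{E_2 E_3}\,B^1)}{\partial t^1} + \frac{\partial (\sqrt{E_3 E_1}\,B^2)}{\partial t^2} + \frac{\partial (\sqrt{E_1 E_2}\,B^3)}{\partial t^3}\Bigr). \tag{14.36}$$

In Cartesian, cylindrical, and spherical coordinates respectively, we obtain

\begin{align}
\operatorname{div}\mathbf{B} &= \frac{\partial B_x}{\partial x} + \frac{\partial B_y}{\partial y} + \frac{\partial B_z}{\partial z} = \tag{14.36'} \\
 &= \frac{1}{r}\Bigl(\frac{\partial (rB_r)}{\partial r} + \frac{\partial B_\varphi}{\partial \varphi}\Bigr) + \frac{\partial B_z}{\partial z} = \tag{14.36''} \\
 &= \frac{1}{R^2\cos\theta}\Bigl(\frac{\partial (R^2\cos\theta\, B_R)}{\partial R} + \frac{\partial (RB_\varphi)}{\partial \varphi} + \frac{\partial (R\cos\theta\, B_\theta)}{\partial \theta}\Bigr). \tag{14.36'''}
\end{align}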

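Relations (14.34) and (14.36) can be used to obtain an expression for the Laplacian
$\Delta = \operatorname{div}\operatorname{grad}$ in an arbitrary triorthogonal coordinate system:

\begin{align}
\Delta f = \operatorname{div}\operatorname{grad} f &= \operatorname{div}\Bigl(\frac{1}{\sqrt{E_1}}\frac{\partial f}{\partial t^1}\mathbf{e}_1 + \frac{1}{\sqrt{E_2}}\frac{\partial f}{\partial t^2}\mathbf{e}_2 + \frac{1}{\sqrt{E_3}}\frac{\partial f}{\partial t^3}\mathbf{e}_3\Bigr) = \notag \\
 &= \frac{1}{\sqrt{E_1 E_2 E_3}}\Bigl(\frac{\partial}{\partial t^1}\Bigl(\sqrt{\frac{E_2 E_3}{E_1}}\frac{\partial f}{\partial t^1}\Bigr)
  + \frac{\partial}{\partial t^2}\Bigl(\sqrt{\frac{E_3 E_1}{E_2}}\frac{\partial f}{\partial t^2}\Bigr)
  + \frac{\partial}{\partial t^3}\Bigl(\sqrt{\frac{E_1 E_2}{E_3}}\frac{\partial f}{\partial t^3}\Bigr)\Bigr). \tag{14.37}
\end{align}

Example 13 In particular, for Cartesian, cylindrical, and spherical coordinates, we
obtain respectively

\begin{align}
\Delta f &= \frac{\partial^2 f}{\partial x^2} + \frac{\partial^2 f}{\partial y^2} + \frac{\partial^2 f}{\partial z^2} = \tag{14.37'} \\
 &= \frac{1}{r}\frac{\partial}{\partial r}\Bigl(r\frac{\partial f}{\partial r}\Bigr) + \frac{1}{r^2}\frac{\partial^2 f}{\partial \varphi^2} + \frac{\partial^2 f}{\partial z^2} = \tag{14.37''} \\
 &= \frac{1}{R^2}\frac{\partial}{\partial R}\Bigl(R^2\frac{\partial f}{\partial R}\Bigr) + \frac{1}{R^2\cos^2\theta}\frac{\partial^2 f}{\partial \varphi^2} + \frac{1}{R^2\cos\theta}\frac{\partial}{\partial \theta}\Bigl(\cos\theta\frac{\partial f}{\partial \theta}\Bigr). \tag{14.37'''}
\end{align}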
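As a plausibility check of (14.37'''), one can discretize its three terms with second-order differences and compare with a Laplacian known in Cartesian coordinates. This is a sketch only: the chart is the assumed latitude-type chart $x = R\cos\theta\cos\varphi$, $y = R\cos\theta\sin\varphi$, $z = R\sin\theta$, and the test function $f = x^2 + yz + z^3$ (with $\Delta f = 2 + 6z$) is a hypothetical choice.

```python
import math

def chart(R, phi, theta):
    # Assumed latitude-type spherical chart.
    return (R * math.cos(theta) * math.cos(phi),
            R * math.cos(theta) * math.sin(phi),
            R * math.sin(theta))

def f(x, y, z):
    # Hypothetical test function; its Cartesian Laplacian is 2 + 6z.
    return x * x + y * z + z ** 3

def laplacian_spherical(g, R, phi, theta, h=1e-3):
    """Second-order finite-difference version of the three terms of (14.37''')."""
    g0 = g(R, phi, theta)
    radial = ((R + h / 2) ** 2 * (g(R + h, phi, theta) - g0)
              - (R - h / 2) ** 2 * (g0 - g(R - h, phi, theta))) / (h * h * R * R)
    azimuthal = (g(R, phi + h, theta) - 2 * g0 + g(R, phi - h, theta)) / (
        h * h * R * R * math.cos(theta) ** 2)
    polar = (math.cos(theta + h / 2) * (g(R, phi, theta + h) - g0)
             - math.cos(theta - h / 2) * (g0 - g(R, phi, theta - h))) / (
        h * h * R * R * math.cos(theta))
    return radial + azimuthal + polar

R, phi, theta = 1.5, 0.3, 0.5
g = lambda a, b, c: f(*chart(a, b, c))
numeric = laplacian_spherical(g, R, phi, theta)
x, y, z = chart(R, phi, theta)
exact = 2 + 6 * z
print(numeric, exact)
```

The step `h = 1e-3` is a compromise between truncation error and rounding error in the second differences; other values may work equally well.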


14.1.6 Problems and Exercises 

1. The operators grad, curl, and div and the algebraic operations. 

Verify the following relations: 

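for grad:

a) $\nabla(f + g) = \nabla f + \nabla g$,

b) $\nabla(f \cdot g) = f\nabla g + g\nabla f$,

c) $\nabla(\mathbf{A} \cdot \mathbf{B}) = (\mathbf{B} \cdot \nabla)\mathbf{A} + (\mathbf{A} \cdot \nabla)\mathbf{B} + \mathbf{B} \times (\nabla \times \mathbf{A}) + \mathbf{A} \times (\nabla \times \mathbf{B})$,

d) $\nabla\bigl(\tfrac{1}{2}\mathbf{A}^2\bigr) = (\mathbf{A} \cdot \nabla)\mathbf{A} + \mathbf{A} \times (\nabla \times \mathbf{A})$;

for curl:

e) $\nabla \times (f\mathbf{A}) = f\nabla \times \mathbf{A} + \nabla f \times \mathbf{A}$,

f) $\nabla \times (\mathbf{A} \times \mathbf{B}) = (\mathbf{B} \cdot \nabla)\mathbf{A} - (\mathbf{A} \cdot \nabla)\mathbf{B} + (\nabla \cdot \mathbf{B})\mathbf{A} - (\nabla \cdot \mathbf{A})\mathbf{B}$;

for div:

g) $\nabla \cdot (f\mathbf{A}) = \nabla f \cdot \mathbf{A} + f\nabla \cdot \mathbf{A}$,

h) $\nabla \cdot (\mathbf{A} \times \mathbf{B}) = \mathbf{B} \cdot (\nabla \times \mathbf{A}) - \mathbf{A} \cdot (\nabla \times \mathbf{B})$

and rewrite them in the symbols grad, curl, and div.

(Hints. $\mathbf{A} \cdot \nabla = A^1\frac{\partial}{\partial x^1} + A^2\frac{\partial}{\partial x^2} + A^3\frac{\partial}{\partial x^3}$; $\mathbf{B} \cdot \nabla \ne \nabla \cdot \mathbf{B}$; $\mathbf{A} \times (\mathbf{B} \times \mathbf{C}) = \mathbf{B}(\mathbf{A} \cdot \mathbf{C}) - \mathbf{C}(\mathbf{A} \cdot \mathbf{B})$.)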

2. a) Write the operators (14.20)-(14.22) in Cartesian coordinates. 

b) Verify relations (14.20) and (14.21) by direct computation. 

c) Verify formula (14.24) in Cartesian coordinates. 

d) Write formula (14.24) in terms of V and prove it, using the formulas of vector 
algebra. 

3. From the system of Maxwell equations in Example 2 deduce that $\nabla \cdot \mathbf{j} = -\frac{\partial \rho}{\partial t}$.

4. a) Exhibit the Lamé parameters $H_1, H_2, H_3$ of Cartesian, cylindrical, and spherical
coordinates in $\mathbb{R}^3$.

b) Rewrite formulas (14.28), (14.34)-(14.37), using the Lamé parameters.

5. Write the field $\mathbf{A} = \operatorname{grad}\frac{1}{r}$, where $r = \sqrt{x^2 + y^2 + z^2}$, in

a) Cartesian coordinates $x, y, z$;

b) cylindrical coordinates;

c) spherical coordinates.

d) Find $\operatorname{curl}\mathbf{A}$ and $\operatorname{div}\mathbf{A}$.

6. In cylindrical coordinates $(r, \varphi, z)$ the function $f$ has the form $\ln\frac{1}{r}$. Write the
field $\mathbf{A} = \operatorname{grad} f$ in

a) Cartesian coordinates; 

b) cylindrical coordinates; 

c) spherical coordinates. 

d) Find curl A and divA. 

7. Write the formula for transformation of coordinates in a fixed tangent space
$T\mathbb{R}^3_p$, $p \in \mathbb{R}^3$, when passing from Cartesian coordinates in $\mathbb{R}^3$ to

a) cylindrical coordinates; 

b) spherical coordinates; 

c) an arbitrary triorthogonal curvilinear coordinate system. 

d) Applying the formulas obtained in c) and formulas (14.34)-(14.37), verify
directly that the vector fields $\operatorname{grad} f$, $\operatorname{curl}\mathbf{A}$, and the quantities $\operatorname{div}\mathbf{A}$ and $\Delta f$ are
invariant relative to the choice of the coordinate system in which they are computed.

8. The space $\mathbb{R}^3$, being a rigid body, revolves about a certain axis with constant
angular velocity $\omega$. Let $\mathbf{v}$ be the field of linear velocities of the points at a fixed
instant of time.

a) Write the field $\mathbf{v}$ in the corresponding cylindrical coordinates.

b) Find $\operatorname{curl}\mathbf{v}$.

c) Indicate how the field $\operatorname{curl}\mathbf{v}$ is directed relative to the axis of rotation.

d) Verify that $|\operatorname{curl}\mathbf{v}| = 2\omega$ at each point of space.

e) Interpret the geometric meaning of $\operatorname{curl}\mathbf{v}$ and the geometric meaning of the
constancy of this vector at all points of space for the situation in d).


14.2 The Integral Formulas of Field Theory 

14.2.1 The Classical Integral Formulas in Vector Notation 

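a. Vector Notation for the Forms $\omega^1_{\mathbf{A}}$ and $\omega^2_{\mathbf{B}}$

In the preceding chapter we noted (see Sect. 13.2, formulas (13.23) and (13.24)) that
the restriction of the work form $\omega^1_{\mathbf{F}}$ of a field $\mathbf{F}$ to an oriented smooth curve (path)
$\gamma$ or the restriction of the flux form $\omega^2_{\mathbf{V}}$ of a field $\mathbf{V}$ to an oriented surface $S$ can be
written respectively in the following forms:

$$\omega^1_{\mathbf{F}}\big|_\gamma = \langle \mathbf{F}, \mathbf{e}\rangle\, ds, \qquad \omega^2_{\mathbf{V}}\big|_S = \langle \mathbf{V}, \mathbf{n}\rangle\, d\sigma,$$

where $\mathbf{e}$ is the unit vector that orients $\gamma$, codirectional with the velocity vector of the
motion along $\gamma$, $ds$ is the element (form) of arc length on $\gamma$, $\mathbf{n}$ is the unit normal
vector to $S$ that orients the surface, and $d\sigma$ is the element (form) of area on $S$.

In vector analysis we often use the vector element of length of a curve $d\mathbf{s} := \mathbf{e}\, ds$
and the vector element of area on a surface $d\boldsymbol{\sigma} := \mathbf{n}\, d\sigma$. Using this notation, we can
now write:

$$\omega^1_{\mathbf{A}}\big|_\gamma = \langle \mathbf{A}, \mathbf{e}\rangle\, ds = \langle \mathbf{A}, d\mathbf{s}\rangle = \mathbf{A} \cdot d\mathbf{s}, \tag{14.38}$$

$$\omega^2_{\mathbf{B}}\big|_S = \langle \mathbf{B}, \mathbf{n}\rangle\, d\sigma = \langle \mathbf{B}, d\boldsymbol{\sigma}\rangle = \mathbf{B} \cdot d\boldsymbol{\sigma}. \tag{14.39}$$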


b. The Newton-Leibniz Formula 


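Let $f \in C^{(1)}(D, \mathbb{R})$, and let $\gamma : [a, b] \to D$ be a path in the domain $D$.
Applied to the 0-form $\omega^0_f$, Stokes' formula

$$\int_{\partial\gamma} \omega^0_f = \int_\gamma d\omega^0_f$$

means, on the one hand, the equality

$$\int_{\partial\gamma} f = \int_\gamma df,$$

which agrees with the classical formula

$$f(\gamma(b)) - f(\gamma(a)) = \int_a^b df(\gamma(t))$$

of Newton-Leibniz (the fundamental theorem of calculus). On the other hand, by
definition of the gradient, it means that

$$\int_{\partial\gamma} \omega^0_f = \int_\gamma \omega^1_{\operatorname{grad} f}. \tag{14.40}$$

Thus, using relation (14.38), we can rewrite the Newton-Leibniz formula as

$$f(\gamma(b)) - f(\gamma(a)) = \int_\gamma (\operatorname{grad} f) \cdot d\mathbf{s}. \tag{14.40'}$$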


In this form it means that 

the increment of a function on a path equals the work done by the gradient of the function 
on the path. 

This is a very convenient and informative notation. In addition to the obvious
deduction that the work of the field $\operatorname{grad} f$ along a path $\gamma$ depends only on the
endpoints of the path, the formula enables us to make a somewhat more subtle
observation. To be specific, motion over a level surface $f = c$ of $f$ takes place without
any work being done by the field $\operatorname{grad} f$, since in this case $\operatorname{grad} f \cdot d\mathbf{s} = 0$. Then,
as the left-hand side of the formula shows, the work of the field $\operatorname{grad} f$ depends not
even on the initial and final points of the path but only on the level surfaces of $f$ to
which they belong.
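The Newton-Leibniz formula (14.40') lends itself to a quick numerical illustration. In the sketch below the scalar field `f`, the helical path `gamma`, and the midpoint rule with `n` steps are all hypothetical choices made for the example; the work of $\operatorname{grad} f$ along the path is compared with the increment of $f$ between the endpoints.

```python
import math

def f(x, y, z):
    return x * y + z * z                     # hypothetical scalar field

def grad_f(x, y, z):
    return (y, x, 2 * z)                     # its gradient, computed by hand

def gamma(t):
    return (math.cos(t), math.sin(t), t)     # hypothetical path: a helix

def gamma_velocity(t):
    return (-math.sin(t), math.cos(t), 1.0)  # derivative of gamma

def work_of_gradient(a, b, n=2000):
    """Midpoint-rule approximation of the integral of (grad f) . ds along gamma."""
    step = (b - a) / n
    total = 0.0
    for k in range(n):
        t = a + (k + 0.5) * step
        integrand = sum(g * v for g, v in zip(grad_f(*gamma(t)), gamma_velocity(t)))
        total += integrand * step
    return total

a, b = 0.0, 2.0
work = work_of_gradient(a, b)
increment = f(*gamma(b)) - f(*gamma(a))
print(work, increment)
```

Replacing the helix by any other smooth path with the same endpoints should leave `increment` unchanged and `work` equal to it up to quadrature error.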


c. Stokes’ Formula 

We recall that the work of a field on a closed path is called the circulation of the field
on that path. To indicate that the integral is taken over a closed path, we often write
$\oint_\gamma \mathbf{F} \cdot d\mathbf{s}$ rather than the traditional notation $\int_\gamma \mathbf{F} \cdot d\mathbf{s}$. If $\gamma$ is a curve in the plane, we
often use the symbols $\ointctrclockwise_\gamma$ and $\ointclockwise_\gamma$, in which the direction of traversal of the curve $\gamma$
is indicated.

The term circulation is also used when speaking of the integral over some finite 
set of closed curves. For example, it might be the integral over the boundary of a 
compact surface with boundary. 

Let $\mathbf{A}$ be a smooth vector field in a domain $D$ of the oriented Euclidean space $\mathbb{R}^3$
and $S$ a (piecewise) smooth oriented compact surface with boundary in $D$. Applied
to the 1-form $\omega^1_{\mathbf{A}}$, taking account of the definition of the curl of a vector field, Stokes'
formula means the equality

$$\int_{\partial S} \omega^1_{\mathbf{A}} = \int_S \omega^2_{\operatorname{curl}\mathbf{A}}. \tag{14.41}$$

Using relations (14.38) and (14.39), we can rewrite (14.41) as the classical Stokes formula

$$\oint_{\partial S} \mathbf{A} \cdot d\mathbf{s} = \iint_S (\operatorname{curl}\mathbf{A}) \cdot d\boldsymbol{\sigma}. \tag{14.41'}$$

In this notation it means that

the circulation of a vector field on the boundary of a surface equals the flux of the curl of
the field across the surface.

As always, the orientation chosen on $\partial S$ is the one induced by the orientation
of $S$.



d. The Gauss-Ostrogradskii Formula 

Let $V$ be a compact domain of the oriented Euclidean space $\mathbb{R}^3$ bounded by a
(piecewise-) smooth surface $\partial V$, the boundary of $V$. If $\mathbf{B}$ is a smooth field in $V$,
then in accordance with the definition of the divergence of a field, Stokes' formula
yields the equality

$$\int_{\partial V} \omega^2_{\mathbf{B}} = \int_V \omega^3_{\operatorname{div}\mathbf{B}}. \tag{14.42}$$

Using relation (14.39) and the notation $\rho\, dV$ for the form $\omega^3_\rho$ in terms of the
volume element $dV$ in $\mathbb{R}^3$, we can rewrite Eq. (14.42) as the classical
Gauss-Ostrogradskii formula

$$\iint_{\partial V} \mathbf{B} \cdot d\boldsymbol{\sigma} = \iiint_V \operatorname{div}\mathbf{B}\, dV. \tag{14.42'}$$


In this form it means that 

the flux of a vector field across the boundary of a domain equals 
the integral of the divergence of the field over the domain itself. 


e. Summary of the Classical Integral Formulas 

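In sum, we have arrived at the following vector notation for the three classical
integral formulas of analysis:

\begin{align}
\int_{\partial\gamma} f &= \int_\gamma (\nabla f) \cdot d\mathbf{s} && \text{(the Newton-Leibniz formula)}, \tag{14.40''} \\
\int_{\partial S} \mathbf{A} \cdot d\mathbf{s} &= \int_S (\nabla \times \mathbf{A}) \cdot d\boldsymbol{\sigma} && \text{(Stokes' formula)}, \tag{14.41''} \\
\int_{\partial V} \mathbf{B} \cdot d\boldsymbol{\sigma} &= \int_V (\nabla \cdot \mathbf{B})\, dV && \text{(the Gauss-Ostrogradskii formula)}. \tag{14.42''}
\end{align}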


14.2.2 The Physical Interpretation of div, curl, and grad 


a. The Divergence 


Formula (14.42') can be used to explain the physical meaning of $\operatorname{div}\mathbf{B}(x)$, the
divergence of the vector field $\mathbf{B}$ at a point $x$ in the domain $V$ in which the field is
defined. Let $V(x)$ be a neighborhood of $x$ (for example, a ball) contained in $V$. We
permit ourselves to denote the volume of this neighborhood by the same symbol
$V(x)$ and its diameter by the letter $d$.

By the mean-value theorem for the triple integral and formula (14.42') we obtain
the relation

$$\iint_{\partial V(x)} \mathbf{B} \cdot d\boldsymbol{\sigma} = \operatorname{div}\mathbf{B}(x')\, V(x),$$

where $x'$ is a point in the neighborhood $V(x)$. If $d \to 0$, then $x' \to x$, and since $\mathbf{B}$
is a smooth field, we also have $\operatorname{div}\mathbf{B}(x') \to \operatorname{div}\mathbf{B}(x)$. Hence

$$\operatorname{div}\mathbf{B}(x) = \lim_{d \to 0} \frac{\iint_{\partial V(x)} \mathbf{B} \cdot d\boldsymbol{\sigma}}{V(x)}. \tag{14.43}$$
Let us regard $\mathbf{B}$ as the velocity field for a flow (of liquid or gas). Then, by the
law of conservation of mass, a flux of this field across the boundary of the domain
$V$ or, what is the same, a volume of the medium diverging across the boundary of
the domain, can arise only when there are sinks or sources (including those associated
with a change in the density of the medium). The flux is equal to the total
power of all these factors, which we shall collectively call “sources”, in the domain
$V(x)$. Hence the fraction on the right-hand side of (14.43) is the mean intensity (per
unit volume) of sources in the domain $V(x)$, and the limit of that quantity, that is,
$\operatorname{div}\mathbf{B}(x)$, is the specific intensity (per unit volume) of the source at the point $x$. But
the limit of the ratio of the total amount of some quantity in the domain $V(x)$ to
the volume of that domain as $d \to 0$ is customarily called the density of that quantity
at $x$, and the density as a function of a point is usually called the density of the
distribution of the given quantity in a portion of space.

Thus, we can interpret the divergence divB of a vector field B as the density 
of the distribution of sources in the domain of the flow, that is, in the domain of 
definition of the field B. 
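The limit (14.43) can be observed numerically. The following sketch makes several assumptions of its own: the neighborhood $V(x)$ is taken to be a cube of side `d` rather than a ball (any shrinking neighborhood serves the same purpose), the field `B` is a hypothetical polynomial field with $\operatorname{div}\mathbf{B} = 3x^2 + x + 1$, and the face integrals are computed with the midpoint rule.

```python
def B(x, y, z):
    # Hypothetical field with div B = 3x^2 + x + 1.
    return (x ** 3, x * y, z)

def flux_through_cube(field, center, d, n=20):
    """Outward flux of `field` across the boundary of the cube of side d centered
    at `center`, using the midpoint rule with n*n cells on each face."""
    h = d / n
    flux = 0.0
    for axis in range(3):
        u, v = [i for i in range(3) if i != axis]
        for sign in (1.0, -1.0):          # the two faces orthogonal to this axis
            for i in range(n):
                for j in range(n):
                    p = list(center)
                    p[axis] += sign * d / 2
                    p[u] += -d / 2 + (i + 0.5) * h
                    p[v] += -d / 2 + (j + 0.5) * h
                    flux += sign * field(*p)[axis] * h * h
    return flux

center = (0.5, -0.2, 1.0)
exact = 3 * center[0] ** 2 + center[0] + 1
for d in (1e-1, 1e-2):
    print(d, flux_through_cube(B, center, d) / d ** 3, exact)

d_small = 1e-2
estimate = flux_through_cube(B, center, d_small) / d_small ** 3
```

For this particular field the ratio differs from $\operatorname{div}\mathbf{B}(x)$ by a term of order $d^2$, so the printed values approach the exact divergence as the cube shrinks.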

Example 1 If, in particular, div B = 0, that is, there are no sources, then the flux 
across the boundary of the region must be zero: the amount flowing in equals the 
amount flowing out. And, as formula (14.42') shows, this is indeed the case. 

Example 2 A point electric charge of magnitude $q$ creates an electric field in space.
Suppose the charge is located at the origin. By Coulomb's law$^6$ the intensity $\mathbf{E} =
\mathbf{E}(x)$ of the field at the point $x \in \mathbb{R}^3$ (that is, the force acting on a unit test charge at
the point $x$) can be written as

$$\mathbf{E} = \frac{q}{4\pi\varepsilon_0}\frac{\mathbf{r}}{|\mathbf{r}|^3},$$

where $\varepsilon_0$ is a dimensioning constant and $\mathbf{r}$ is the radius-vector of the point $x$.

The field $\mathbf{E}$ is defined at all points different from the origin. In spherical coordinates
$\mathbf{E} = \frac{q}{4\pi\varepsilon_0}\frac{1}{R^2}\mathbf{e}_R$, so that by formula (14.36''') of the preceding section, one
can see immediately that $\operatorname{div}\mathbf{E} = 0$ everywhere in the domain of definition of the
field $\mathbf{E}$.

Hence, if we take any domain $V$ not containing the origin, then by formula
(14.42') the flux of $\mathbf{E}$ across the boundary $\partial V$ of $V$ is zero.

Let us now take the sphere $S_R = \{x \in \mathbb{R}^3 \mid |x| = R\}$ of radius $R$ with center at
the origin and find the outward flux (relative to the ball bounded by the sphere) of
$\mathbf{E}$ across this surface. Since the vector $\mathbf{e}_R$ is itself the unit outward normal to the
sphere, we find

$$\int_{S_R} \mathbf{E} \cdot d\boldsymbol{\sigma} = \int_{S_R} \frac{q}{4\pi\varepsilon_0}\frac{1}{R^2}\, d\sigma = \frac{q}{4\pi\varepsilon_0 R^2} \cdot 4\pi R^2 = \frac{q}{\varepsilon_0}.$$

Thus, up to the dimensioning constant $\varepsilon_0$, which depends on the choice of the
system of physical units, we have found the amount of charge in the volume bounded
by the sphere.

$^6$ Ch.O. Coulomb (1736-1806) - French physicist. He discovered experimentally the law
(Coulomb's law) of interaction of charges and magnetic fields using a torsion balance that he
invented himself.
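The two flux computations of Example 2 can be reproduced numerically. The sketch below works in hypothetical units with $q = \varepsilon_0 = 1$, parametrizes a sphere by longitude $\varphi$ and latitude $\theta$ (so that $d\sigma = R^2\cos\theta\, d\varphi\, d\theta$), and uses the midpoint rule; the function names and the grid size `n` are illustrative.

```python
import math

Q, EPS0 = 1.0, 1.0     # hypothetical units in which q = eps_0 = 1

def E(x, y, z):
    """Coulomb field of a point charge Q at the origin."""
    r3 = (x * x + y * y + z * z) ** 1.5
    c = Q / (4 * math.pi * EPS0)
    return (c * x / r3, c * y / r3, c * z / r3)

def flux_through_sphere(center, radius, n=200):
    """Outward flux of E across a sphere: midpoint rule in longitude and latitude."""
    d_phi, d_theta = 2 * math.pi / n, math.pi / n
    flux = 0.0
    for i in range(n):
        theta = -math.pi / 2 + (i + 0.5) * d_theta
        for j in range(n):
            phi = (j + 0.5) * d_phi
            normal = (math.cos(theta) * math.cos(phi),
                      math.cos(theta) * math.sin(phi),
                      math.sin(theta))
            point = [c + radius * nk for c, nk in zip(center, normal)]
            e_dot_n = sum(a * b for a, b in zip(E(*point), normal))
            flux += e_dot_n * radius ** 2 * math.cos(theta) * d_phi * d_theta
    return flux

enclosing = flux_through_sphere((0.0, 0.0, 0.0), 2.0)   # sphere around the charge
excluding = flux_through_sphere((3.0, 0.0, 0.0), 1.0)   # sphere not containing the origin
print(enclosing, excluding)
```

The first value should be close to $q/\varepsilon_0$ regardless of the radius chosen, and the second close to zero, in agreement with the discussion of (14.42') above.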

We remark that under the hypotheses of Example 2 just studied the left-hand
side of formula (14.42') is well-defined on the sphere $\partial V = S_R$, but the integrand
on the right-hand side is defined and equal to zero everywhere in the ball $V$ except
at one point, the origin. Nevertheless, the computations show that the integral on
the right-hand side of (14.42') cannot be interpreted as the integral of a function that
is identically zero.

From the formal point of view one could dismiss the need to study this situation
by saying that the field $\mathbf{E}$ is not defined at the point $0 \in V$, and hence we do not have
the right to speak about the equality (14.42'), which was proved for smooth fields
defined in the entire domain $V$ of integration. However, the physical interpretation
of (14.42') as the law of conservation of mass shows that, when suitably interpreted,
it ought to be valid always.

Let us study the indeterminacy of $\operatorname{div}\mathbf{E}$ at the origin in Example 2 more attentively
to see what is causing it. Formally the original field $\mathbf{E}$ is not defined at the
origin, but, if we seek $\operatorname{div}\mathbf{E}$ from formula (14.43), then, as Example 2 shows, we
would have to assume that $\operatorname{div}\mathbf{E}(0) = +\infty$. Hence the integrand on the right-hand
side of (14.42) would be a “function” equal to zero everywhere except at one point,
where it is equal to infinity. This corresponds to the fact that there are no charges
at all outside the origin, and we somehow managed to put the entire charge $q$ into
a space of volume zero, into the single point 0, at which the charge density naturally
became infinite. Here we are encountering the so-called Dirac$^7$ $\delta$-function
(delta-function).

The densities of physical quantities are needed ultimately so that one can find the
values of the quantities themselves by integrating the density. For that reason there
is no need to define the $\delta$-function at each individual point; it is more important
to define its integral. If we assume that physically the “function” $\delta_{x_0}(x) = \delta(x_0; x)$
must correspond to the density of a distribution, for example the distribution of mass
in space, for which the entire mass, equal to 1 in magnitude, is concentrated at the
single point $x_0$, it is natural to set

$$\int_V \delta(x_0; x)\, dV = \begin{cases} 1, & \text{when } x_0 \in V, \\ 0, & \text{when } x_0 \notin V. \end{cases}$$

Thus, from the point of view of a mathematical idealization of our ideas of the
possible distribution of a physical quantity (mass, charge, and the like) in space, we
must assume that its distribution density is the sum of an ordinary finite function
corresponding to a continuous distribution of the quantity in space and a certain set
of singular “functions” (of the same type as the Dirac $\delta$-function) corresponding to
a concentration of the quantity at individual points of space.

Hence, starting from these positions, the results of the computations in Example 2
can be expressed as the single equality $\operatorname{div}\mathbf{E} = \frac{q}{\varepsilon_0}\delta(0; x)$. Then, as applied to
the field $\mathbf{E}$, the integral on the right-hand side of (14.42') is indeed equal either to
$q/\varepsilon_0$ or to 0, according as the domain $V$ contains the origin (and the point charge
concentrated there) or not.

$^7$ P.A.M. Dirac (1902-1984) - British theoretical physicist, one of the founders of quantum
mechanics. More details on the Dirac $\delta$-function will be given in Sects. 17.4.4 and 17.5.4.

In this sense one can assert (following Gauss) that the flux of electric field in- 
tensity across the surface of a body equals (up to a factor depending on the units 
chosen) the sum of the electric charges contained in the body. In this same sense 
one must interpret the electric charge density p in the Maxwell equations consid- 
ered in Sect. 14.1 (formula (14.12)). 


b. The Curl 

We begin our study of the physical meaning of the curl with an example. 

Example 3 Suppose the entire space, regarded as a rigid body, is rotating with constant
angular speed $\omega$ about a fixed axis (let it be the $z$-axis). Let us find the curl of
the field $\mathbf{v}$ of linear velocities of the points of space. (The field is being studied at
any fixed instant of time.)

In cylindrical coordinates $(r, \varphi, z)$ we have the simple expression $\mathbf{v}(r, \varphi, z) =
\omega r\,\mathbf{e}_\varphi$. Then by formula (14.35'') of Sect. 14.1, we find immediately that $\operatorname{curl}\mathbf{v} =
2\omega\,\mathbf{e}_z$. That is, $\operatorname{curl}\mathbf{v}$ is a vector directed along the axis of rotation. Its magnitude $2\omega$
equals the angular velocity of the rotation, up to the coefficient 2, and the direction
of the vector, taking account of the orientation of the whole space $\mathbb{R}^3$, completely
determines the direction of rotation.
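The result of Example 3 is easy to confirm in Cartesian form, where $\omega r\,\mathbf{e}_\varphi = \omega(-y, x, 0)$. The sketch below is illustrative only: the value of `OMEGA`, the sample point, and the central-difference helper `curl` are assumptions made for the example.

```python
OMEGA = 0.8        # hypothetical angular speed

def v(x, y, z):
    """Velocity field of a rigid rotation about the z-axis (omega * r * e_phi)."""
    return (-OMEGA * y, OMEGA * x, 0.0)

def curl(field, p, h=1e-5):
    """Central-difference approximation of curl(field) at p, Cartesian components."""
    def d(component, axis):
        forward, backward = list(p), list(p)
        forward[axis] += h
        backward[axis] -= h
        return (field(*forward)[component] - field(*backward)[component]) / (2 * h)
    return (d(2, 1) - d(1, 2), d(0, 2) - d(2, 0), d(1, 0) - d(0, 1))

result = curl(v, (1.2, -0.7, 3.0))
print(result)      # expected to be close to (0, 0, 2 * OMEGA)
```

Because the field is linear in the coordinates, the same vector is obtained at every point, which is the constancy mentioned in Problem 8 of Sect. 14.1.6.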

The field described in Example 3 in the small resembles the velocity field of 
a funnel (sink) or the field of the vorticial motion of air in the neighborhood of a 
tornado (also a sink, but one that drains upward). Thus, the curl of a vector field 
at a point characterizes the degree of vorticity of the field in a neighborhood of the 
point. 

We remark that the circulation of a field over a closed contour varies in direct 
proportion to the magnitude of the vectors in the field, and, as one can verify using 
the same Example 3, it can also be used to characterize the vorticity of the field. 
Only now, to describe completely the vorticity of the field in a neighborhood of a 
point, it is necessary to compute the circulation over contours lying in three different 
planes. Let us now carry out this program. 

We take a disk $S_i(x)$ with center at the point $x$ and lying in a plane perpendicular
to the $i$th coordinate axis, $i = 1, 2, 3$. We orient $S_i(x)$ using a normal, which we take
to be the unit vector $\mathbf{e}_i$ along this coordinate axis. Let $d$ be the diameter of $S_i(x)$.
From formula (14.41') for a smooth field $\mathbf{A}$ we find that

$$(\operatorname{curl}\mathbf{A}) \cdot \mathbf{e}_i = \lim_{d \to 0} \frac{\oint_{\partial S_i(x)} \mathbf{A} \cdot d\mathbf{s}}{S_i(x)}, \tag{14.44}$$

where $S_i(x)$ denotes the area of the disk under discussion. Thus the circulation of
the field $\mathbf{A}$ over the boundary $\partial S_i$ per unit area in the plane orthogonal to the $i$th
coordinate axis characterizes the $i$th component of $\operatorname{curl}\mathbf{A}$.

To clarify still further the meaning of the curl of a vector field, we recall that 
every linear transformation of space is a composition of dilations in three mutually 
perpendicular directions, translation of the space as a rigid body, and rotation as a 
rigid body. Moreover, every rotation can be realized as a rotation about some axis. 
Every smooth deformation of the medium (flow of a liquid or gas, sliding of the 
ground, bending of a steel rod) is locally linear. Taking account of what has just been 
said and Example 3, we can conclude that if there is a vector field that describes the 
motion of a medium (the velocity field of the points in the medium), then the curl of 
that field at each point gives the instantaneous axis of rotation of a neighborhood of 
the point, the magnitude of the instantaneous angular velocity, and the direction of 
rotation about the instantaneous axis. That is, the curl characterizes completely the 
rotational part of the motion of the medium. This will be made slightly more precise 
below, where it will be shown that the curl should be regarded as a sort of density 
for the distribution of local rotations of the medium. 


c. The Gradient 

We have already said quite a bit about the gradient of a scalar field, that is, about the 
gradient of a function. Hence at this point we shall merely recall the main things. 

Since $\omega^1_{\operatorname{grad} f}(\boldsymbol{\xi}) = \langle \operatorname{grad} f, \boldsymbol{\xi}\rangle = df(\boldsymbol{\xi}) = D_{\boldsymbol{\xi}} f$, where $D_{\boldsymbol{\xi}} f$ is the derivative of
the function $f$ with respect to the vector $\boldsymbol{\xi}$, it follows that $\operatorname{grad} f$ is orthogonal to the
level surfaces of $f$, and at each point it points in the direction of most rapid increase
in the values of the function. Its magnitude $|\operatorname{grad} f|$ gives the rate of that growth
(per unit of length in the space in which the argument varies).

The significance of the gradient as a density will be discussed below. 


14.2.3 Other Integral Formulas 

a. Vector Versions of the Gauss-Ostrogradskii Formula 

The interpretation of the curl and gradient as vector densities, analogous to the
interpretation (14.43) of the divergence as a density, can be obtained from the following
classical formulas of vector analysis, connected with the Gauss-Ostrogradskii
formula.

\begin{align}
\int_V \nabla \cdot \mathbf{B}\, dV &= \int_{\partial V} d\boldsymbol{\sigma} \cdot \mathbf{B} && \text{(the divergence theorem)}, \tag{14.45} \\
\int_V \nabla \times \mathbf{A}\, dV &= \int_{\partial V} d\boldsymbol{\sigma} \times \mathbf{A} && \text{(the curl theorem)}, \tag{14.46} \\
\int_V \nabla f\, dV &= \int_{\partial V} d\boldsymbol{\sigma}\, f && \text{(the gradient theorem)}. \tag{14.47}
\end{align}

The first of these three relations coincides with (14.42') up to notation and is the
Gauss-Ostrogradskii formula. The vector equalities (14.46) and (14.47) follow from
(14.45) if we apply that formula to each component of the corresponding vector
field.

Retaining the notation $V(x)$ and $d$ used in Eq. (14.43), we obtain from formulas
(14.45)-(14.47) in a unified manner,

\begin{align}
\nabla \cdot \mathbf{B}(x) &= \lim_{d \to 0} \frac{\int_{\partial V(x)} d\boldsymbol{\sigma} \cdot \mathbf{B}}{V(x)}, \tag{14.43'} \\
\nabla \times \mathbf{A}(x) &= \lim_{d \to 0} \frac{\int_{\partial V(x)} d\boldsymbol{\sigma} \times \mathbf{A}}{V(x)}, \tag{14.48} \\
\nabla f(x) &= \lim_{d \to 0} \frac{\int_{\partial V(x)} d\boldsymbol{\sigma}\, f}{V(x)}. \tag{14.49}
\end{align}

The right-hand sides of (14.45)-(14.47) can be interpreted respectively as the
scalar flux of the vector field $\mathbf{B}$, the vector flux of the vector field $\mathbf{A}$, and the
vector flux of the scalar field $f$ across the surface $\partial V$ bounding the domain $V$.
Then the quantities $\operatorname{div}\mathbf{B}$, $\operatorname{curl}\mathbf{A}$, and $\operatorname{grad} f$ on the left-hand sides of Eqs. (14.43'),
(14.48), and (14.49) can be interpreted as the corresponding source densities of these
fields.

We remark that the right-hand sides of Eqs. (14.43'), (14.48), and (14.49) are
independent of the coordinate system. From these we can once again derive the
invariance of the gradient, curl, and divergence.


b. Vector Versions of Stokes’ Formula 

Just as formulas (14.45)-(14.47) were the result of combining the Gauss-Ostrogradskii
formula with the algebraic operations on vector and scalar fields, the following
triple of formulas can be obtained by combining these same operations with
the classical Stokes formula (which appears as the first of the three relations).

Let $S$ be a (piecewise-) smooth compact oriented surface with a consistently
oriented boundary $\partial S$, let $d\boldsymbol{\sigma}$ be the vector element of area on $S$, and $d\mathbf{s}$ the vector
element of length on $\partial S$. Then for smooth fields $\mathbf{A}$, $\mathbf{B}$, and $f$, the following relations
hold:

\begin{align}
\int_S d\boldsymbol{\sigma} \cdot (\nabla \times \mathbf{A}) &= \int_{\partial S} d\mathbf{s} \cdot \mathbf{A}, \tag{14.50} \\
\int_S (d\boldsymbol{\sigma} \times \nabla) \times \mathbf{B} &= \int_{\partial S} d\mathbf{s} \times \mathbf{B}, \tag{14.51} \\
\int_S d\boldsymbol{\sigma} \times \nabla f &= \int_{\partial S} d\mathbf{s}\, f. \tag{14.52}
\end{align}

Formulas (14.51) and (14.52) follow from Stokes' formula (14.50). We shall not
take time to give the proofs.


c. Green’s Formulas 

If S is a surface and n a unit normal vector to S, then the derivative D n f of the 
function / with respect to n is usually denoted in field theory. For example, 
(V/, dcr) = (V/, n) der = (grad/, n) da = D n f da = |^da. Thus, da is the 
flux of grad / across the element of surface da . 

In this notation we can write the following formulas of Green, which are very 
widely used in analysis: 


[ V/-Vg dV+ [ gV 2 /dy= [ (gV/)-d<r( = [ g d -f da), (14.53) 
Jv Jv Jav \ Jav on / 

[ (gV 2 /-/V 2 g)dV = 

Jv 

= [ 0(V/-/Vg).da(= [ (g^-/^)daY (14.54) 

Jav \ J av\ on dnj ) 

In particular, if we set f = g in (14.53) and g = 1 in (14.54), we find respectively, 

[ |V/| 2 dV+ [ fAfdV= [ fAf-d<r(=( (14.53/ 

Jv Jv Jav \ Jav on ) 

[ Af dV — f V / ■ do ( = [ ^da). (14.54/ 

Jv Jav \ Jav on J 


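This last equality is often called Gauss' theorem. Let us prove, for example, the
second of Eqs. (14.53) and (14.54):

Proof

$$\int_{\partial V} (g \nabla f - f \nabla g) \cdot d\boldsymbol{\sigma} = \int_V \nabla \cdot (g \nabla f - f \nabla g) \, dV =$$

$$= \int_V \left( \nabla g \cdot \nabla f + g \nabla^2 f - \nabla f \cdot \nabla g - f \nabla^2 g \right) dV =$$

$$= \int_V \left( g \nabla^2 f - f \nabla^2 g \right) dV = \int_V (g \Delta f - f \Delta g) \, dV.$$

In this formula we have used the Gauss-Ostrogradskii formula and the relation
$\nabla \cdot (\varphi \mathbf{A}) = \nabla \varphi \cdot \mathbf{A} + \varphi \nabla \cdot \mathbf{A}$. □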
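As an illustration only, relation (14.54') can be checked numerically on a simple domain. The sketch below is not part of the original text: the unit cube $V = [0,1]^3$, the test function $f = x^2 y + z^3$, and all function names are our own assumptions. Both sides are approximated by the midpoint rule, and for this $f$ each should be close to 4.

```python
# Numerical check of Gauss' theorem (14.54') on the unit cube [0,1]^3
# for the hypothetical test function f(x, y, z) = x^2*y + z^3.

def midpoint_nodes(n):
    """Midpoints of n equal subintervals of [0, 1] and the step h."""
    h = 1.0 / n
    return [(i + 0.5) * h for i in range(n)], h

def laplacian_f(x, y, z):
    # Laplacian of f = x^2*y + z^3, computed by hand
    return 2.0 * y + 6.0 * z

def grad_f(x, y, z):
    # grad f = (2xy, x^2, 3z^2), computed by hand
    return (2.0 * x * y, x * x, 3.0 * z * z)

def volume_integral(n=20):
    """Midpoint-rule approximation of the integral of Laplacian f over the cube."""
    pts, h = midpoint_nodes(n)
    total = sum(laplacian_f(x, y, z) for x in pts for y in pts for z in pts)
    return total * h ** 3

def boundary_flux(n=20):
    """Flux of grad f through the six faces, with outward normals."""
    pts, h = midpoint_nodes(n)
    total = 0.0
    for u in pts:
        for v in pts:
            total += grad_f(1.0, u, v)[0] - grad_f(0.0, u, v)[0]  # faces x = 1, x = 0
            total += grad_f(u, 1.0, v)[1] - grad_f(u, 0.0, v)[1]  # faces y = 1, y = 0
            total += grad_f(u, v, 1.0)[2] - grad_f(u, v, 0.0)[2]  # faces z = 1, z = 0
    return total * h ** 2
```

The two functions `volume_integral` and `boundary_flux` compute the left-hand and right-hand sides of (14.54') independently, so their agreement is a genuine (if elementary) test of the formula.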


14.2.4 Problems and Exercises 

1. Using the Gauss-Ostrogradskii formula (14.45), prove relations (14.46) and 
(14.47). 

2. Using Stokes’ formula (14.50), prove relations (14.51) and (14.52).

3. a) Verify that formulas (14.45), (14.46), and (14.47) remain valid for an
unbounded domain $V$ if the integrands in the surface integrals are of order $O\!\left(\frac{1}{r^3}\right)$ as
$r \to \infty$. (Here $r = |\mathbf{r}|$, and $\mathbf{r}$ is the radius-vector in $\mathbb{R}^3$.)

b) Determine whether formulas (14.50), (14.51), and (14.52) remain valid for a
noncompact surface $S \subset \mathbb{R}^3$ if the integrands in the line integrals are of order $O\!\left(\frac{1}{r^2}\right)$
as $r \to \infty$.

c) Give examples showing that for unbounded surfaces and domains Stokes’
formula (14.41') and the Gauss-Ostrogradskii formula (14.42') are in general not
true.

4. a) Starting from the interpretation of the divergence as a source density, explain
why the second of the Maxwell equations (formula (14.12) of Sect. 14.1) implies 
that there are no point sources in the magnetic field (that is, there are no magnetic 
charges). 

b) Using the Gauss-Ostrogradskii formula and the Maxwell equations (formula 
(14.12) of Sect. 14.1), show that no rigid configuration of test charges (for example 
a single charge) can be in a stable equilibrium state in the domain of an electrostatic 
field that is free of the (other) charges that create the field. (It is assumed that no 
forces except those exerted by the field act on the system.) This fact is known as 
Earnshaw’s theorem. 

5. If an electromagnetic field is steady, that is, independent of time, then the system
of Maxwell equations (formula (14.12) of Sect. 14.1) decomposes into two independent
parts: the electrostatic equations $\nabla \cdot \mathbf{E} = \frac{\rho}{\varepsilon_0}$, $\nabla \times \mathbf{E} = 0$, and the magnetostatic
equations $\nabla \times \mathbf{B} = \frac{\mathbf{j}}{\varepsilon_0 c^2}$, $\nabla \cdot \mathbf{B} = 0$.

The equation $\nabla \cdot \mathbf{E} = \rho/\varepsilon_0$, where $\rho$ is the charge density, transforms via the
Gauss-Ostrogradskii formula into $\int_S \mathbf{E} \cdot d\boldsymbol{\sigma} = Q/\varepsilon_0$, where the left-hand side is the
flux of the electric field intensity across the closed surface $S$ and the right-hand side
is the sum $Q$ of the charges in the domain bounded by $S$, divided by the dimensioning
constant $\varepsilon_0$. In electrostatics this relation is usually called Gauss’ law. Using
Gauss’ law, find the electric field $\mathbf{E}$

a) created by a uniformly charged sphere, and verify that outside the sphere it is 
the same as the field of a point charge of the same magnitude located at the center 
of the sphere; 

b) of a uniformly charged line; 

c) of a uniformly charged plane; 

d) of a pair of parallel planes uniformly charged with charges of opposite sign; 

e) of a uniformly charged ball. 

6. a) Prove Green’s formula (14.53). 

b) Let $f$ be a harmonic function in the bounded domain $V$ (that is, $f$ satisfies
Laplace’s equation $\Delta f = 0$ in $V$). Show, starting from (14.54'), that the flux of the
gradient of this function across the boundary of the domain $V$ is zero.

c) Verify that a harmonic function in a bounded connected domain is determined 
up to an additive constant by the values of its normal derivative on the boundary of 
the domain. 

d) Starting from (14.53'), prove that if a harmonic function in a bounded domain 
vanishes on the boundary, it is identically zero throughout the domain. 

e) Show that if the values of two harmonic functions are the same on the bound- 
ary of a bounded domain, then the functions are equal in the domain. 

f) Starting from (14.53), verify the following principle of Dirichlet. Among all
continuously differentiable functions in a domain assuming prescribed values on the
boundary, a harmonic function in the region is the only one that minimizes the
Dirichlet integral (that is, the integral of the squared modulus of the gradient over
the domain).

7. a) Let $r(p, q) = |p - q|$ be the distance between the points $p$ and $q$ in the
Euclidean space $\mathbb{R}^3$. By fixing $p$, we obtain a function $r_p(q)$ of $q \in \mathbb{R}^3$. Show that
$\Delta r_p^{-1}(q) = -4\pi\delta(p; q)$, where $\delta$ is the $\delta$-function.

b) Let $g$ be harmonic in the domain $V$. Setting $f = 1/r_p$ in (14.54) and taking
account of the preceding result, we obtain

$$g(p) = \frac{1}{4\pi} \int_{S} \left( \frac{1}{r_p} \frac{\partial g}{\partial n} - g \frac{\partial}{\partial n} \frac{1}{r_p} \right) d\sigma,$$

where $S = \partial V$ and $p \in V$. Prove this equality precisely.

c) Deduce from the preceding equality that if $S$ is a sphere of radius $R$ with
center at $p$, then

$$g(p) = \frac{1}{4\pi R^2} \int_S g \, d\sigma.$$

This is the so-called mean-value theorem for harmonic functions.

d) Starting from the preceding result, show that if $B$ is the ball bounded by the
sphere $S$ considered in part c) and $V(B)$ is its volume, then

$$g(p) = \frac{1}{V(B)} \int_B g \, dV.$$

e) If $p$ and $q$ are points of the Euclidean plane $\mathbb{R}^2$, then along with the function
$\frac{1}{r_p}$ considered in a) above (corresponding to the potential of a charge located
at $p$), we now take the function $\ln \frac{1}{r_p}$ (corresponding to the potential of a uniformly
charged line in space). Show that $\Delta \ln \frac{1}{r_p} = -2\pi\delta(p; q)$, where $\delta(p; q)$ is now the
$\delta$-function in $\mathbb{R}^2$.

f) By repeating the reasoning in a), b), c), and d), obtain the mean-value theorem
for functions that are harmonic in plane regions.

8. Cauchy’s multi-dimensional mean-value theorem.

The classical mean-value theorem for the integral (“Lagrange’s theorem”) asserts
that if the function $f : D \to \mathbb{R}$ is continuous on a compact, measurable, and
connected set $D \subset \mathbb{R}^n$ (for example, in a domain), then there exists a point $\xi \in D$ such
that

$$\int_D f(x)\,dx = f(\xi) \cdot |D|,$$

where $|D|$ is the measure (volume) of $D$.

a) Now let $f, g \in C(D, \mathbb{R})$, that is, $f$ and $g$ are continuous real-valued functions
in $D$. Show that the following theorem (“Cauchy’s theorem”) holds: There exists
$\xi \in D$ such that

$$g(\xi) \int_D f(x)\,dx = f(\xi) \int_D g(x)\,dx.$$

b) Let $D$ be a compact domain with smooth boundary $\partial D$ and $\mathbf{f}$ and $\mathbf{g}$ two
smooth vector fields in $D$. Show that there exists a point $\xi \in D$ such that

$$\operatorname{div} \mathbf{g}(\xi) \cdot \operatorname*{Flux}_{\partial D} \mathbf{f} = \operatorname{div} \mathbf{f}(\xi) \cdot \operatorname*{Flux}_{\partial D} \mathbf{g},$$

where $\operatorname*{Flux}_{\partial D}$ is the flux of a vector field across the surface $\partial D$.


14.3 Potential Fields 

14.3.1 The Potential of a Vector Field 

Definition 1 Let $\mathbf{A}$ be a vector field in the domain $D \subset \mathbb{R}^n$. A function $U : D \to \mathbb{R}$
is called a potential of the field $\mathbf{A}$ if $\mathbf{A} = \operatorname{grad} U$ in $D$.

Definition 2 A field that has a potential is called a potential field.

Since the partial derivatives of a function determine the function up to an additive 
constant in a connected domain, the potential is unique in such a domain up to an 
additive constant. 

We briefly mentioned potentials in the first part of this course. Now we shall 
discuss this important concept in somewhat more detail. In connection with these 
definitions we note that when different force fields are studied in physics, the
potential of a field $\mathbf{F}$ is usually defined as a function $U$ such that $\mathbf{F} = -\operatorname{grad} U$. This
potential differs from the one given in Definition 1 only in sign.

Example 1 At a point of space having radius-vector $\mathbf{r}$ the intensity $\mathbf{F}$ of the
gravitational field due to a point mass $M$ located at the origin can be computed from
Newton’s law as

$$\mathbf{F} = -GM \frac{\mathbf{r}}{r^3}, \tag{14.55}$$

where $r = |\mathbf{r}|$.

This is the force with which the field acts on a unit mass at this point of space. The
gravitational field (14.55) is a potential field. Its potential in the sense of Definition 1
is the function

$$U = GM \frac{1}{r}. \tag{14.56}$$

Example 2 At a point of space having radius-vector $\mathbf{r}$ the intensity $\mathbf{E}$ of the electric
field due to a point charge $q$ located at the origin can be computed from Coulomb’s
law

$$\mathbf{E} = \frac{q}{4\pi\varepsilon_0} \frac{\mathbf{r}}{r^3}.$$

Thus such an electrostatic field, like the gravitational field, is a potential field. Its
potential $\varphi$ in the sense of physical terminology is defined by the relation

$$\varphi = \frac{q}{4\pi\varepsilon_0} \frac{1}{r}.$$
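For the reader who wishes to verify Examples 1 and 2 directly, here is a short worked computation (added as an illustration; it uses only the definition $r = \sqrt{(x^1)^2 + (x^2)^2 + (x^3)^2}$). Since $\frac{\partial r}{\partial x^i} = \frac{x^i}{r}$, we have

$$\frac{\partial}{\partial x^i} \frac{1}{r} = -\frac{1}{r^2} \frac{\partial r}{\partial x^i} = -\frac{x^i}{r^3}, \qquad \text{that is,} \qquad \operatorname{grad} \frac{1}{r} = -\frac{\mathbf{r}}{r^3}.$$

Hence $\operatorname{grad}\left( GM \frac{1}{r} \right) = -GM \frac{\mathbf{r}}{r^3} = \mathbf{F}$, in agreement with Definition 1, while $-\operatorname{grad}\left( \frac{q}{4\pi\varepsilon_0} \frac{1}{r} \right) = \frac{q}{4\pi\varepsilon_0} \frac{\mathbf{r}}{r^3} = \mathbf{E}$, in agreement with the sign convention $\mathbf{E} = -\operatorname{grad} \varphi$ used in physics.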


14.3.2 Necessary Condition for Existence of a Potential 

In the language of differential forms the equality $\mathbf{A} = \operatorname{grad} U$ means that
$\omega^1_{\mathbf{A}} = \omega^1_{\operatorname{grad} U} = dU$, from which it follows that

$$d\omega^1_{\mathbf{A}} = 0, \tag{14.57}$$

since $d\omega^1_{\mathbf{A}} = d^2 U = 0$. This is a necessary condition for the field $\mathbf{A}$ to be a potential field.

In Cartesian coordinates this condition can be expressed very simply. If
$\mathbf{A} = (A^1, \ldots, A^n)$ and $\mathbf{A} = \operatorname{grad} U$, then $A^i = \frac{\partial U}{\partial x^i}$, $i = 1, \ldots, n$, and if the potential $U$ is
sufficiently smooth (for example, if its second-order partial derivatives are continuous),
we must have

$$\frac{\partial A^i}{\partial x^j} = \frac{\partial A^j}{\partial x^i}, \qquad i, j = 1, \ldots, n, \tag{14.57'}$$

which simply means that the mixed partial derivatives are equal in both orders:

$$\frac{\partial^2 U}{\partial x^i \partial x^j} = \frac{\partial^2 U}{\partial x^j \partial x^i}.$$

In Cartesian coordinates $\omega^1_{\mathbf{A}} = \sum_{i=1}^n A^i \, dx^i$, and therefore the equalities (14.57)
and (14.57') are indeed equivalent in this case.

In the case of $\mathbb{R}^3$ we have $d\omega^1_{\mathbf{A}} = \omega^2_{\operatorname{curl} \mathbf{A}}$, so that the necessary condition (14.57)
can be rewritten as

$$\operatorname{curl} \mathbf{A} = 0,$$

which corresponds to the relation $\operatorname{curl} \operatorname{grad} U = 0$, which we already know.

Example 3 The field $\mathbf{A} = (x, xy, xyz)$ in Cartesian coordinates in $\mathbb{R}^3$ cannot be a
potential field, since, for example, $\frac{\partial (xy)}{\partial x} \neq \frac{\partial x}{\partial y}$.
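The necessary condition (14.57') is easy to test numerically at a given point. The following sketch is an illustration under our own assumptions (the function names, the step `h`, and the tolerance are hypothetical choices, not part of the text): it approximates the matrix $\partial A^i / \partial x^j$ by central differences and checks its symmetry, first for the field of Example 3 and then for a genuine gradient field.

```python
# Finite-difference test of the symmetry condition (14.57') at one point.

def jacobian(field, point, h=1e-6):
    """J[i][j] approximates dA^i/dx^j at `point` by central differences."""
    n = len(point)
    J = [[0.0] * n for _ in range(n)]
    for j in range(n):
        fwd = list(point)
        bwd = list(point)
        fwd[j] += h
        bwd[j] -= h
        a_fwd = field(*fwd)
        a_bwd = field(*bwd)
        for i in range(n):
            J[i][j] = (a_fwd[i] - a_bwd[i]) / (2 * h)
    return J

def satisfies_necessary_condition(field, point, tol=1e-6):
    """True if dA^i/dx^j = dA^j/dx^i (up to tol) for all i < j at `point`."""
    J = jacobian(field, point)
    n = len(point)
    return all(abs(J[i][j] - J[j][i]) < tol
               for i in range(n) for j in range(i + 1, n))

def example3(x, y, z):
    # The field A = (x, xy, xyz) of Example 3
    return (x, x * y, x * y * z)

def gradient_field(x, y, z):
    # grad U for the hypothetical potential U = x*y*z
    return (y * z, x * z, x * y)
```

A test at a single point can only refute the condition, of course; passing it at one point (or at finitely many points) does not prove that the field is closed.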

Example 4 Consider the field $\mathbf{A} = (A_x, A_y)$ given by

$$\mathbf{A} = \left( -\frac{y}{x^2 + y^2}, \; \frac{x}{x^2 + y^2} \right), \tag{14.58}$$

defined in Cartesian coordinates at all points of the plane except the origin. The
necessary condition for a field to be a potential field, $\frac{\partial A_x}{\partial y} = \frac{\partial A_y}{\partial x}$, is fulfilled in this
case. However, as we shall soon verify, this field is not a potential field in its domain
of definition.

Thus the necessary condition (14.57), or, in Cartesian coordinates (14.57'), is in
general not sufficient for a field to be a potential field.


14.3.3 Criterion for a Field to be Potential 


Proposition 1 A continuous vector field $\mathbf{A}$ in a domain $D \subset \mathbb{R}^n$ is a potential field
in $D$ if and only if its circulation (work) around every closed curve $\gamma$ contained in $D$
is zero:

$$\oint_\gamma \mathbf{A} \cdot d\mathbf{s} = 0. \tag{14.59}$$

Proof Necessity. Suppose $\mathbf{A} = \operatorname{grad} U$. Then by the Newton-Leibniz formula
(formula (14.40') of Sect. 14.2),

$$\int_\gamma \mathbf{A} \cdot d\mathbf{s} = U(\gamma(b)) - U(\gamma(a)),$$

where $\gamma : [a, b] \to D$. If $\gamma(a) = \gamma(b)$, that is, when the path $\gamma$ is closed, it is
obvious that the right-hand side of this last equality vanishes, and hence the left-hand
side does also.

Sufficiency. Suppose condition (14.59) holds. Then the integral over any (not
necessarily closed) path in $D$ depends only on its initial and terminal points, not on the path
joining them. Indeed, if $\gamma_1$ and $\gamma_2$ are two paths having the same initial and terminal
points, then, traversing first $\gamma_1$, then $-\gamma_2$ (that is, traversing $\gamma_2$ in the opposite
direction), we obtain a closed path $\gamma$ whose integral, by (14.59), equals zero, but
is also the difference of the integrals over $\gamma_1$ and $\gamma_2$. Hence these last two integrals
really are equal.

We now fix some point $x_0 \in D$ and set

$$U(x) = \int_{x_0}^{x} \mathbf{A} \cdot d\mathbf{s}, \tag{14.60}$$

where the integral on the right is the integral over any path in $D$ from $x_0$ to $x$. We
shall verify that the function $U$ so defined is the required potential for the field $\mathbf{A}$.
For convenience, we shall assume that a Cartesian coordinate system $(x^1, \ldots, x^n)$
has been chosen in $\mathbb{R}^n$. Then $\mathbf{A} \cdot d\mathbf{s} = A^1 \, dx^1 + \cdots + A^n \, dx^n$. If we move away
from $x$ along a straight line in the direction $h e_i$, where $e_i$ is the unit vector along the
$x^i$-axis, the function $U$ receives an increment equal to

$$U(x + h e_i) - U(x) = \int_{x^i}^{x^i + h} A^i(x^1, \ldots, x^{i-1}, t, x^{i+1}, \ldots, x^n) \, dt,$$

equal to the integral of the form $\mathbf{A} \cdot d\mathbf{s}$ over this path from $x$ to $x + h e_i$. By the
continuity of $\mathbf{A}$ and the mean-value theorem, this last equality can be written as

$$U(x + h e_i) - U(x) = A^i(x^1, \ldots, x^{i-1}, x^i + \theta h, x^{i+1}, \ldots, x^n)\, h,$$

where $0 < \theta < 1$. Dividing this last equality by $h$ and letting $h$ tend to zero, we find

$$\frac{\partial U}{\partial x^i}(x) = A^i(x),$$

that is, $\mathbf{A} = \operatorname{grad} U$. □

Remark 1 As can be seen from the proof, a sufficient condition for a field to be a
potential field is that (14.59) hold for smooth paths or, for example, for broken lines
whose links are parallel to the coordinate axes.

We now return to Example 4. Earlier (Example 1 of Sect. 8.1) we computed that
the circulation of the field (14.58) over the circle $x^2 + y^2 = 1$ traversed once in the
counterclockwise direction was $2\pi$ ($\neq 0$).

Thus, by Proposition 1 we can conclude that the field (14.58) is not a potential
field in the domain $\mathbb{R}^2 \setminus 0$.
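The value $2\pi$ of the circulation can also be confirmed numerically. The sketch below is illustrative only (the function names and the number of quadrature nodes are our own assumptions): it integrates the field (14.58) over a circle by the midpoint rule. For a circle enclosing the origin the result should be close to $2\pi$, and for a circle that does not enclose the origin it should be close to zero.

```python
import math

def field(x, y):
    """The field (14.58): A = (-y/(x^2+y^2), x/(x^2+y^2))."""
    r2 = x * x + y * y
    return (-y / r2, x / r2)

def circulation(cx, cy, radius, n=2000):
    """Midpoint-rule circulation of `field` over the circle with centre
    (cx, cy) and the given radius, traversed once counterclockwise."""
    dt = 2 * math.pi / n
    total = 0.0
    for k in range(n):
        t = (k + 0.5) * dt
        x = cx + radius * math.cos(t)
        y = cy + radius * math.sin(t)
        ax, ay = field(x, y)
        # ds = (-R sin t, R cos t) dt along the circle
        total += (ax * (-radius * math.sin(t)) + ay * (radius * math.cos(t))) * dt
    return total
```

The second case illustrates Proposition 2 below: inside a disk that avoids the origin the field does have a potential, so its circulation over closed paths in that disk vanishes.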

But surely, for example,

$$\operatorname{grad} \arctan \frac{y}{x} = \left( -\frac{y}{x^2 + y^2}, \; \frac{x}{x^2 + y^2} \right),$$

and it would seem that the function $\arctan \frac{y}{x}$ is a potential for (14.58). What is this,
a contradiction?! There is no contradiction as yet, since the only correct conclusion
that one can make in this situation is that the function $\arctan \frac{y}{x}$ is not defined in the
entire domain $\mathbb{R}^2 \setminus 0$. And that is indeed the case: Take for example, the points on
the $y$-axis. But then, you may say, we could consider the function $\varphi(x, y)$, the polar
angular coordinate of the point $(x, y)$. That is practically the same thing as $\arctan \frac{y}{x}$,
but $\varphi(x, y)$ is also defined for $x = 0$, provided the point $(x, y)$ is not at the origin.
Throughout the domain $\mathbb{R}^2 \setminus 0$ we have

$$d\varphi = -\frac{y}{x^2 + y^2} \, dx + \frac{x}{x^2 + y^2} \, dy.$$

However, there is still no contradiction, although the situation is now more delicate.
Please note that in fact $\varphi$ is not a continuous single-valued function of a point in
the domain $\mathbb{R}^2 \setminus 0$. As a point encircles the origin counterclockwise, its polar angle,
varying continuously, will have increased by $2\pi$ when the point returns to its starting
position. That is, we arrive at the original point with a new value of the function,
different from the one we began with. Consequently, we must give up either the
continuity or the single-valuedness of the function $\varphi$ in the domain $\mathbb{R}^2 \setminus 0$.

In a small neighborhood (not containing the origin) of each point of the domain
$\mathbb{R}^2 \setminus 0$ one can distinguish a continuous single-valued branch of the function $\varphi$. All
such branches differ from one another by an additive constant, a multiple of $2\pi$.
That is why they all have the same differential and can all serve locally as potentials
of the field (14.58). Nevertheless, the field (14.58) has no potential in the entire
domain $\mathbb{R}^2 \setminus 0$.

The situation studied in Example 4 turns out to be typical in the sense that the 
necessary condition (14.57) or (14.57') for the field A to be a potential field is locally 
also sufficient. The following proposition holds. 


Proposition 2 If the necessary condition for a field to be a potential field holds in
a ball, then the field has a potential in that ball.

Proof For the sake of intuitiveness we first carry out the proof in the case of a disk
$D = \{(x, y) \in \mathbb{R}^2 \mid x^2 + y^2 < r\}$ in the plane $\mathbb{R}^2$. One can arrive at the point $(x, y)$
of the disk from the origin along two different two-link broken lines $\gamma_1$ and $\gamma_2$ with
links parallel to the coordinate axes (see Fig. 14.3). Since $D$ is a convex domain, the
entire rectangle $I$ bounded by these lines is contained in $D$.

By Stokes’ formula, taking account of condition (14.57), we obtain

$$\int_{\gamma_1} \omega^1_{\mathbf{A}} - \int_{\gamma_2} \omega^1_{\mathbf{A}} = \pm \int_{\partial I} \omega^1_{\mathbf{A}} = \pm \int_I d\omega^1_{\mathbf{A}} = 0.$$

By the remark to Proposition 1 we can conclude from this that the field $\mathbf{A}$ is
a potential field in $D$. Moreover, by the proof of sufficiency in Proposition 1, the
function (14.60) can again be taken as the potential, the integral being interpreted
as the integral over a broken line from the center to the point in question with links
parallel to the axes. In this case the independence of the choice of path $\gamma_1$, $\gamma_2$ for
such an integral followed immediately from Stokes’ formula for a rectangle.

In higher dimensions it follows from Stokes’ formula for a two-dimensional
rectangle that replacing two adjacent links of the broken line by two links forming the
sides of a rectangle parallel to the original does not change the value of the integral
over the path. Since one can pass from one broken-line path to any other broken-line
path leading to the same point by a sequence of such reconstructions, the potential
is unambiguously defined in the general case. □
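The construction used in this proof is entirely explicit, and can be sketched in code. In the illustration below (our own example, not from the text) the field is $\mathbf{A} = (2xy, \; x^2 + 3y^2)$, which satisfies (14.57') in the whole plane; the function (14.60) is computed by numerical integration along each of the two broken lines $\gamma_1$, $\gamma_2$ from the origin. All names and the choice of the midpoint rule are assumptions made for the sketch.

```python
# Potential (14.60) computed along two-link broken lines parallel to the axes.

def A(x, y):
    # Hypothetical closed field; by hand, A = grad(x^2*y + y^3)
    return (2 * x * y, x * x + 3 * y * y)

def integrate(g, a, b, n=1000):
    """Composite midpoint rule for the integral of g over [a, b]."""
    h = (b - a) / n
    return sum(g(a + (k + 0.5) * h) for k in range(n)) * h

def potential_x_then_y(x, y):
    """Integrate A . ds from (0,0) to (x,0), then from (x,0) to (x,y)."""
    first = integrate(lambda t: A(t, 0.0)[0], 0.0, x)
    second = integrate(lambda t: A(x, t)[1], 0.0, y)
    return first + second

def potential_y_then_x(x, y):
    """Integrate A . ds from (0,0) to (0,y), then from (0,y) to (x,y)."""
    first = integrate(lambda t: A(0.0, t)[1], 0.0, y)
    second = integrate(lambda t: A(t, y)[0], 0.0, x)
    return first + second
```

Both functions should agree with each other and with $U = x^2 y + y^3$ up to quadrature error, which is the path independence guaranteed by Stokes’ formula for the rectangle $I$.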


14.3.4 Topological Structure of a Domain and Potentials 

Comparing Example 4 and Proposition 2, one can conclude that when the necessary 
condition (14.57) for a field to be a potential field holds, the question whether it 
is always a potential field depends on the (topological) structure of the domain in 
which the field is defined. The following considerations (here and in Sect. 14.3.5 
below) give an elementary idea as to exactly how the characteristics of the domain 
bring this about. 

It turns out that if the domain $D$ is such that every closed path in $D$ can be
contracted to a point of the domain without going outside the domain, then the
necessary condition (14.57) for a field to be a potential field in $D$ is also sufficient.
We shall call such domains simply connected below. A ball is a simply connected
domain (and that is why Proposition 2 holds). But the punctured plane $\mathbb{R}^2 \setminus 0$ is
not simply connected, since a path that encircles the origin cannot be contracted
to a point without going outside the region. This is why not every field in $\mathbb{R}^2 \setminus 0$
satisfying (14.57'), as we saw in Example 4, is necessarily a potential field in $\mathbb{R}^2 \setminus 0$.

We now turn from the general description to precise formulations. We begin by
stating clearly what we mean when we speak of deforming or contracting a path.

Definition 3 A homotopy (or deformation) in $D$ from a closed path $\gamma_0 : [0, 1] \to D$
to a closed path $\gamma_1 : [0, 1] \to D$ is a continuous mapping $\Gamma : I^2 \to D$ of the
square $I^2 = \{(t^1, t^2) \in \mathbb{R}^2 \mid 0 \le t^i \le 1, \; i = 1, 2\}$ into $D$ such that $\Gamma(t^1, 0) = \gamma_0(t^1)$,
$\Gamma(t^1, 1) = \gamma_1(t^1)$, and $\Gamma(0, t^2) = \Gamma(1, t^2)$ for all $t^1, t^2 \in [0, 1]$.

Thus a homotopy is a mapping $\Gamma : I^2 \to D$ (Fig. 14.4). If the variable $t^2$ is
regarded as time, according to Definition 3 at each instant of time $t = t^2$ we have
a closed path $\Gamma(t^1, t) = \gamma_t$ (Fig. 14.4).⁸ The change in this path with time is such
that at the initial instant $t = t^2 = 0$ it coincides with $\gamma_0$ and at time $t = t^2 = 1$ it
becomes $\gamma_1$.

Since the condition $\gamma_t(0) = \Gamma(0, t) = \Gamma(1, t) = \gamma_t(1)$, which means that the path
$\gamma_t$ is closed, holds at all times $t \in [0, 1]$, the mapping $\Gamma : I^2 \to D$ induces the
same mappings $\beta_0(t) := \Gamma(0, t) = \Gamma(1, t) =: \beta_1(t)$ on the vertical sides of the
square $I^2$.

The mapping $\Gamma$ is a formalization of our intuitive picture of gradually deforming
$\gamma_0$ to $\gamma_1$.

It is clear that time can be allowed to run backwards, and then we obtain the path
$\gamma_0$ from $\gamma_1$.

Definition 4 Two closed paths are homotopic in a domain if they can be obtained
from each other by a homotopy in that domain, that is, a homotopy can be
constructed in that domain from one to the other.

⁸ Orienting arrows are shown along certain curves in Fig. 14.4. These arrows will be used a little
later; for the time being the reader should not pay any attention to them.

Remark 2 Since the paths we have to deal with in analysis are as a rule paths of
integration, we shall consider only smooth or piecewise-smooth paths and smooth
or piecewise-smooth homotopies among them, without noting this explicitly.

For domains in $\mathbb{R}^n$ one can verify that the presence of a continuous homotopy of
(piecewise-) smooth paths guarantees the existence of (piecewise-) smooth
homotopies of these paths.

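Proposition 3 If the 1-form $\omega^1_{\mathbf{A}}$ in the domain $D$ is such that $d\omega^1_{\mathbf{A}} = 0$, and the
closed paths $\gamma_0$ and $\gamma_1$ are homotopic in $D$, then

$$\int_{\gamma_0} \omega^1_{\mathbf{A}} = \int_{\gamma_1} \omega^1_{\mathbf{A}}.$$

Proof Let $\Gamma : I^2 \to D$ be a homotopy from $\gamma_0$ to $\gamma_1$ (see Fig. 14.4). If $I_0$ and $I_1$
are the bases of the square $I^2$ and $J_0$ and $J_1$ its vertical sides, then by definition of
a homotopy of closed paths, the restrictions of $\Gamma$ to $I_0$ and $I_1$ coincide with $\gamma_0$ and
$\gamma_1$ respectively, and the restrictions of $\Gamma$ to $J_0$ and $J_1$ give some paths $\beta_0$ and $\beta_1$
in $D$. Since $\Gamma(0, t^2) = \Gamma(1, t^2)$, the paths $\beta_0$ and $\beta_1$ are the same. As a result of the
change of variables $x = \Gamma(t)$, the form $\omega^1_{\mathbf{A}}$ transfers to the square $I^2$ as some 1-form
$\omega = \Gamma^* \omega^1_{\mathbf{A}}$. In the process $d\omega = d\Gamma^* \omega^1_{\mathbf{A}} = \Gamma^* d\omega^1_{\mathbf{A}} = 0$, since $d\omega^1_{\mathbf{A}} = 0$. Hence, by
Stokes’ formula

$$\int_{\partial I^2} \omega = \int_{I^2} d\omega = 0.$$

But

$$\int_{\partial I^2} \omega = \int_{I_0} \omega + \int_{J_1} \omega - \int_{I_1} \omega - \int_{J_0} \omega = \int_{\gamma_0} \omega^1_{\mathbf{A}} + \int_{\beta_1} \omega^1_{\mathbf{A}} - \int_{\gamma_1} \omega^1_{\mathbf{A}} - \int_{\beta_0} \omega^1_{\mathbf{A}} = \int_{\gamma_0} \omega^1_{\mathbf{A}} - \int_{\gamma_1} \omega^1_{\mathbf{A}}. \qquad \square$$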



Definition 5 A domain is simply connected if every closed path in it is homotopic 
to a point (that is, a constant path). 

Thus simply connected domains are those in which every closed path can be 
contracted to a point. 

Proposition 4 If a field $\mathbf{A}$ defined in a simply connected domain $D$ satisfies the
necessary condition (14.57) or (14.57') to be a potential field, then it is a potential
field in $D$.

Proof By Proposition 1 and Remark 1 it suffices to verify that Eq. (14.59) holds for
every smooth path $\gamma$ in $D$. The path $\gamma$ is by hypothesis homotopic to a constant
path whose support consists of a single point. The integral over such a one-point
path is obviously zero. But by Proposition 3 the integral does not change under a
homotopy, and so Eq. (14.59) must hold for $\gamma$. □

Remark 3 Proposition 4 subsumes Proposition 2. However, since we had certain 
applications in mind, we considered it useful to give an independent constructive 
proof of Proposition 2. 

Remark 4 Proposition 2 was proved without invoking the possibility of a smooth 
homotopy of smooth paths. 


14.3.5 Vector Potential. Exact and Closed Forms 

Definition 6 A field $\mathbf{A}$ is a vector potential for a field $\mathbf{B}$ in a domain $D \subset \mathbb{R}^3$ if the
relation $\mathbf{B} = \operatorname{curl} \mathbf{A}$ holds in the domain.

If we recall the connection between vector fields and forms in the oriented
Euclidean space $\mathbb{R}^3$ and also the definition of the curl of a vector field, the relation
$\mathbf{B} = \operatorname{curl} \mathbf{A}$ can be rewritten as $\omega^2_{\mathbf{B}} = d\omega^1_{\mathbf{A}}$. It follows from this relation that
$\omega^3_{\operatorname{div} \mathbf{B}} = d\omega^2_{\mathbf{B}} = d^2 \omega^1_{\mathbf{A}} = 0$. Thus we obtain the necessary condition

$$\operatorname{div} \mathbf{B} = 0, \tag{14.61}$$

which the field $\mathbf{B}$ must satisfy in $D$ in order to have a vector potential, that is, in
order to be the curl of a vector field $\mathbf{A}$ in that domain.

A field satisfying condition (14.61) is often, especially in physics, called a
solenoidal field.

Example 5 In Sect. 14.1 we wrote out the system of Maxwell equations. The second
equation of this system is exactly Eq. (14.61). Thus, the desire naturally arises to
regard a magnetic field $\mathbf{B}$ as the curl of some vector field $\mathbf{A}$, the vector potential
of $\mathbf{B}$. When solving the Maxwell equations, one passes to exactly such a vector
potential.

As can be seen from Definitions 1 and 6, the questions of the scalar and vector
potential of vector fields (the latter question being posed only in $\mathbb{R}^3$) are special
cases of the general question as to when a differential $p$-form $\omega^p$ is the differential
$d\omega^{p-1}$ of some form $\omega^{p-1}$.

Definition 7 A differential form $\omega^p$ is exact in a domain $D$ if there exists a form
$\omega^{p-1}$ in $D$ such that $\omega^p = d\omega^{p-1}$.

If the form $\omega^p$ is exact in $D$, then $d\omega^p = d^2 \omega^{p-1} = 0$. Thus the condition

$$d\omega = 0 \tag{14.62}$$

is a necessary condition for the form $\omega$ to be exact.

Fig. 14.5

As we have already seen (Example 4), not every form satisfying this condition is
exact. For that reason we make the following definition.

Definition 8 The differential form $\omega$ is closed in a domain $D$ if it satisfies condition
(14.62) there.

The following theorem holds. 

Theorem (Poincaré’s lemma) If a form is closed in a ball, then it is exact there.

Here we are talking about a ball in $\mathbb{R}^n$ and a form of any order, so that
Proposition 2 is an elementary special case of this theorem.

The Poincaré lemma can also be interpreted as follows: The necessary condition
(14.62) for a form to be exact is also locally sufficient, that is, every point of a
domain in which (14.62) holds has a neighborhood in which $\omega$ is exact.

In particular, if a vector field $\mathbf{B}$ satisfies condition (14.61), it follows from the
Poincaré lemma that at least locally it is the curl of some vector field $\mathbf{A}$.

We shall not take the time at this point to prove this important theorem (those
who wish to do so can read it in Chap. 15). We prefer to conclude by explaining
in general outline the connection between the problem of the exactness of closed
forms and the topology of their domains of definition (based on information about
1-forms).

Example 6 Consider the plane $\mathbb{R}^2$ with two points $p_1$ and $p_2$ removed (Fig. 14.5),
and the paths $\gamma_0$, $\gamma_1$, and $\gamma_2$ whose supports are shown in the figure. The path $\gamma_2$ can
be contracted to a point inside $D$, and therefore if a closed form $\omega$ is given in $D$,
its integral over $\gamma_2$ is zero. The path $\gamma_0$ cannot be contracted to a point, but without
changing the value of the integral of the form, it can be homotopically converted
into the path $\gamma_1$.

The integral over $\gamma_1$ obviously reduces to the integral over one cycle enclosing
the point $p_1$ clockwise and the double of the integral over a cycle enclosing $p_2$
counterclockwise. If $T_1$ and $T_2$ are the integrals of the form $\omega$ over small circles
enclosing the points $p_1$ and $p_2$ and traversed, say, counterclockwise, one can see that
the integral of the form $\omega$ over any closed path in $D$ will be equal to $n_1 T_1 + n_2 T_2$,
where $n_1$ and $n_2$ are certain integers indicating how many times we have encircled
each of the holes $p_1$ and $p_2$ in the plane $\mathbb{R}^2$ and in which direction.

Circles $c_1$ and $c_2$ enclosing $p_1$ and $p_2$ serve as a sort of basis in which every
closed path $\gamma \subset D$ has the form $\gamma = n_1 c_1 + n_2 c_2$, up to a homotopy, which has
no effect on the integral. The quantities $\int_{c_i} \omega = T_i$ are called the cyclic constants
or the periods of the integral. If the domain is more complicated and there are $k$
independent elementary cycles, then in agreement with the expansion $\gamma = n_1 c_1 + \cdots + n_k c_k$,
it results that $\int_\gamma \omega = n_1 T_1 + \cdots + n_k T_k$. It turns out that for any set
$T_1, \ldots, T_k$ of numbers in such a domain one can construct a closed 1-form that will
have exactly that set of periods. (This is a special case of de Rham’s theorem; see
Chap. 15.)

For the sake of visualization, we have resorted here to considering a plane
domain, but everything that has been said can be repeated for any domain $D \subset \mathbb{R}^n$.
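The formula $\int_\gamma \omega = n_1 T_1 + n_2 T_2$ of Example 6 can be illustrated numerically. In the sketch below, which rests on our own assumptions, the removed points are taken to be $p_1 = (-1, 0)$ and $p_2 = (1, 0)$, and the closed form with prescribed periods $T_1$, $T_2$ is built as $\omega = \frac{T_1}{2\pi} d\varphi_1 + \frac{T_2}{2\pi} d\varphi_2$, where $\varphi_i$ is the polar angle measured around $p_i$ (compare Example 4). The helper names are hypothetical.

```python
import math

P1, P2 = (-1.0, 0.0), (1.0, 0.0)  # assumed positions of the removed points

def make_form(T1, T2):
    """Coefficients (a, b) of a closed 1-form a dx + b dy on the plane minus
    {P1, P2} whose periods around P1 and P2 are T1 and T2."""
    def omega(x, y):
        a = b = 0.0
        for (px, py), T in ((P1, T1), (P2, T2)):
            dx, dy = x - px, y - py
            r2 = dx * dx + dy * dy
            a += T / (2 * math.pi) * (-dy / r2)
            b += T / (2 * math.pi) * (dx / r2)
        return a, b
    return omega

def circle(center, radius, turns):
    """Closed path running `turns` times around a circle
    (positive = counterclockwise, negative = clockwise)."""
    cx, cy = center
    def path(t):
        ang = 2 * math.pi * turns * t
        return cx + radius * math.cos(ang), cy + radius * math.sin(ang)
    return path

def integrate_form(omega, path, n=20000):
    """Approximate the integral of omega over path(t), 0 <= t <= 1, using
    chords of the path and the form evaluated at the chord midpoints."""
    total = 0.0
    x0, y0 = path(0.0)
    for k in range(1, n + 1):
        x1, y1 = path(k / n)
        a, b = omega(0.5 * (x0 + x1), 0.5 * (y0 + y1))
        total += a * (x1 - x0) + b * (y1 - y0)
        x0, y0 = x1, y1
    return total
```

With `omega = make_form(1.0, 2.5)`, a large circle around both points should give approximately $T_1 + T_2$, a small circle traversed twice counterclockwise around $p_2$ approximately $2 T_2$, and a small circle enclosing neither point approximately zero.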

Example 7 In an anchor ring (the solid domain in R 3 enclosed by a torus) all closed 
paths are obviously homotopic to a circle that encircles the hole a certain number of 
times. This circle serves as the unique non-constant basic cycle c. 

Moreover, everything that has just been said can be repeated for paths of higher
dimension. If instead of one-dimensional closed paths (mappings of a circle or,
what is the same, mappings of the one-dimensional sphere) we take mappings
of a $k$-dimensional sphere, introduce the concept of homotopy for them, and
examine how many mutually nonhomotopic mappings of the $k$-dimensional sphere
into a given domain $D \subset \mathbb{R}^n$ exist, the result is a certain characteristic of the
domain $D$ which is formalized in topology as the so-called $k$th homotopy group of
$D$ and denoted $\pi_k(D)$. If all the mappings of the $k$-dimensional sphere into $D$ are
homotopic to a constant mapping, the group $\pi_k(D)$ is considered trivial. (It
consists of the identity element alone.) It can happen that $\pi_1(D)$ is trivial and $\pi_2(D)$ is
not.

Example 8 If $D$ is taken to be the space $\mathbb{R}^3$ with the point 0 removed, obviously
every closed path in $D$ can be contracted to a point, but a sphere enclosing the
point 0 cannot be homotopically converted to a point.

It turns out that the homotopy group $\pi_k(D)$ has less to do with the periods of a
closed $k$-form than the so-called homology group $H_k(D)$. (See Chap. 15.)

Example 9 From what has been said we can conclude that, for example, in the
domain $\mathbb{R}^3 \setminus 0$ every closed 1-form is exact ($\mathbb{R}^3 \setminus 0$ is a simply connected domain), but
not every closed 2-form is exact. In the language of vector fields, this means that
every irrotational field $\mathbf{A}$ in $\mathbb{R}^3 \setminus 0$ is the gradient of a function, but not every
source-free field $\mathbf{B}$ ($\operatorname{div} \mathbf{B} = 0$) is the curl of some field in this domain.
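A standard concrete instance of the last assertion (added here as an illustration) is the field $\mathbf{B} = \frac{\mathbf{r}}{r^3}$ in $\mathbb{R}^3 \setminus 0$, which up to a constant factor is the field of Examples 1 and 2. It is source-free, since

$$\operatorname{div} \mathbf{B} = \sum_{i=1}^{3} \frac{\partial}{\partial x^i} \frac{x^i}{r^3} = \sum_{i=1}^{3} \left( \frac{1}{r^3} - \frac{3 (x^i)^2}{r^5} \right) = \frac{3}{r^3} - \frac{3 r^2}{r^5} = 0,$$

but its flux across the sphere $S_R$ of radius $R$ with center at 0 is

$$\int_{S_R} \mathbf{B} \cdot d\boldsymbol{\sigma} = \frac{1}{R^2} \cdot 4\pi R^2 = 4\pi \neq 0.$$

If we had $\mathbf{B} = \operatorname{curl} \mathbf{A}$ in $\mathbb{R}^3 \setminus 0$, then by Stokes’ formula (14.50), applied to the closed surface $S_R$ (whose boundary is empty), this flux would be zero. Hence $\mathbf{B}$ is not the curl of any field in $\mathbb{R}^3 \setminus 0$.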

Example 10 To balance Example 9 we take the anchor ring. For the anchor ring the
group $\pi_1(D)$ is not trivial (see Example 7), but $\pi_2(D)$ is trivial, since every mapping
$f : S^2 \to D$ of the two-sphere into $D$ can be contracted to a constant mapping (any
image of a sphere can be contracted to a point). In this domain not every irrotational
field is a potential field, but every source-free field is the curl of some field.


14.3.6 Problems and Exercises 

1. Show that every central field A = f(r) r is a potential field. 

2. Let $\mathbf{F} = -\operatorname{grad} U$ be a potential force field. Show that the stable equilibrium
positions of a particle in such a field are the minima of the potential $U$ of that field.

3. For an electrostatic field $\mathbf{E}$ the Maxwell equations (formula (14.12) of Sect. 14.1),
as already noted, reduce to the pair of equations $\nabla \cdot \mathbf{E} = \frac{\rho}{\varepsilon_0}$ and $\nabla \times \mathbf{E} = 0$.

The condition $\nabla \times \mathbf{E} = 0$ means, at least locally, that $\mathbf{E} = -\operatorname{grad} \varphi$. The field of
a point charge is a potential field, and since every electric field is the sum (or
integral) of such fields, it is always a potential field. Substituting $\mathbf{E} = -\nabla \varphi$ in the first
equation of the electrostatic field, we find that its potential satisfies Poisson’s
equation⁹ $\Delta \varphi = -\frac{\rho}{\varepsilon_0}$. The potential $\varphi$ determines the field completely, so that describing
$\mathbf{E}$ reduces to finding the function $\varphi$, the solution of the Poisson equation.

Knowing the potential of a point charge (Example 2), solve the following prob- 
lem. 

a) Two charges $+q$ and $-q$ are located at the points $(0, 0, -d/2)$ and $(0, 0, d/2)$
in $\mathbb{R}^3$ with Cartesian coordinates $(x, y, z)$. Show that at distances that are large
relative to $d$ the potential of the electrostatic field has the form



where r is the absolute value of the radius-vector r of the point (x,y,z). 

b) Moving very far away from the charges is equivalent to moving the charges 
together, that is, decreasing the distance d. If we now fix the quantity qd =: p and 
decrease d. then in the limit we obtain the function q> — p in the domain 

M 3 \0. It is convenient to introduce the vector p equal to p in absolute value and 
directed from — q to +q. We call the pair of charges —q and +q and the construction 
obtained by the limiting procedure just described a dipole, and the vector p the 


9 S.D. Poisson (1781-1849) - French scientist, specializing in mechanics and physics; his main 
work was on theoretical and celestial mechanics, mathematical physics, and probability theory. 
The Poisson equation arose in his research into gravitational potential and attraction by spheroids. 
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dipole moment. The function <p> obtained in the limit is called the dipole potential. 
Find the asymptotics of the dipole potential as one moves away from the dipole 
along a ray forming angle 0 with the direction of the dipole moment. 

c) Let <p(t be the potential of a unit point charge and <p\ the dipole potential 
having dipole moment pi . Show that q > i = — (pi • V)<^o- 

d) We can repeat the construction with the limiting passage that we carried out
for a pair of charges in obtaining the dipole for the case of four charges (more
precisely, for two dipoles with moments $\mathbf{p}_1$ and $\mathbf{p}_2$) and obtain a
quadrupole and a corresponding potential. In general we can obtain a multipole of order j
with potential

$$\varphi_j = (-1)^j(\mathbf{p}_j\cdot\nabla)(\mathbf{p}_{j-1}\cdot\nabla)\cdots(\mathbf{p}_1\cdot\nabla)\varphi_0
= \sum_{i+k+l=j} Q^j_{ikl}\,\frac{\partial^j\varphi_0}{\partial x^i\,\partial y^k\,\partial z^l},$$

where $Q^j_{ikl}$ are the so-called components of the multipole moment. Carry out the
computations and verify the formula for the potential of a multipole in the case of a
quadrupole.

e) Show that the main term in the asymptotics of the potential of a cluster of
charges with increasing distance from the cluster is $\frac{1}{4\pi\varepsilon_0}\frac{Q}{r}$,
where Q is the total charge of the cluster.

f) Show that the main term of the asymptotics of the potential of an electrically
neutral body consisting of charges of opposite signs (for example, a molecule) at a
distance that is large compared to the dimensions of the body is
$\frac{1}{4\pi\varepsilon_0}\frac{(\mathbf{e}_r\cdot\mathbf{p})}{r^2}$. Here $\mathbf{e}_r$
is a unit vector directed from the body to the observer; $\mathbf{p} = \sum q_i\mathbf{d}_i$,
where $q_i$ is the magnitude of the i-th charge and $\mathbf{d}_i$ is its radius-vector.
The origin is chosen at some point of the body.

g) The potential of any cluster of charges at a great distance from the cluster
can be expanded (asymptotically) in functions of multipole potential type. Show
this using the example of the first two terms of such a potential (see d), e), and f)).

4. Determine whether the following domains are simply connected. 

a) the disk $\{(x, y) \in \mathbb{R}^2 \mid x^2 + y^2 < 1\}$;

b) the disk with its center removed $\{(x, y) \in \mathbb{R}^2 \mid 0 < x^2 + y^2 < 1\}$;

c) a ball with its center removed $\{(x, y, z) \in \mathbb{R}^3 \mid 0 < x^2 + y^2 + z^2 < 1\}$;

d) an annulus $\{(x, y) \in \mathbb{R}^2 \mid \frac{1}{2} < x^2 + y^2 < 1\}$;

e) a spherical annulus $\{(x, y, z) \in \mathbb{R}^3 \mid \frac{1}{2} < x^2 + y^2 + z^2 < 1\}$;

f) an anchor ring in R 3 . 

5. a) Give the definition of homotopy of paths with endpoints fixed. 

b) Prove that a domain is simply connected if and only if every two paths in 
it having common initial and terminal points are homotopic in the sense of the 
definition given in part a). 

6. Show that 

a) every continuous mapping $f : S^1 \to S^2$ of a circle $S^1$ (a one-dimensional
sphere) into a two-dimensional sphere $S^2$ can be contracted in $S^2$ to a point (a
constant mapping);

b) every continuous mapping $f : S^2 \to S^1$ is also homotopic to a single point;

c) every mapping $f : S^1 \to S^1$ is homotopic to a mapping $\varphi \mapsto n\varphi$ for some
$n \in \mathbb{Z}$, where $\varphi$ is the polar angle;

d) every mapping of the sphere $S^2$ into an anchor ring is homotopic to a mapping
to a single point;

e) every mapping of a circle $S^1$ into an anchor ring is homotopic to a closed path
encircling the hole in the anchor ring n times, for some $n \in \mathbb{Z}$.

7. In the domain $\mathbb{R}^3 \setminus 0$ (three-dimensional space with the point 0 removed)
construct:

a) a closed but not exact 2-form; 

b) a source-free vector field that is not the curl of any vector field in that domain. 

8. a) Can there be closed, but not exact forms of degree $p < n - 1$ in the domain
$D = \mathbb{R}^n \setminus 0$ (the space $\mathbb{R}^n$ with the point 0 removed)?

b) Construct a closed but not exact form of degree $p = n - 1$ in $D = \mathbb{R}^n \setminus 0$.

9. If a 1-form $\omega$ is closed in a domain $D \subset \mathbb{R}^n$, then by Proposition 2 every
point $x \in D$ has a neighborhood $U(x)$ inside which $\omega$ is exact. From now on $\omega$ is
assumed to be a closed form.

a) Show that if two paths $\gamma_i : [0, 1] \to D$, $i = 1, 2$, have the same initial and
terminal points and differ only on an interval $[\alpha, \beta] \subset [0, 1]$ whose image under
either of the mappings $\gamma_i$ is contained inside the same neighborhood $U(x)$, then

$$\int_{\gamma_1}\omega = \int_{\gamma_2}\omega.$$

b) Show that for every path $[0, 1] \ni t \mapsto \gamma(t) \in D$ one can find a number
$\delta > 0$ such that if the path $\tilde\gamma$ has the same initial and terminal point as
$\gamma$ and differs from $\gamma$ at most by $\delta$, that is,
$\max_{0\le t\le 1}|\tilde\gamma(t) - \gamma(t)| \le \delta$, then
$\int_{\tilde\gamma}\omega = \int_{\gamma}\omega$.

c) Show that if two paths $\gamma_1$ and $\gamma_2$ with the same initial and terminal points are
homotopic in D as paths with fixed endpoints, then $\int_{\gamma_1}\omega = \int_{\gamma_2}\omega$
for any closed form $\omega$ in D.

10. a) It will be proved below that every continuous mapping $\Gamma : I^2 \to D$ of the
square $I^2$ can be uniformly approximated with arbitrary accuracy by a smooth mapping
(in fact by a mapping with polynomial components). Deduce from this that if
the paths $\gamma_1$ and $\gamma_2$ in the domain D are homotopic, then for every
$\varepsilon > 0$ there exist smooth mutually homotopic paths $\tilde\gamma_1$ and
$\tilde\gamma_2$ such that $\max_{0\le t\le 1}|\tilde\gamma_i(t) - \gamma_i(t)| \le \varepsilon$,
$i = 1, 2$.

b) Using the results of Problem 9, show now that if the integrals of a closed
form in D over smooth homotopic paths are equal, then they are equal for any paths
that are homotopic in this domain (regardless of the smoothness of the homotopy).
The paths themselves, of course, are assumed to be as regular as they need to be for
integration over them.

11. a) Show that if the forms $\omega^p$, $\omega^{p-1}$, and $\tilde\omega^{p-1}$ are such that
$\omega^p = d\omega^{p-1} = d\tilde\omega^{p-1}$, then (at least locally) one can find a form
$\omega^{p-2}$ such that $\tilde\omega^{p-1} = \omega^{p-1} + d\omega^{p-2}$. (The fact that any
two forms that differ by the differential of a form have the same differential obviously
follows from the relation $d^2\omega = 0$.)

b) Show that the potential $\varphi$ of an electrostatic field (Problem 3) is determined
up to an additive constant, which is fixed if we require that the potential tend to zero
at infinity.

12. The Maxwell equations (formula (14.12) of Sect. 14.1) yield the following pair
of magnetostatic equations: $\nabla\cdot\mathbf{B} = 0$,
$\nabla\times\mathbf{B} = \frac{\mathbf{j}}{\varepsilon_0 c^2}$. The first of these shows that
at least locally, B has a vector potential A, that is, $\mathbf{B} = \nabla\times\mathbf{A}$.

a) Describe the amount of arbitrariness in the choice of the potential A of the
magnetic field B (see Problem 11a)).

b) Let x, y, z be Cartesian coordinates in $\mathbb{R}^3$. Find potentials A for a uniform
magnetic field B directed along the z-axis, each satisfying one of the following
additional requirements: the field A must have the form $(0, A_y, 0)$; the field A must
have the form $(A_x, 0, 0)$; the field A must have the form $(A_x, A_y, 0)$; the field A
must be invariant under rotations about the z-axis.

c) Show that the choice of the potential A satisfying the additional requirement
$\nabla\cdot\mathbf{A} = 0$ reduces to solving Poisson’s equation; more precisely, to finding a
scalar-valued function $\psi$ satisfying the equation $\Delta\psi = f$ for a given scalar-valued
function f.

d) Show that if the potential A of a static magnetic field B is chosen so that
$\nabla\cdot\mathbf{A} = 0$, it will satisfy the vector Poisson equation
$\Delta\mathbf{A} = -\frac{\mathbf{j}}{\varepsilon_0 c^2}$. Thus, invoking
the potential makes it possible to reduce the problem of finding electrostatic and
magnetostatic fields to solving Poisson’s equation.

13. The following theorem of Helmholtz$^{10}$ is well known: Every smooth field F
in a domain D of oriented Euclidean space $\mathbb{R}^3$ can be decomposed into a sum
$\mathbf{F} = \mathbf{F}_1 + \mathbf{F}_2$ of an irrotational field $\mathbf{F}_1$ and a
solenoidal field $\mathbf{F}_2$. Show that the construction of such a decomposition can be
reduced to solving a certain Poisson equation.

14. Suppose a given mass of a certain substance passes from a state characterized
thermodynamically by the parameters $V_0$, $P_0$ $(T_0)$ into the state V, P, (T). Assume
that the process takes place slowly (quasi-statically) and over a path $\gamma$ in the plane of
states (with coordinates V, P). It can be proved in thermodynamics that the quantity
$S = \int_\gamma \frac{\delta Q}{T}$, where $\delta Q$ is the heat exchange form, depends only
on the initial point $(V_0, P_0)$ and the terminal point (V, P) of the path, that is, after one
of these points is fixed, for example $(V_0, P_0)$, S becomes a function of the state (V, P)
of the system. This function is called the entropy of the system.

a) Deduce from this that the form $\omega = \frac{\delta Q}{T}$ is exact, and that
$\omega = dS$.

b) Using the form of $\delta Q$ given in Problem 6 of Sect. 13.1 for an ideal gas, find
the entropy of an ideal gas.


10 H.L.F. Helmholtz (1821-1894) - German physicist and mathematician; one of the first to
discover the general law of conservation of energy. Actually, he was the first to make a clear
distinction between the concepts of force and energy.


14.4 Examples of Applications

To show the concepts we have introduced in action, and also to explain the physical 
meaning of the Gauss-Ostrogradskii-Stokes formula as a conservation law, we shall 
examine here some illustrative and important equations of mathematical physics. 


14.4.1 The Heat Equation 


We are studying the scalar field $T = T(x, y, z, t)$ of the temperature of a body being
observed as a function of the point (x, y, z) of the body and the time t. As a result
of heat transfer between various parts of the body the field T may vary. However,
this variation is not arbitrary; it is subject to a particular law which we now wish to
write out explicitly.

Let D be a certain three-dimensional part of the observed body bounded by a 
surface S. If there are no heat sources inside S, a change in the internal energy of 
the substance in D can occur only as the result of heat transfer, that is, in this case 
by the transfer of energy across the boundary S of D. 

By computing separately the variation in internal energy in the volume D and the 
flux of energy across the surface S, we can use the law of conservation of energy to 
equate these two quantities and obtain the needed relation. 

It is known that an increase in the temperature of a homogeneous mass m by
$\Delta T$ requires energy $cm\,\Delta T$, where c is the specific heat capacity of the substance
under consideration. Hence if our field T changes by
$\Delta T = T(x, y, z, t + \Delta t) - T(x, y, z, t)$ over the time interval $\Delta t$, the
internal energy in D will have changed by an amount

$$\iiint_D c\rho\,\Delta T\,dV, \tag{14.63}$$

where $\rho = \rho(x, y, z)$ is the density of the substance.

It is known from experiments that over a wide range of temperatures the quantity
of heat flowing across a distinguished area $d\boldsymbol{\sigma} = \mathbf{n}\,d\sigma$ per unit
time as the result of heat transfer is proportional to the flux
$-\operatorname{grad} T\cdot d\boldsymbol{\sigma}$ of the field $-\operatorname{grad} T$ across
that area (the gradient is taken with respect to the spatial variables x, y, z). The
coefficient of proportionality k depends on the substance and is called its coefficient
of thermal conductivity. The negative sign in front of grad T corresponds to the fact
that the energy flows from hotter parts of the body to cooler parts. Thus, the energy
flux (up to terms of order $o(\Delta t)$)

$$\Delta t\iint_S -k\,\operatorname{grad} T\cdot d\boldsymbol{\sigma} \tag{14.64}$$

takes place across the boundary S of D in the direction of the external normal over
the time interval $\Delta t$.





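Equating the quantity (14.63) to the negative of the quantity (14.64), dividing
by $\Delta t$, and passing to the limit as $\Delta t \to 0$, we obtain

$$\iiint_D c\rho\,\frac{\partial T}{\partial t}\,dV = \iint_S k\,\operatorname{grad} T\cdot d\boldsymbol{\sigma}. \tag{14.65}$$

This equality is the equation for the function T. Assuming T is sufficiently
smooth, we transform (14.65) using the Gauss-Ostrogradskii formula:

$$\iiint_D c\rho\,\frac{\partial T}{\partial t}\,dV = \iiint_D \operatorname{div}(k\,\operatorname{grad} T)\,dV.$$

Hence, since D is arbitrary, it follows obviously that

$$c\rho\,\frac{\partial T}{\partial t} = \operatorname{div}(k\,\operatorname{grad} T). \tag{14.66}$$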


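We have now obtained the differential version of the integral equation (14.65).

If there were heat sources (or sinks) in D whose intensities have density
F(x, y, z, t), instead of (14.65) we would write the equality

$$\iiint_D c\rho\,\frac{\partial T}{\partial t}\,dV = \iint_S k\,\operatorname{grad} T\cdot d\boldsymbol{\sigma} + \iiint_D F\,dV, \tag{14.65'}$$

and then instead of (14.66) we would have the equation

$$c\rho\,\frac{\partial T}{\partial t} = \operatorname{div}(k\,\operatorname{grad} T) + F. \tag{14.66'}$$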

If the body is assumed isotropic and homogeneous with respect to its heat
conductivity, the coefficient k in (14.66') will be constant, and the equation will
transform to the canonical form

$$\frac{\partial T}{\partial t} = a^2\,\Delta T + f, \tag{14.67}$$

where $f = \frac{F}{c\rho}$ and $a^2 = \frac{k}{c\rho}$ is the coefficient of thermal
diffusivity. The equation (14.67) is usually called the heat equation.
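
To see Eq. (14.67) at work, one can follow a single temperature profile numerically. The following is only a minimal sketch, under assumptions that are not in the text: one space dimension, no sources (f = 0), a rod of unit length whose ends are kept at temperature 0, and the simplest explicit difference scheme. The function names and the grid parameters are illustrative. For the initial profile sin(πx) the exact solution is exp(−a²π²t) sin(πx), so the two values at the midpoint can be compared.

```python
import math

def heat_step(u, r):
    """One explicit step of u_t = a^2 u_xx with the end values kept fixed.
    Here r = a^2 * dt / dx^2; the scheme is stable only for r <= 1/2."""
    new = u[:]
    for i in range(1, len(u) - 1):
        new[i] = u[i] + r * (u[i - 1] - 2.0 * u[i] + u[i + 1])
    return new

def decay_of_sine_mode(a=1.0, n=50, r=0.25, t_end=0.05):
    """Evolve T(x, 0) = sin(pi x) on [0, 1] with T = 0 at both ends.
    Returns (numerical, exact) temperature at the midpoint x = 1/2."""
    dx = 1.0 / n
    dt = r * dx * dx / (a * a)
    steps = int(round(t_end / dt))
    u = [math.sin(math.pi * i * dx) for i in range(n + 1)]
    for _ in range(steps):
        u = heat_step(u, r)
    t = steps * dt
    exact = math.exp(-(a * math.pi) ** 2 * t)  # sin(pi/2) = 1 at the midpoint
    return u[n // 2], exact

if __name__ == "__main__":
    numerical, exact = decay_of_sine_mode()
    print(numerical, exact)
```

The two numbers agree to several digits for these parameters; the restriction r ≤ 1/2 is a property of this particular scheme, not of the heat equation itself.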

In the case of steady-state heat transfer, in which the field T is independent of
time, this equation becomes Poisson’s equation

$$\Delta T = \varphi, \tag{14.68}$$

where $\varphi = -\frac{1}{a^2}f$, and if in addition there are no heat sources in the body, the
result is Laplace’s equation

$$\Delta T = 0. \tag{14.69}$$

The solutions of Laplace’s equation, as already noted, are called harmonic
functions. In the thermophysical interpretation, harmonic functions correspond to
steady-state temperature fields in a body in which the heat flows occur without any
sinks or sources inside the body itself, that is, all sources are located outside the
body. For example, if we maintain a steady temperature distribution
$T|_{\partial V} = \tau$ over the boundary $\partial V$ of a body, then the temperature field in
the body V will eventually stabilize in the form of a harmonic function T. Such an
interpretation of the solutions of the Laplace equation (14.69) enables us to predict a number
of properties of harmonic functions. For example, one must presume that a harmonic function
in V cannot have local maxima inside the body; otherwise heat would only flow
away from these hotter portions of the body, and they would cool off, contrary to
the assumption that the field is stationary.
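
The absence of interior maxima can be observed on a discrete model. The sketch below is an illustration under our own assumptions (a square plate, a uniform grid, and the five-point average as the discrete analogue of Laplace's equation); the names `relax_laplace` and `interior_and_boundary_extremes` are hypothetical. One edge is held at temperature 100 and the other three at 0, and repeated averaging lets the interior settle toward the stationary field.

```python
def relax_laplace(boundary, n=12, sweeps=400):
    """Jacobi relaxation for the discrete Laplace equation on an n x n grid.
    boundary(i, j) gives the prescribed temperature at the edge nodes."""
    T = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            if i in (0, n - 1) or j in (0, n - 1):
                T[i][j] = boundary(i, j)
    for _ in range(sweeps):
        new = [row[:] for row in T]
        for i in range(1, n - 1):
            for j in range(1, n - 1):
                # each interior value is replaced by the mean of its four neighbors
                new[i][j] = 0.25 * (T[i - 1][j] + T[i + 1][j] + T[i][j - 1] + T[i][j + 1])
        T = new
    return T

def interior_and_boundary_extremes(T):
    """Return (max inside, min inside, max on the edge, min on the edge)."""
    n = len(T)
    inner = [T[i][j] for i in range(1, n - 1) for j in range(1, n - 1)]
    edge = [T[i][j] for i in range(n) for j in range(n)
            if i in (0, n - 1) or j in (0, n - 1)]
    return max(inner), min(inner), max(edge), min(edge)

if __name__ == "__main__":
    field = relax_laplace(lambda i, j: 100.0 if i == 0 else 0.0)
    print(interior_and_boundary_extremes(field))
```

Every interior value stays strictly between the smallest and the largest boundary temperature, in agreement with the heuristic argument just given.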


14.4.2 The Equation of Continuity 

Let $\rho = \rho(x, y, z, t)$ be the density of a material medium that fills a space being
observed and $\mathbf{v} = \mathbf{v}(x, y, z, t)$ the velocity field of motion of the medium as
a function of the point of space (x, y, z) and the time t.

From the law of conservation of mass, using the Gauss-Ostrogradskii formula, 
we can find an interconnection between these quantities. 

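Let D be a domain in the space being observed bounded by a surface S. Over the
time interval $\Delta t$ the quantity of matter in D varies by an amount

$$\iiint_D \bigl(\rho(x, y, z, t + \Delta t) - \rho(x, y, z, t)\bigr)\,dV.$$

Over this small time interval $\Delta t$, the flow of matter across the surface S in the
direction of the outward normal to S is (up to $o(\Delta t)$)

$$\Delta t\cdot\iint_S \rho\mathbf{v}\cdot d\boldsymbol{\sigma}.$$

If there were no sources or sinks in D, then by the law of conservation of matter,
we would have

$$\iiint_D \Delta\rho\,dV = -\Delta t\iint_S \rho\mathbf{v}\cdot d\boldsymbol{\sigma}$$

or, in the limit as $\Delta t \to 0$

$$\iiint_D \frac{\partial\rho}{\partial t}\,dV = -\iint_S \rho\mathbf{v}\cdot d\boldsymbol{\sigma}.$$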

Applying the Gauss-Ostrogradskii formula to the right-hand side of this equality
and taking account of the fact that D is an arbitrary domain, we conclude that the
following relation must hold for sufficiently smooth functions $\rho$ and $\mathbf{v}$:

$$\frac{\partial\rho}{\partial t} = -\operatorname{div}(\rho\mathbf{v}), \tag{14.70}$$

called the equation of continuity of a continuous medium.

In vector notation the equation of continuity can be written as

$$\frac{\partial\rho}{\partial t} + \nabla\cdot(\rho\mathbf{v}) = 0, \tag{14.70'}$$

or, in more expanded form,

$$\frac{\partial\rho}{\partial t} + \mathbf{v}\cdot\nabla\rho + \rho\,\nabla\cdot\mathbf{v} = 0. \tag{14.70''}$$

If the medium is incompressible (a liquid), the volumetric outflow of the medium
across a closed surface S must be zero:

$$\iint_S \mathbf{v}\cdot d\boldsymbol{\sigma} = 0,$$

from which (again on the basis of the Gauss-Ostrogradskii formula) it follows that
for an incompressible medium

$$\operatorname{div}\mathbf{v} = 0. \tag{14.71}$$

Hence, for an incompressible medium of variable density (a mixture of water and
oil) Eq. (14.70'') becomes

$$\frac{\partial\rho}{\partial t} + \mathbf{v}\cdot\nabla\rho = 0. \tag{14.72}$$

If the medium is also homogeneous, then $\nabla\rho = 0$ and therefore
$\frac{\partial\rho}{\partial t} = 0$.
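
The balance between the change of mass in a volume and the flux across its boundary can be imitated on a grid. The sketch below is a one-dimensional illustration under assumptions of our own: a periodic interval, cells of width dx, and a simple upwind rule for the flux ρv through each cell face. None of these choices comes from the text, and the function names are hypothetical. Because every face flux leaves one cell and enters its neighbor, the total mass is conserved up to rounding, which is the discrete counterpart of the derivation of (14.70).

```python
import math

def advect_conservative(rho, v_face, dx, dt):
    """One step for d(rho)/dt + d(rho*v)/dx = 0 on a periodic grid.
    v_face[i] is the velocity at the left face of cell i; the density in the
    flux is taken from the side the flow comes from (upwind)."""
    n = len(rho)
    flux = []
    for i in range(n):
        left = rho[(i - 1) % n]
        right = rho[i]
        v = v_face[i]
        flux.append(v * (left if v >= 0.0 else right))
    # what leaves cell i through its right face enters cell i + 1
    return [rho[i] - dt / dx * (flux[(i + 1) % n] - flux[i]) for i in range(n)]

def total_mass(rho, dx):
    return sum(rho) * dx

def demo(n=100, steps=200):
    dx = 1.0 / n
    dt = 0.4 * dx  # keeps max|v| * dt / dx below 1 for the velocities used here
    rho = [1.0 + 0.5 * math.sin(2 * math.pi * (i + 0.5) * dx) for i in range(n)]
    v_face = [1.0 + 0.3 * math.cos(2 * math.pi * i * dx) for i in range(n)]
    m0 = total_mass(rho, dx)
    for _ in range(steps):
        rho = advect_conservative(rho, v_face, dx, dt)
    return m0, total_mass(rho, dx), min(rho)

if __name__ == "__main__":
    print(demo())
```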


14.4.3 The Basic Equations of the Dynamics of Continuous Media 

We shall now derive the equations of the dynamics of a continuous medium moving
in space. Together with the functions $\rho$ and $\mathbf{v}$ already considered, which will again
denote the density and the velocity of the medium at a given point (x, y, z) of space
and at a given instant t of time, we consider the pressure $p = p(x, y, z, t)$ as a
function of a point of space and time.

In the space occupied by the medium we distinguish a domain D bounded by a 
surface S and consider the forces acting on the distinguished volume of the medium 
at a fixed instant of time. 

Certain force fields (for example, gravitation) are acting on each element $\rho\,dV$
of mass of the medium. These fields create the so-called mass forces. Let
$\mathbf{F} = \mathbf{F}(x, y, z, t)$ be the density of the external fields of mass force. Then a
force $\mathbf{F}\rho\,dV$ acts on the element from the direction of these fields. If this element
has an acceleration $\mathbf{a}$ at a given instant of time, then by Newton’s second law, this is
equivalent to the presence of another mass force called inertia, equal to
$-\mathbf{a}\rho\,dV$.

Finally, on each element $d\boldsymbol{\sigma} = \mathbf{n}\,d\sigma$ of the surface S there is a
surface tension due to the pressure of the particles of the medium near those in D, and this
surface force equals $-p\,d\boldsymbol{\sigma}$ (where $\mathbf{n}$ is the outward normal to S).

By d’Alembert’s principle, at each instant during the motion of any material system,
all the forces applied to it, including inertia, are in mutual equilibrium, that is,
the force required to balance them is zero. In our case, this means that

$$\iiint_D (\mathbf{F} - \mathbf{a})\rho\,dV - \iint_S p\,d\boldsymbol{\sigma} = 0. \tag{14.73}$$

The first term in this sum is the equilibrant of the mass and inertial forces, and
the second is the equilibrant of the pressure on the surface S bounding the volume.
For simplicity we shall assume that we are dealing with an ideal (nonviscous) fluid
or gas, in which the pressure on the surface element $d\boldsymbol{\sigma}$ has the form
$p\,d\boldsymbol{\sigma}$, where the number p is independent of the orientation of the area in the
space.

Applying formula (14.47) from Sect. 14.2, we find by (14.73) that

$$\iiint_D (\mathbf{F} - \mathbf{a})\rho\,dV - \iiint_D \operatorname{grad} p\,dV = 0,$$

from which, since the domain D is arbitrary, it follows that

$$\rho\mathbf{a} = \rho\mathbf{F} - \operatorname{grad} p. \tag{14.74}$$

In this local form the equation of motion of the medium corresponds perfectly to
Newton’s law of motion for a material particle.

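The acceleration $\mathbf{a}$ of a particle of the medium is the derivative
$\frac{d\mathbf{v}}{dt}$ of the velocity $\mathbf{v}$ of the particle. If $x = x(t)$, $y = y(t)$,
$z = z(t)$ is the law of motion of a particle in space and
$\mathbf{v} = \mathbf{v}(x, y, z, t)$ is the velocity field of the medium, then for each
individual particle we obtain

$$\mathbf{a} = \frac{d\mathbf{v}}{dt} = \frac{\partial\mathbf{v}}{\partial t}
+ \frac{\partial\mathbf{v}}{\partial x}\frac{dx}{dt}
+ \frac{\partial\mathbf{v}}{\partial y}\frac{dy}{dt}
+ \frac{\partial\mathbf{v}}{\partial z}\frac{dz}{dt}$$

or

$$\mathbf{a} = \frac{\partial\mathbf{v}}{\partial t} + (\mathbf{v}\cdot\nabla)\mathbf{v}.$$

Thus the equation of motion (14.74) assumes the following form

$$\frac{d\mathbf{v}}{dt} = \mathbf{F} - \frac{1}{\rho}\operatorname{grad} p \tag{14.75}$$

or

$$\frac{\partial\mathbf{v}}{\partial t} + (\mathbf{v}\cdot\nabla)\mathbf{v} = \mathbf{F} - \frac{1}{\rho}\nabla p. \tag{14.76}$$

Equation (14.76) is usually called Euler’s hydrodynamic equation.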
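
The term (v · ∇)v is easy to check on a flow whose particle accelerations are known in advance. As an illustration (the example and the names below are our own choice, not the text's), take the steady rigid rotation v = (−ωy, ωx) in the plane. Each particle moves on a circle with angular velocity ω, so its acceleration is the centripetal one, −ω²(x, y). Since the field does not depend on t, the formula a = ∂v/∂t + (v · ∇)v must reproduce this value from the convective term alone.

```python
def velocity(x, y, omega=2.0):
    """Steady rigid rotation about the z-axis: v = (-omega*y, omega*x)."""
    return (-omega * y, omega * x)

def convective_acceleration(field, x, y, h=1e-5):
    """(v . grad) v at the point (x, y), with the partial derivatives of the
    field approximated by central differences of step h."""
    vx, vy = field(x, y)
    dv_dx = [(p - m) / (2 * h) for p, m in zip(field(x + h, y), field(x - h, y))]
    dv_dy = [(p - m) / (2 * h) for p, m in zip(field(x, y + h), field(x, y - h))]
    return tuple(vx * dv_dx[k] + vy * dv_dy[k] for k in range(2))

if __name__ == "__main__":
    # with omega = 2 the centripetal acceleration at (0.3, 0.4) is -4 * (0.3, 0.4)
    print(convective_acceleration(velocity, 0.3, 0.4))
```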

The vector equation (14.76) is equivalent to a system of three scalar equations
for the three components of the vector $\mathbf{v}$ and the pair of functions $\rho$ and p.

Thus, Euler’s equation does not completely determine the motion of an ideal
continuous medium. To be sure, it is natural to adjoin to it the equation of continuity
(14.70), but even then the system is underdetermined.

To make the motion of the medium determinate one must also add to Eqs. (14.70)
and (14.76) some information on the thermodynamic state of the medium (for example,
the equation of state $f(p, \rho, T) = 0$ and the equation for heat transfer). The
reader may obtain some idea of what these relations can yield in the final subsection
of this section.


14.4.4 The Wave Equation 

We now consider the motion of a medium corresponding to the propagation of an 
acoustic wave. It is clear that such a motion is also subject to Eq. (14.76); this 
equation can be simplified due to the specifics of the phenomenon. 

Sound is an alternating state of rarefaction and compression of a medium, the 
deviation of the pressure from its mean value in a sound wave being very small - 
of the order of 1 %. Therefore acoustic motion consists of small deviations of the 
elements of volume of the medium from the equilibrium position at small velocities. 
However, the rate of propagation of the disturbance (wave) through the medium is 
comparable with the mean velocity of motion of the molecules of the medium and 
usually exceeds the rate of heat transfer between the different parts of the medium 
under consideration. Thus, an acoustic motion of a volume of gas can be regarded 
as small oscillations about the equilibrium position occurring without heat transfer 
(an adiabatic process). 

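Neglecting the term $(\mathbf{v}\cdot\nabla)\mathbf{v}$ in the equation of motion (14.76) in view
of the small size of the macroscopic velocities $\mathbf{v}$, we obtain the equality

$$\rho\frac{\partial\mathbf{v}}{\partial t} = \rho\mathbf{F} - \nabla p.$$

If we neglect the term of the form $\frac{\partial\rho}{\partial t}\mathbf{v}$ for the same
reason, the last equality reduces to the equation

$$\frac{\partial}{\partial t}(\rho\mathbf{v}) = \rho\mathbf{F} - \nabla p.$$

Applying the operator $\nabla\cdot$ (the divergence with respect to the coordinates x, y, z) to
it, we obtain

$$\frac{\partial}{\partial t}(\nabla\cdot\rho\mathbf{v}) = \nabla\cdot\rho\mathbf{F} - \Delta p.$$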

Using the equation of continuity (14.70') and introducing the notation
$\nabla\cdot\rho\mathbf{F} = -\Phi$, we arrive at the equation

$$\frac{\partial^2\rho}{\partial t^2} = \Phi + \Delta p. \tag{14.77}$$

If we can neglect the influence of the exterior fields, Eq. (14.77) reduces to the
relation

$$\frac{\partial^2\rho}{\partial t^2} = \Delta p \tag{14.78}$$

between the density and pressure in the acoustic medium. Since the process is adiabatic,
the equation of state $f(p, \rho, T) = 0$ reduces to a relation $\rho = \psi(p)$, from
which it follows that
$\frac{\partial^2\rho}{\partial t^2} = \psi'(p)\frac{\partial^2 p}{\partial t^2} + \psi''(p)\left(\frac{\partial p}{\partial t}\right)^2$.
Since the pressure oscillations are small in an acoustic wave, one may assume that
$\psi'(p) \equiv \psi'(p_0)$, where $p_0$ is the equilibrium pressure. Then $\psi'' = 0$ and
$\frac{\partial^2\rho}{\partial t^2} \approx \psi'(p)\frac{\partial^2 p}{\partial t^2}$. Taking
this into account, from (14.78) we finally obtain

$$\frac{\partial^2 p}{\partial t^2} = a^2\,\Delta p, \tag{14.79}$$

where $a = (\psi'(p_0))^{-1/2}$. This equation describes the variation in pressure in a
medium in a state of acoustic motion. Equation (14.79) describes the simplest wave
process in a continuous medium. It is called the homogeneous wave equation. The
quantity a has a simple physical meaning: it is the speed of propagation of an acoustic
disturbance in the medium, that is, the speed of sound in it (see Problem 4).
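
For orientation, the formula a = (ψ'(p₀))^(−1/2) can be evaluated for air with the data quoted in Problem 4 of this section. The sketch below is only an arithmetic illustration; the function names are ours. It treats the two classical hypotheses: an isothermal process, where ρ = μp/(RT), and an adiabatic one, where p = cρ^γ and hence dp/dρ = γp/ρ = γRT/μ.

```python
import math

R = 8.31       # J/(deg*mole), universal gas constant (value quoted in Problem 4)
MU = 28.8e-3   # kg/mole, molecular weight of air (28.8 g/mole)
T = 273.0      # K, that is, 0 degrees C
GAMMA = 1.4    # adiabatic exponent of air

def sound_speed_isothermal(R=R, mu=MU, T=T):
    """a = (psi'(p0))**(-1/2) with rho = psi(p) = mu*p/(R*T), so a = sqrt(R*T/mu)."""
    return math.sqrt(R * T / mu)

def sound_speed_adiabatic(gamma=GAMMA, R=R, mu=MU, T=T):
    """With p = c*rho**gamma we get dp/drho = gamma*R*T/mu, so a = sqrt(gamma*R*T/mu)."""
    return math.sqrt(gamma * R * T / mu)

if __name__ == "__main__":
    print(round(sound_speed_isothermal(), 1), round(sound_speed_adiabatic(), 1))
```

The isothermal value comes out close to 280 m/s and the adiabatic one close to 330 m/s, the two figures attributed in Problem 4 to Newton and Laplace respectively.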

In the case of forced oscillations, when certain forces are acting on each element
of volume of the medium, the three-dimensional density of whose distribution is
given, Eq. (14.79) is replaced by the relation

$$\frac{\partial^2 p}{\partial t^2} = a^2\,\Delta p + f \tag{14.80}$$

corresponding to Eq. (14.77), which for $f \not\equiv 0$ is called the inhomogeneous wave
equation.
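
In one space dimension the homogeneous equation (14.79) is satisfied by any sufficiently smooth superposition f(x + at) + g(x − at) of two profiles moving in opposite directions (this is the content of Problem 4 below). A quick numerical check is sketched here; the particular profiles, the value a = 3 and the difference step are arbitrary choices made for the illustration.

```python
import math

def p(x, t, a=3.0):
    """A superposition f(x + a t) + g(x - a t) with f = sin and g a Gaussian bump."""
    return math.sin(x + a * t) + math.exp(-(x - a * t) ** 2)

def wave_residual(x, t, a=3.0, h=1e-3):
    """Central-difference estimate of p_tt - a^2 p_xx; it should be close to zero."""
    p_tt = (p(x, t + h, a) - 2 * p(x, t, a) + p(x, t - h, a)) / h ** 2
    p_xx = (p(x + h, t, a) - 2 * p(x, t, a) + p(x - h, t, a)) / h ** 2
    return p_tt - a * a * p_xx

if __name__ == "__main__":
    for point in [(0.7, 0.2), (-1.3, 0.05), (2.0, 1.0)]:
        print(point, wave_residual(*point))
```

The residual is of the size of the discretization error, far smaller than either of the two second derivatives taken separately.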


14.4.5 Problems and Exercises 

1. Suppose the velocity field $\mathbf{v}$ of a moving continuous medium is a potential field.
Show that if the medium is incompressible, the potential $\varphi$ of the field $\mathbf{v}$ is a
harmonic function, that is, $\Delta\varphi = 0$ (see (14.71)).

2. a) Show that Euler’s equation (14.76) can be rewritten as

$$\frac{\partial\mathbf{v}}{\partial t} + \operatorname{grad}\left(\frac{1}{2}v^2\right) - \mathbf{v}\times\operatorname{curl}\mathbf{v} = \mathbf{F} - \frac{1}{\rho}\operatorname{grad} p$$

(see Problem 1 of Sect. 14.1).

b) Verify on the basis of the equation of a) that an irrotational flow
($\operatorname{curl}\mathbf{v} = 0$) of a homogeneous incompressible liquid can occur only in a
potential field $\mathbf{F}$.

c) It turns out (Lagrange’s theorem) that if at some instant the flow in a potential
field $\mathbf{F} = \operatorname{grad} U$ is irrotational, then it always has been and always
will be irrotational. Such a flow consequently is at least locally a potential flow, that is,
$\mathbf{v} = \operatorname{grad}\varphi$. Verify that for a potential flow of a homogeneous
incompressible liquid taking place in a potential field $\mathbf{F}$, the following relation
holds at each instant of time:

$$\operatorname{grad}\left(\frac{\partial\varphi}{\partial t} + \frac{v^2}{2} + \frac{p}{\rho} - U\right) = 0.$$

d) Derive the so-called Cauchy integral from the equality just obtained:

$$\frac{\partial\varphi}{\partial t} + \frac{v^2}{2} + \frac{p}{\rho} - U = \Phi(t),$$

a relation that asserts that the left-hand side is independent of the spatial coordinates.

e) Show that if the flow is also steady-state, that is, the field $\mathbf{v}$ is independent of
time, the following relation holds

$$\frac{v^2}{2} + \frac{p}{\rho} - U = \text{const},$$

called the Bernoulli integral.

3. A flow whose velocity field has the form $\mathbf{v} = (v_x, v_y, 0)$ is naturally called
plane-parallel or simply a planar flow.

a) Show that the conditions $\operatorname{div}\mathbf{v} = 0$,
$\operatorname{curl}\mathbf{v} = 0$ for a flow to be incompressible and irrotational have the
following forms:

$$\frac{\partial v_x}{\partial x} = -\frac{\partial v_y}{\partial y}, \qquad \frac{\partial v_x}{\partial y} = \frac{\partial v_y}{\partial x}.$$

b) Show that these equations at least locally guarantee the existence of functions
$\psi(x, y)$ and $\varphi(x, y)$ such that $(-v_y, v_x) = \operatorname{grad}\psi$ and
$(v_x, v_y) = \operatorname{grad}\varphi$.

c) Verify that the level curves $\varphi = c_1$ and $\psi = c_2$ of these functions are
orthogonal and show that in the steady-state flow the curves $\psi = c$ coincide with the
trajectories of the moving particles of the medium. It is for that reason that the function
$\psi$ is called the current function, in contrast to the function $\varphi$, which is the
velocity potential.

d) Show, assuming that the functions $\varphi$ and $\psi$ are sufficiently smooth, that they
are both harmonic functions and satisfy the Cauchy-Riemann equations:

$$\frac{\partial\varphi}{\partial x} = \frac{\partial\psi}{\partial y}, \qquad \frac{\partial\varphi}{\partial y} = -\frac{\partial\psi}{\partial x}.$$

Harmonic functions satisfying the Cauchy-Riemann equations are called conjugate
harmonic functions.

e) Verify that the function $f(z) = (\varphi + i\psi)(x, y)$, where $z = x + iy$, is a
differentiable function of the complex variable z. This determines the connection of
the planar problems of hydrodynamics with the theory of functions of a complex
variable.

4. Consider the elementary version
$\frac{\partial^2 p}{\partial t^2} = a^2\frac{\partial^2 p}{\partial x^2}$ of the wave equation
(14.79). This is the case of a plane wave in which the pressure depends only on the
x-coordinate of the point (x, y, z) of space.

a) By making the change of variable $u = x - at$, $v = x + at$, reduce this equation
to the form $\frac{\partial^2 p}{\partial u\,\partial v} = 0$ and show that the general form of
the solution of the original equation is $p = f(x + at) + g(x - at)$, where f and g are
arbitrary functions of class $C^{(2)}$.

b) Interpret the solution just obtained as two waves f(x) and g(x) propagating
left and right along the x-axis with velocity a.

c) Assuming that the quantity a is the velocity of propagation of a disturbance
even in the general case (14.79), and taking account of the relation
$a = (\psi'(p_0))^{-1/2}$, find, following Newton, the velocity $c_N$ of sound in air, assuming
that the temperature in an acoustic wave is constant, that is, assuming that the
process of acoustic oscillation is isothermic. (The equation of state is
$\rho = \frac{\mu p}{RT}$, $R = 8.31\ \frac{\text{J}}{\text{deg}\cdot\text{mole}}$ is the universal
gas constant, and $\mu = 28.8\ \frac{\text{g}}{\text{mole}}$ is the molecular weight of air.
Carry out the computation for air at a temperature of 0 °C, that is, T = 273 K. Newton found
that $c_N = 280$ m/s.)

d) Assuming that the process of acoustic vibrations is adiabatic, find, following
Laplace, the velocity $c_L$ of sound in air, and thereby sharpen Newton’s result $c_N$.
(In an adiabatic process $p = c\rho^\gamma$. This is Poisson’s formula from Problem 6 of
Sect. 13.1. Show that if $c_N = \sqrt{p/\rho}$, then $c_L = \sqrt{\gamma\,p/\rho}$. For air
$\gamma \approx 1.4$. Laplace found $c_L = 330$ m/s, which is in excellent agreement with
experiment.)

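5. Using the scalar and vector potentials one can reduce the Maxwell equations
((14.12) of Sect. 14.1) to the wave equation (more precisely, to several wave equations
of the same type). By solving this problem, you will verify this statement.

a) It follows from the equation $\nabla\cdot\mathbf{B} = 0$ that at least locally
$\mathbf{B} = \nabla\times\mathbf{A}$, where $\mathbf{A}$ is the vector potential of the field
$\mathbf{B}$.

b) Knowing that $\mathbf{B} = \nabla\times\mathbf{A}$, show that the equation
$\nabla\times\mathbf{E} = -\frac{\partial\mathbf{B}}{\partial t}$ implies that at least locally
there exists a scalar function $\varphi$ such that
$\mathbf{E} = -\nabla\varphi - \frac{\partial\mathbf{A}}{\partial t}$.

c) Verify that the fields $\mathbf{E} = -\nabla\varphi - \frac{\partial\mathbf{A}}{\partial t}$
and $\mathbf{B} = \nabla\times\mathbf{A}$ do not change if instead of $\varphi$ and $\mathbf{A}$
we take another pair of potentials $\tilde\varphi$ and $\tilde{\mathbf{A}}$ such that
$\tilde\varphi = \varphi - \frac{\partial\psi}{\partial t}$ and
$\tilde{\mathbf{A}} = \mathbf{A} + \nabla\psi$, where $\psi$ is an arbitrary function of class
$C^{(2)}$.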

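d) The equation $\nabla\cdot\mathbf{E} = \frac{\rho}{\varepsilon_0}$ implies the first relation
$-\nabla^2\varphi - \frac{\partial}{\partial t}\nabla\cdot\mathbf{A} = \frac{\rho}{\varepsilon_0}$
between the potentials $\varphi$ and $\mathbf{A}$.

e) The equation
$c^2\nabla\times\mathbf{B} - \frac{\partial\mathbf{E}}{\partial t} = \frac{\mathbf{j}}{\varepsilon_0}$
implies the second relation

$$-c^2\nabla^2\mathbf{A} + c^2\nabla(\nabla\cdot\mathbf{A}) + \frac{\partial}{\partial t}\nabla\varphi + \frac{\partial^2\mathbf{A}}{\partial t^2} = \frac{\mathbf{j}}{\varepsilon_0}$$

between the potentials $\varphi$ and $\mathbf{A}$.

f) Using c), show that by solving the auxiliary wave equation
$\Delta\psi + f = \frac{1}{c^2}\frac{\partial^2\psi}{\partial t^2}$,
without changing the fields $\mathbf{E}$ and $\mathbf{B}$ one can choose the potentials
$\varphi$ and $\mathbf{A}$ so that they satisfy the additional (so-called gauge) condition
$\nabla\cdot\mathbf{A} = -\frac{1}{c^2}\frac{\partial\varphi}{\partial t}$.

g) Show that if the potentials $\varphi$ and $\mathbf{A}$ are chosen as stated in f), then the
required inhomogeneous wave equations

$$\frac{\partial^2\varphi}{\partial t^2} = c^2\Delta\varphi + \frac{\rho c^2}{\varepsilon_0}, \qquad
\frac{\partial^2\mathbf{A}}{\partial t^2} = c^2\Delta\mathbf{A} + \frac{\mathbf{j}}{\varepsilon_0}$$

for the potentials $\varphi$ and $\mathbf{A}$ follow from d) and e). By finding $\varphi$ and
$\mathbf{A}$, we also find the fields
$\mathbf{E} = -\nabla\varphi - \frac{\partial\mathbf{A}}{\partial t}$,
$\mathbf{B} = \nabla\times\mathbf{A}$.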



Chapter 15 

*Integration of Differential Forms on Manifolds


15.1 A Brief Review of Linear Algebra 

15.1.1 The Algebra of Forms 


Let X be a vector space and $F^k : X^k \to \mathbb{R}$ a real-valued k-form on X. If
$e_1, \dots, e_n$ is a basis in X and $x_1 = x^{i_1}e_{i_1}, \dots, x_k = x^{i_k}e_{i_k}$ is the
expansion of the vectors $x_1, \dots, x_k \in X$ with respect to this basis, then by the linearity
of $F^k$ with respect to each argument

$$F^k(x_1, \dots, x_k) = F^k\bigl(x^{i_1}e_{i_1}, \dots, x^{i_k}e_{i_k}\bigr)
= F^k(e_{i_1}, \dots, e_{i_k})\,x^{i_1}\cdot\ldots\cdot x^{i_k}
= a_{i_1\dots i_k}\,x^{i_1}\cdot\ldots\cdot x^{i_k}. \tag{15.1}$$

Thus, after a basis is given in X, one can identify the k-form $F^k : X^k \to \mathbb{R}$ with
the set of numbers $a_{i_1\dots i_k} = F^k(e_{i_1}, \dots, e_{i_k})$.

If $\tilde e_1, \dots, \tilde e_n$ is another basis in X and
$\tilde a_{j_1\dots j_k} = F^k(\tilde e_{j_1}, \dots, \tilde e_{j_k})$, then, setting
$\tilde e_j = c^i_j e_i$, $j = 1, \dots, n$, we find the (tensor) law

$$\tilde a_{j_1\dots j_k} = F^k\bigl(c^{i_1}_{j_1}e_{i_1}, \dots, c^{i_k}_{j_k}e_{i_k}\bigr)
= a_{i_1\dots i_k}\,c^{i_1}_{j_1}\cdot\ldots\cdot c^{i_k}_{j_k} \tag{15.2}$$

for transformation of the number sets $a_{i_1\dots i_k}$, $\tilde a_{j_1\dots j_k}$
corresponding to the same form $F^k$.

The set $\mathcal{F}^k := \{F^k : X^k \to \mathbb{R}\}$ of k-forms on a vector space X is itself
a vector space relative to the standard operations

$$(F_1^k + F_2^k)(x) := F_1^k(x) + F_2^k(x), \tag{15.3}$$

$$(\lambda F^k)(x) := \lambda F^k(x) \tag{15.4}$$

of addition of k-forms and multiplication of a k-form by a scalar.

For forms $F^k$ and $F^l$ of arbitrary degrees k and l the following tensor product
operation $\otimes$ is defined:

$$(F^k \otimes F^l)(x_1, \dots, x_k, x_{k+1}, \dots, x_{k+l}) :=
F^k(x_1, \dots, x_k)\,F^l(x_{k+1}, \dots, x_{k+l}). \tag{15.5}$$

Thus $F^k \otimes F^l$ is a form $F^{k+l}$ of degree k + l. The following relations are obvious:

$$(\lambda F^k) \otimes F^l = \lambda(F^k \otimes F^l), \tag{15.6}$$

$$(F_1^k + F_2^k) \otimes F^l = F_1^k \otimes F^l + F_2^k \otimes F^l, \tag{15.7}$$

$$F^k \otimes (F_1^l + F_2^l) = F^k \otimes F_1^l + F^k \otimes F_2^l, \tag{15.8}$$

$$(F^k \otimes F^l) \otimes F^m = F^k \otimes (F^l \otimes F^m). \tag{15.9}$$

Thus the set $\mathcal{F} = \{\mathcal{F}^k\}$ of forms on the vector space X is a graded algebra
$\mathcal{F} = \bigoplus_k \mathcal{F}^k$ with respect to these operations, in which the
vector-space operations are carried out inside each space $\mathcal{F}^k$ occurring in the direct
sum, and if $F^k \in \mathcal{F}^k$, $F^l \in \mathcal{F}^l$, then
$F^k \otimes F^l \in \mathcal{F}^{k+l}$.

Example 1 Let $X^*$ be the dual space to X (consisting of the linear functionals on X)
and $e^1, \dots, e^n$ the basis of $X^*$ dual to the basis $e_1, \dots, e_n$ in X, that is,
$e^i(e_j) = \delta^i_j$.

Since $e^i(x) = e^i(x^j e_j) = x^j e^i(e_j) = x^j\delta^i_j = x^i$, taking account of (15.1) and
(15.9), we can write any k-form $F^k : X^k \to \mathbb{R}$ as

$$F^k = a_{i_1\dots i_k}\,e^{i_1} \otimes \cdots \otimes e^{i_k}. \tag{15.10}$$
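
The identification of a form with its array of coefficients is easy to try out on a small case. The sketch below is an illustration under assumptions of our own: X is $\mathbb{R}^3$ with its standard basis, vectors are tuples, forms are ordinary Python functions, and the helper names are hypothetical. It builds the tensor product (15.5) of a 1-form and a 2-form, reads off the coefficients $a_{i_1 i_2 i_3}$, and then evaluates the form again from those coefficients as in (15.1) and (15.10).

```python
from itertools import product

def tensor_product(F, k, G, l):
    """Return the (k+l)-form (F ⊗ G)(x_1..x_{k+l}) = F(x_1..x_k) * G(x_{k+1}..x_{k+l})."""
    def FG(*vectors):
        assert len(vectors) == k + l
        return F(*vectors[:k]) * G(*vectors[k:])
    return FG

def coefficients(F, k, n):
    """a_{i_1...i_k} = F(e_{i_1}, ..., e_{i_k}) in the standard basis of R^n."""
    basis = [tuple(1.0 if r == c else 0.0 for c in range(n)) for r in range(n)]
    return {idx: F(*(basis[i] for i in idx)) for idx in product(range(n), repeat=k)}

def evaluate_from_coefficients(a, vectors):
    """Sum of a_{i_1...i_k} x_1^{i_1} ... x_k^{i_k} over all index sets."""
    total = 0.0
    for idx, coeff in a.items():
        term = coeff
        for vec, i in zip(vectors, idx):
            term *= vec[i]
        total += term
    return total

if __name__ == "__main__":
    F = lambda x: 2 * x[0] - x[2]                     # a 1-form on R^3
    G = lambda x, y: x[0] * y[1] + 3 * x[2] * y[2]    # a 2-form on R^3
    FG = tensor_product(F, 1, G, 2)
    vs = ((1, 2, 3), (0, 1, -1), (2, 0, 5))
    print(FG(*vs), evaluate_from_coefficients(coefficients(FG, 3, 3), vs))
```

Both ways of evaluating the 3-form give the same number, as multilinearity requires.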


15.1.2 The Algebra of Skew-Symmetric Forms 

Let us now consider the space $\Omega^k$ of skew-symmetric forms in $\mathcal{F}^k$, that is,
$\omega \in \Omega^k$ if the equality

$$\omega(x_1, \dots, x_i, \dots, x_j, \dots, x_k) = -\omega(x_1, \dots, x_j, \dots, x_i, \dots, x_k)$$

holds for any distinct indices $i, j \in \{1, \dots, k\}$.

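From any form $F^k \in \mathcal{F}^k$ one can obtain a skew-symmetric form using the
operation $A : \mathcal{F}^k \to \Omega^k$ of alternation, defined by the relation

$$AF^k(x_1, \dots, x_k) := \frac{1}{k!}\,F^k(x_{i_1}, \dots, x_{i_k})\,\delta^{i_1\dots i_k}_{1\dots k}, \tag{15.11}$$

where

$$\delta^{i_1\dots i_k}_{1\dots k} =
\begin{cases}
1, & \text{if the permutation } \begin{pmatrix} i_1 & \dots & i_k \\ 1 & \dots & k \end{pmatrix} \text{ is even},\\
-1, & \text{if the permutation } \begin{pmatrix} i_1 & \dots & i_k \\ 1 & \dots & k \end{pmatrix} \text{ is odd},\\
0, & \text{if } \begin{pmatrix} i_1 & \dots & i_k \\ 1 & \dots & k \end{pmatrix} \text{ is not a permutation}.
\end{cases}$$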

If $F^k$ is a skew-symmetric form, then, as one can see from (15.11), $AF^k = F^k$.
Thus $A(AF^k) = AF^k$ and $A\omega = \omega$ if $\omega \in \Omega^k$. Hence
$A : \mathcal{F}^k \to \Omega^k$ is a mapping of $\mathcal{F}^k$ onto $\Omega^k$.

Comparing Definitions (15.3), (15.4), and (15.11), we obtain

$$A(F_1^k + F_2^k) = AF_1^k + AF_2^k, \tag{15.12}$$

$$A(\lambda F^k) = \lambda AF^k. \tag{15.13}$$

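Example 2 Taking account of relations (15.12) and (15.13), we find by (15.10) that

$$AF^k = a_{i_1\dots i_k}\,A(e^{i_1} \otimes \cdots \otimes e^{i_k}),$$

so that it is of interest to find $A(e^{i_1} \otimes \cdots \otimes e^{i_k})$.

From Definition (15.11), taking account of the relation $e^i(x) = x^i$, we find

$$A(e^{j_1} \otimes \cdots \otimes e^{j_k})(x_1, \dots, x_k)
= \frac{1}{k!}\,e^{j_1}(x_{i_1})\cdot\ldots\cdot e^{j_k}(x_{i_k})\,\delta^{i_1\dots i_k}_{1\dots k}
= \frac{1}{k!}\,x^{j_1}_{i_1}\cdot\ldots\cdot x^{j_k}_{i_k}\,\delta^{i_1\dots i_k}_{1\dots k}
= \frac{1}{k!}
\begin{vmatrix}
x_1^{j_1} & \cdots & x_1^{j_k} \\
\vdots & & \vdots \\
x_k^{j_1} & \cdots & x_k^{j_k}
\end{vmatrix}. \tag{15.14}$$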

The tensor product of skew-symmetric forms is in general not skew-symmetric, so that we introduce the following exterior product in the class of skew-symmetric forms:

$$\omega^k \wedge \omega^l := \frac{(k+l)!}{k!\, l!}\, A(\omega^k \otimes \omega^l). \tag{15.15}$$

Thus $\omega^k \wedge \omega^l$ is a skew-symmetric form $\omega^{k+l}$ of degree $k + l$.

Example 3 Based on the result (15.14) of Example 2, we find by Definition (15.15) that

$$e^{i_1} \wedge e^{i_2}(x_1, x_2) = \frac{2!}{1!\, 1!}\, A(e^{i_1} \otimes e^{i_2})(x_1, x_2) = \begin{vmatrix} e^{i_1}(x_1) & e^{i_2}(x_1) \\ e^{i_1}(x_2) & e^{i_2}(x_2) \end{vmatrix}. \tag{15.16}$$
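Definition (15.15) and the result (15.16) of Example 3 can be reproduced by a short Python sketch. It assumes that forms are plain functions of coordinate tuples; `wedge`, `alternate`, and `sign` are illustrative helper names.

```python
from fractions import Fraction
from itertools import permutations
from math import factorial

def sign(p):
    """Sign of a permutation p of (0, ..., k-1), found by counting inversions."""
    inversions = sum(1 for a in range(len(p))
                     for b in range(a + 1, len(p)) if p[a] > p[b])
    return -1 if inversions % 2 else 1

def alternate(F, k):
    """Alternation (15.11) of a k-form F."""
    def AF(*xs):
        total = sum(sign(p) * F(*(xs[i] for i in p))
                    for p in permutations(range(k)))
        return Fraction(total, factorial(k))
    return AF

def wedge(w1, k, w2, l):
    """Exterior product (15.15): (k+l)!/(k! l!) * A(w1 (x) w2)."""
    coef = Fraction(factorial(k + l), factorial(k) * factorial(l))
    tensor = lambda *xs: w1(*xs[:k]) * w2(*xs[k:])   # tensor product w1 (x) w2
    A = alternate(tensor, k + l)
    return lambda *xs: coef * A(*xs)

e1 = lambda x: x[0]   # basis 1-form e^1 on R^2
e2 = lambda x: x[1]   # basis 1-form e^2 on R^2
w = wedge(e1, 1, e2, 1)

x1, x2 = (1, 2), (3, 4)
determinant = e1(x1) * e2(x2) - e2(x1) * e1(x2)   # right-hand side of (15.16)
print(w(x1, x2), determinant)
```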


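Example 4 Using the equality obtained in Example 3, relation (15.14), and the definitions (15.11) and (15.15), we can write

$$\begin{aligned} e^{i_1} \wedge (e^{i_2} \wedge e^{i_3})(x_1, x_2, x_3) &= \frac{3!}{1!\, 2!}\, A\bigl(e^{i_1} \otimes (e^{i_2} \wedge e^{i_3})\bigr)(x_1, x_2, x_3) = \\ &= \frac{1}{2!}\, e^{i_1}(x_{j_1})\, (e^{i_2} \wedge e^{i_3})(x_{j_2}, x_{j_3})\, \delta^{j_1 j_2 j_3}_{1\, 2\, 3} = \frac{1}{2}\, x^{i_1}_{j_1} \begin{vmatrix} x^{i_2}_{j_2} & x^{i_3}_{j_2} \\ x^{i_2}_{j_3} & x^{i_3}_{j_3} \end{vmatrix} \delta^{j_1 j_2 j_3}_{1\, 2\, 3} = \\ &= x^{i_1}_1 \begin{vmatrix} x^{i_2}_2 & x^{i_3}_2 \\ x^{i_2}_3 & x^{i_3}_3 \end{vmatrix} - x^{i_1}_2 \begin{vmatrix} x^{i_2}_1 & x^{i_3}_1 \\ x^{i_2}_3 & x^{i_3}_3 \end{vmatrix} + x^{i_1}_3 \begin{vmatrix} x^{i_2}_1 & x^{i_3}_1 \\ x^{i_2}_2 & x^{i_3}_2 \end{vmatrix} = \\ &= \begin{vmatrix} x^{i_1}_1 & x^{i_2}_1 & x^{i_3}_1 \\ x^{i_1}_2 & x^{i_2}_2 & x^{i_3}_2 \\ x^{i_1}_3 & x^{i_2}_3 & x^{i_3}_3 \end{vmatrix}. \end{aligned}$$

A similar computation shows that

$$e^{i_1} \wedge (e^{i_2} \wedge e^{i_3}) = (e^{i_1} \wedge e^{i_2}) \wedge e^{i_3}. \tag{15.17}$$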

Using the expansion of the determinant along a column, we conclude by induction that

$$e^{i_1} \wedge \cdots \wedge e^{i_k}(x_1, \ldots, x_k) = \begin{vmatrix} e^{i_1}(x_1) & \cdots & e^{i_k}(x_1) \\ \vdots & & \vdots \\ e^{i_1}(x_k) & \cdots & e^{i_k}(x_k) \end{vmatrix}, \tag{15.18}$$

and, as one can see from the computations just carried out, formula (15.18) holds for any 1-forms $e^{i_1}, \ldots, e^{i_k}$ (not just the basis forms of the space $X^*$).

Taking the properties of the tensor product and the alternation operation listed above into account, we obtain the following properties of the exterior product of skew-symmetric forms:

$$(\omega_1^k + \omega_2^k) \wedge \omega^l = \omega_1^k \wedge \omega^l + \omega_2^k \wedge \omega^l, \tag{15.19}$$

$$(\lambda \omega^k) \wedge \omega^l = \lambda (\omega^k \wedge \omega^l), \tag{15.20}$$

$$\omega^k \wedge \omega^l = (-1)^{kl}\, \omega^l \wedge \omega^k, \tag{15.21}$$

$$(\omega^k \wedge \omega^l) \wedge \omega^m = \omega^k \wedge (\omega^l \wedge \omega^m). \tag{15.22}$$

Proof Equalities (15.19) and (15.20) follow obviously from relations (15.6)-(15.8) and (15.12) and (15.13).

From relations (15.10)-(15.14) and (15.17), for every skew-symmetric form $\omega = a_{i_1 \ldots i_k}\, e^{i_1} \otimes \cdots \otimes e^{i_k}$ we obtain

$$\omega = A\omega = a_{i_1 \ldots i_k}\, A(e^{i_1} \otimes \cdots \otimes e^{i_k}) = \frac{1}{k!}\, a_{i_1 \ldots i_k}\, e^{i_1} \wedge \cdots \wedge e^{i_k}.$$

Using the equalities (15.19) and (15.20) we see that it now suffices to verify (15.21) and (15.22) for the forms $e^{i_1} \wedge \cdots \wedge e^{i_k}$.

Associativity (15.22) for such forms was already established by (15.17).

We now obtain (15.21) immediately from (15.18) and the properties of determinants for these particular forms. □
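Properties (15.21) and (15.22), together with the determinant formula (15.18) for arbitrary 1-forms, can be spot-checked numerically. The following Python sketch is an illustration only: the representation of forms as functions, the helper names, and the sample 1-forms `a`, `b`, `c` on $\mathbb{R}^3$ are assumptions made for the example.

```python
from fractions import Fraction
from itertools import permutations
from math import factorial

def sign(p):
    """Sign of a permutation p of (0, ..., k-1), found by counting inversions."""
    inversions = sum(1 for i in range(len(p))
                     for j in range(i + 1, len(p)) if p[i] > p[j])
    return -1 if inversions % 2 else 1

def alternate(F, k):
    """Alternation (15.11) of a k-form F."""
    def AF(*xs):
        total = sum(sign(p) * F(*(xs[i] for i in p))
                    for p in permutations(range(k)))
        return Fraction(total, factorial(k))
    return AF

def wedge(w1, k, w2, l):
    """Exterior product (15.15) of a k-form w1 and an l-form w2."""
    coef = Fraction(factorial(k + l), factorial(k) * factorial(l))
    A = alternate(lambda *xs: w1(*xs[:k]) * w2(*xs[k:]), k + l)
    return lambda *xs: coef * A(*xs)

def one_form(coeffs):
    """The 1-form x -> sum of coeffs[i] * x[i]."""
    return lambda x: sum(ci * xi for ci, xi in zip(coeffs, x))

a, b, c = one_form((1, 2, 0)), one_form((0, 1, -1)), one_form((3, 0, 1))
ab = wedge(a, 1, b, 1)
ba = wedge(b, 1, a, 1)
left = wedge(ab, 2, c, 1)                      # (a ^ b) ^ c
right = wedge(a, 1, wedge(b, 1, c, 1), 2)      # a ^ (b ^ c)
c_ab = wedge(c, 1, ab, 2)                      # c ^ (a ^ b), here kl = 2 is even

x, y, z = (1, 0, 2), (0, 1, 1), (1, 1, 0)
print(left(x, y, z), right(x, y, z), c_ab(x, y, z))
```

The common value is the determinant of the matrix whose rows are $(a(x), b(x), c(x))$, $(a(y), b(y), c(y))$, $(a(z), b(z), c(z))$, as (15.18) predicts.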

Along the way we have shown that every form $\omega \in \Omega^k$ can be represented as

$$\omega = \sum_{1 \le i_1 < i_2 < \cdots < i_k \le n} a_{i_1 \ldots i_k}\, e^{i_1} \wedge \cdots \wedge e^{i_k}. \tag{15.23}$$

Thus, the set $\Omega = \{\Omega^k\}$ of skew-symmetric forms on the vector space $X$ relative to the linear vector-space operations (15.3) and (15.4) and the exterior multiplication (15.15) is a graded algebra $\Omega = \bigoplus_{k=0}^{\dim X} \Omega^k$. The vector-space operations on $\Omega$ are carried out inside each vector space $\Omega^k$, and if $\omega^k \in \Omega^k$, $\omega^l \in \Omega^l$, then $\omega^k \wedge \omega^l \in \Omega^{k+l}$.

In the direct sum $\bigoplus \Omega^k$ the summation runs from zero to the dimension of the space $X$, since the skew-symmetric forms $\omega^k : X^k \to \mathbb{R}$ of degree larger than the dimension of $X$ are necessarily identically zero, as one can see by (15.21) (or from relations (15.23) and (15.8)).
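The index set in (15.23) is easy to enumerate, which makes the vanishing of forms of degree $k > \dim X$ visible: there are no increasing multi-indices of that length. The Python sketch below only counts the multi-indices; it assumes (as Problem 2c) below asks one to verify) that the corresponding forms are linearly independent, and the function name is hypothetical.

```python
from itertools import combinations
from math import comb

def basis_multi_indices(n, k):
    """Increasing multi-indices 1 <= i_1 < ... < i_k <= n labelling e^{i_1} ^ ... ^ e^{i_k}."""
    return list(combinations(range(1, n + 1), k))

n = 4
# One count per degree k = 0, 1, ..., n + 1; the last entry is the degree n + 1 > dim X.
counts = [len(basis_multi_indices(n, k)) for k in range(n + 2)]
print(counts, sum(counts))
```

For $n = 4$ the counts are the binomial coefficients $\binom{4}{k}$, the count for degree $5$ is zero, and the total is $2^4$.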


15.1.3 Linear Mappings of Vector Spaces and the Adjoint Mappings of the Conjugate Spaces

Let $X$ and $Y$ be vector spaces over the field $\mathbb{R}$ of real numbers (or any other field, so long as it is the same field for both $X$ and $Y$), and let $l : X \to Y$ be a linear mapping of $X$ into $Y$, that is, for every $x, x_1, x_2 \in X$ and every $\lambda \in \mathbb{R}$,

$$l(x_1 + x_2) = l(x_1) + l(x_2) \quad \text{and} \quad l(\lambda x) = \lambda l(x). \tag{15.24}$$

A linear mapping $l : X \to Y$ naturally generates its adjoint mapping $l^* : \mathcal{F}_Y \to \mathcal{F}_X$ from the set $\mathcal{F}_Y$ of forms on $Y$ into the analogous set $\mathcal{F}_X$. If $F_Y^k$ is a $k$-form on $Y$, then by definition

$$(l^* F_Y^k)(x_1, \ldots, x_k) := F_Y^k(l x_1, \ldots, l x_k). \tag{15.25}$$

It can be seen by (15.24) and (15.25) that $l^* F_Y^k$ is a $k$-form $F_X^k$ on $X$, that is, $l^*(\mathcal{F}_Y^k) \subset \mathcal{F}_X^k$. Moreover, if the form $F_Y^k$ was skew-symmetric, then $l^* F_Y^k = F_X^k$ is also skew-symmetric, that is, $l^*(\Omega_Y^k) \subset \Omega_X^k$. Inside each vector space $\mathcal{F}_Y^k$ and $\Omega_Y^k$ the mapping $l^*$ is obviously linear, that is,

$$l^*(F_1^k + F_2^k) = l^* F_1^k + l^* F_2^k \quad \text{and} \quad l^*(\lambda F^k) = \lambda\, l^* F^k. \tag{15.26}$$

Now comparing definition (15.25) with the definitions (15.5), (15.11), and (15.15) of the tensor product, alternation, and exterior product of forms, we conclude that

$$l^*(F^p \otimes F^q) = (l^* F^p) \otimes (l^* F^q), \tag{15.27}$$

$$l^*(A F^p) = A(l^* F^p), \tag{15.28}$$

$$l^*(\omega^p \wedge \omega^q) = (l^* \omega^p) \wedge (l^* \omega^q). \tag{15.29}$$

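Example 5 Let $e_1, \ldots, e_m$ be a basis in $X$, $\tilde e_1, \ldots, \tilde e_n$ a basis in $Y$, and $l(e_i) = c_i^j \tilde e_j$, $i \in \{1, \ldots, m\}$, $j \in \{1, \ldots, n\}$. If the $k$-form $F_Y^k$ has the coordinate representation

$$F_Y^k(y_1, \ldots, y_k) = b_{j_1 \ldots j_k}\, y_1^{j_1} \cdots y_k^{j_k}$$

in the basis $\tilde e_1, \ldots, \tilde e_n$, where $b_{j_1 \ldots j_k} = F_Y^k(\tilde e_{j_1}, \ldots, \tilde e_{j_k})$, then

$$(l^* F_Y^k)(x_1, \ldots, x_k) = a_{i_1 \ldots i_k}\, x_1^{i_1} \cdots x_k^{i_k},$$

where $a_{i_1 \ldots i_k} = b_{j_1 \ldots j_k}\, c_{i_1}^{j_1} \cdots c_{i_k}^{j_k}$, since

$$\begin{aligned} a_{i_1 \ldots i_k} &=: (l^* F_Y^k)(e_{i_1}, \ldots, e_{i_k}) := F_Y^k(l e_{i_1}, \ldots, l e_{i_k}) = \\ &= F_Y^k(c_{i_1}^{j_1} \tilde e_{j_1}, \ldots, c_{i_k}^{j_k} \tilde e_{j_k}) = F_Y^k(\tilde e_{j_1}, \ldots, \tilde e_{j_k})\, c_{i_1}^{j_1} \cdots c_{i_k}^{j_k}. \end{aligned}$$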
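The coefficient rule of Example 5 can be tried out for $k = 2$. The Python sketch below uses hypothetical data: a mapping $l : \mathbb{R}^2 \to \mathbb{R}^3$ given by an array `c` with `c[i][j]` standing for $c_i^j$, and an arbitrary coefficient array `b`. It compares the pullback computed from definition (15.25) with the value obtained from the coefficients $a_{i_1 i_2}$.

```python
from itertools import product

m, n = 2, 3
c = [[1, 2, 0],          # c[i][j] stands for c_i^j in l(e_i) = c_i^j e~_j
     [0, 1, 3]]
b = [[0, 1, 2],          # b[j1][j2] stands for b_{j1 j2}
     [-1, 0, 5],
     [4, 3, 0]]

def F_Y(y1, y2):
    """The 2-form on Y with coefficients b."""
    return sum(b[j1][j2] * y1[j1] * y2[j2] for j1, j2 in product(range(n), repeat=2))

def l(x):
    """Coordinates of l(x) = x^i c_i^j e~_j in Y."""
    return [sum(x[i] * c[i][j] for i in range(m)) for j in range(n)]

def pullback(x1, x2):
    """(l* F_Y)(x1, x2) := F_Y(l x1, l x2), definition (15.25)."""
    return F_Y(l(x1), l(x2))

# a_{i1 i2} = b_{j1 j2} c_{i1}^{j1} c_{i2}^{j2}, summed over j1 and j2.
a = [[sum(b[j1][j2] * c[i1][j1] * c[i2][j2] for j1, j2 in product(range(n), repeat=2))
      for i2 in range(m)] for i1 in range(m)]

def via_coefficients(x1, x2):
    return sum(a[i1][i2] * x1[i1] * x2[i2] for i1, i2 in product(range(m), repeat=2))

x1, x2 = [2, -1], [1, 3]
print(pullback(x1, x2), via_coefficients(x1, x2))
```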


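Example 6 Let $e^1, \ldots, e^m$ and $\tilde e^1, \ldots, \tilde e^n$ be the bases of the conjugate spaces $X^*$ and $Y^*$ dual to the bases in Example 5. Under the hypotheses of Example 5 we obtain

$$\begin{aligned} (l^* \tilde e^j)(x) &= (l^* \tilde e^j)(x^i e_i) = \tilde e^j(x^i\, l e_i) = x^i\, \tilde e^j(c_i^k \tilde e_k) = \\ &= x^i c_i^k\, \tilde e^j(\tilde e_k) = x^i c_i^k \delta_k^j = c_i^j x^i = c_i^j e^i(x). \end{aligned}$$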

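Example 7 Retaining the notation of Example 6 and taking account of relations (15.22) and (15.29), we now obtain

$$\begin{aligned} l^*(\tilde e^{j_1} \wedge \cdots \wedge \tilde e^{j_k}) &= l^* \tilde e^{j_1} \wedge \cdots \wedge l^* \tilde e^{j_k} = \\ &= (c_{i_1}^{j_1} e^{i_1}) \wedge \cdots \wedge (c_{i_k}^{j_k} e^{i_k}) = c_{i_1}^{j_1} \cdots c_{i_k}^{j_k}\, e^{i_1} \wedge \cdots \wedge e^{i_k} = \\ &= \sum_{1 \le i_1 < \cdots < i_k \le m} \begin{vmatrix} c_{i_1}^{j_1} & \cdots & c_{i_1}^{j_k} \\ \vdots & & \vdots \\ c_{i_k}^{j_1} & \cdots & c_{i_k}^{j_k} \end{vmatrix} e^{i_1} \wedge \cdots \wedge e^{i_k}. \end{aligned}$$

Keeping Eq. (15.26) in mind, we can conclude from this that

$$\begin{aligned} l^*\Bigl( \sum_{1 \le j_1 < \cdots < j_k \le n} b_{j_1 \ldots j_k}\, \tilde e^{j_1} \wedge \cdots \wedge \tilde e^{j_k} \Bigr) &= \sum_{\substack{1 \le i_1 < \cdots < i_k \le m \\ 1 \le j_1 < \cdots < j_k \le n}} b_{j_1 \ldots j_k} \begin{vmatrix} c_{i_1}^{j_1} & \cdots & c_{i_1}^{j_k} \\ \vdots & & \vdots \\ c_{i_k}^{j_1} & \cdots & c_{i_k}^{j_k} \end{vmatrix} e^{i_1} \wedge \cdots \wedge e^{i_k} = \\ &= \sum_{1 \le i_1 < \cdots < i_k \le m} a_{i_1 \ldots i_k}\, e^{i_1} \wedge \cdots \wedge e^{i_k}. \end{aligned}$$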

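The minors appearing in Example 7 can be checked for $k = 2$. The Python sketch below is an illustration with hypothetical data: a mapping $l : \mathbb{R}^3 \to \mathbb{R}^2$ given by `c[i][j]` in the role of $c_i^j$, zero-based indices, and the 2-forms $e^p \wedge e^q$ evaluated by formula (15.16).

```python
from itertools import combinations

m, n = 3, 2
c = [[1, 2],             # c[i][j] stands for c_i^j in l(e_i) = c_i^j e~_j
     [0, 1],
     [3, -1]]

def l(x):
    """Coordinates of l(x) in Y = R^2."""
    return [sum(x[i] * c[i][j] for i in range(m)) for j in range(n)]

def basis_wedge(p, q):
    """The 2-form e^p ^ e^q as a function of two coordinate vectors, by (15.16)."""
    return lambda u, v: u[p] * v[q] - u[q] * v[p]

def minor(i1, i2, j1, j2):
    """2x2 determinant with rows i1, i2 and columns j1, j2 of the array c."""
    return c[i1][j1] * c[i2][j2] - c[i1][j2] * c[i2][j1]

j1, j2 = 0, 1
x1, x2 = [1, 2, -1], [0, 3, 2]

# Left side: the pullback evaluated by definition (15.25).
lhs = basis_wedge(j1, j2)(l(x1), l(x2))
# Right side: sum over i1 < i2 of minors times e^{i1} ^ e^{i2}.
rhs = sum(minor(i1, i2, j1, j2) * basis_wedge(i1, i2)(x1, x2)
          for i1, i2 in combinations(range(m), 2))
print(lhs, rhs)
```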

15.1.4 Problems and Exercises 

1. Show by examples that in general

a) $F^k \otimes F^l \ne F^l \otimes F^k$;

b) $A(F^k \otimes F^l) \ne AF^k \otimes AF^l$;

c) if $F^k, F^l \in \Omega$, then it is not always true that $F^k \otimes F^l \in \Omega$.

2. a) Show that if $e_1, \ldots, e_n$ is a basis of the vector space $X$ and the linear functionals $e^1, \ldots, e^n$ on $X$ (that is, elements of the conjugate space $X^*$) are such that $e^j(e_i) = \delta_i^j$, then $e^1, \ldots, e^n$ is a basis in $X^*$.

b) Verify that one can always form a basis of the space $\mathcal{F}^k = \mathcal{F}^k(X)$ from $k$-forms of the form $e^{i_1} \otimes \cdots \otimes e^{i_k}$, and find the dimension ($\dim \mathcal{F}^k$) of this space, knowing that $\dim X = n$.

c) Verify that one can always form a basis of the space $\Omega^k$ from forms of the form $e^{i_1} \wedge \cdots \wedge e^{i_k}$, and find $\dim \Omega^k$ knowing that $\dim X = n$.

d) Show that if $\Omega = \bigoplus_{k=0}^{n} \Omega^k$, then $\dim \Omega = 2^n$.

3. The exterior (Grassmann¹) algebra $G$ over a vector space $X$ and a field $P$ (usually denoted $\bigwedge(X)$ in agreement with the symbol $\wedge$ for the multiplication operation in $G$) is defined as the associative algebra with identity 1 having the following properties:

1° $G$ is generated by the identity and $X$, that is, any subalgebra of $G$ containing 1 and $X$ is equal to $G$;

2° $x \wedge x = 0$ for every vector $x \in X$;

3° $\dim G = 2^{\dim X}$.

¹ H. Grassmann (1809-1877) - German mathematician, physicist and philologist; in particular, he created the first systematic theory of multidimensional and Euclidean vector spaces and gave the definition of the inner product of vectors.

a) Show that if $e_1, \ldots, e_n$ is a basis in $X$, then the set $1, e_1, \ldots, e_n, e_1 \wedge e_2, \ldots, e_{n-1} \wedge e_n, \ldots, e_1 \wedge \cdots \wedge e_n$ of elements of $G$ of the form $e_{i_1} \wedge \cdots \wedge e_{i_k} = e_I$, where $I = \{i_1 < \cdots < i_k\} \subset \{1, 2, \ldots, n\}$, forms a basis in $G$.

b) Starting from the result in a) one can carry out the following formal construction of the algebra $G = \bigwedge(X)$.

For the subsets $I = \{i_1, \ldots, i_k\}$ of $\{1, 2, \ldots, n\}$ shown in a) we form the formal elements $e_I$ (by identifying $e_{\{i\}}$ with $e_i$, and $e_\varnothing$ with 1), which we take as a basis of the vector space $G$ over the field $P$. We define multiplication in $G$ by the formula

$$\Bigl( \sum_I a_I e_I \Bigr) \Bigl( \sum_J b_J e_J \Bigr) = \sum_{I, J} a_I b_J\, \varepsilon(I, J)\, e_{I \cup J},$$

where $\varepsilon(I, J) = \operatorname{sgn} \prod_{i \in I,\, j \in J} (j - i)$. Verify that the Grassmann algebra $\bigwedge(X)$ is obtained in this way.

c) Prove the uniqueness (up to isomorphism) of the algebra $\bigwedge(X)$.

d) Show that the algebra $\bigwedge(X)$ is graded: $\bigwedge(X) = \bigoplus_{k=0}^{n} \bigwedge^k(X)$, where $\bigwedge^k(X)$ is the linear span of the elements of the form $e_{i_1} \wedge \cdots \wedge e_{i_k}$; here if $a \in \bigwedge^p(X)$ and $b \in \bigwedge^q(X)$, then $a \wedge b \in \bigwedge^{p+q}(X)$. Verify that $a \wedge b = (-1)^{pq}\, b \wedge a$.
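The formal multiplication rule of Problem 3b) is simple enough to experiment with. The following Python sketch is one possible encoding, not a solution of the problem: elements of $G$ are stored as dictionaries mapping a `frozenset` $I$ to the coefficient $a_I$, and the names `eps`, `multiply`, and `e` are hypothetical.

```python
from itertools import product

def eps(I, J):
    """epsilon(I, J): sign of the product of (j - i) over i in I, j in J (0 if I, J overlap)."""
    p = 1
    for i in I:
        for j in J:
            p *= (j - i)
    return (p > 0) - (p < 0)

def multiply(a, b):
    """Product of two elements given as dicts {frozenset I: coefficient a_I}."""
    out = {}
    for (I, aI), (J, bJ) in product(a.items(), b.items()):
        s = eps(I, J)
        if s:
            K = I | J
            out[K] = out.get(K, 0) + aI * bJ * s
    # Drop terms whose coefficients cancelled.
    return {K: v for K, v in out.items() if v != 0}

def e(*indices):
    """Basis element e_I for the index set I (e() is the identity 1)."""
    return {frozenset(indices): 1}

one = e()
x = {frozenset({1}): 2, frozenset({2}): 5}   # the vector 2 e_1 + 5 e_2

print(multiply(e(1), e(2)), multiply(e(2), e(1)), multiply(x, x))
```

In this encoding $e_1 e_2 = e_{\{1,2\}}$, $e_2 e_1 = -e_{\{1,2\}}$, and $x \wedge x$ reduces to the empty dictionary, which stands for zero.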

4. a) Let $A : X \to Y$ be a linear mapping of $X$ into $Y$. Show that there exists a unique homomorphism $\bigwedge(A) : \bigwedge(X) \to \bigwedge(Y)$ from $\bigwedge(X)$ into $\bigwedge(Y)$ that agrees with $A$ on the subspace $\bigwedge^1(X) \subset \bigwedge(X)$ identified with $X$.

b) Show that the homomorphism $\bigwedge(A)$ maps $\bigwedge^k(X)$ into $\bigwedge^k(Y)$. The restriction of $\bigwedge(A)$ to $\bigwedge^k(X)$ is denoted by $\bigwedge^k(A)$.

c) Let $\{e_i : i = 1, \ldots, m\}$ be a basis in $X$ and $\{e_j : j = 1, \ldots, n\}$ a basis in $Y$, and let the matrix $(a_i^j)$ correspond to the operator $A$ in these bases. Show that if $\{e_I : I \subset \{1, \ldots, m\}\}$, $\{e_J : J \subset \{1, \ldots, n\}\}$ are the corresponding bases of the spaces $\bigwedge(X)$ and $\bigwedge(Y)$, then the matrix of the operator $\bigwedge^k(A)$ has the form $a_I^J = \det(a_i^j)$, $i \in I$, $j \in J$, where $\operatorname{card} I = \operatorname{card} J = k$.

d) Verify that if $A : X \to Y$, $B : Y \to Z$ are linear operators, then the equality $\bigwedge(B \circ A) = \bigwedge(B) \circ \bigwedge(A)$ holds.

15.2 Manifolds

15.2.1 Definition of a Manifold 

Definition 1 A Hausdorff topological space whose topology has a countable base² is called an $n$-dimensional manifold if each of its points has a neighborhood $U$ homeomorphic either to all of $\mathbb{R}^n$ or to the half-space $H^n = \{x \in \mathbb{R}^n \mid x^1 \le 0\}$.

Definition 2 A mapping $\varphi : \mathbb{R}^n \to U \subset M$ (or $\varphi : H^n \to U \subset M$) that realizes the homeomorphism of Definition 1 is a local chart of the manifold $M$, $\mathbb{R}^n$ (or $H^n$) is called the parameter domain, and $U$ the range of the chart on the manifold $M$.

A local chart endows each point $x \in U$ with the coordinates of the point $t = \varphi^{-1}(x) \in \mathbb{R}^n$ corresponding to it. Thus, a local coordinate system is introduced in the region $U$; for that reason the mapping $\varphi$, or, in more expanded notation, the pair $(U, \varphi)$ is a map of the region $U$ in the ordinary meaning of the term.

Definition 3 A set of charts whose ranges taken together cover the entire manifold 
is called an atlas of the manifold. 

Example 1 The sphere $S^2 = \{x \in \mathbb{R}^3 \mid |x| = 1\}$ is a two-dimensional manifold. If we interpret $S^2$ as the surface of the Earth, then an atlas of geographical maps will be an atlas of the manifold $S^2$.

The one-dimensional sphere $S^1 = \{x \in \mathbb{R}^2 \mid |x| = 1\}$ - a circle in $\mathbb{R}^2$ - is obviously a one-dimensional manifold. In general, the sphere $S^n = \{x \in \mathbb{R}^{n+1} \mid |x| = 1\}$ is an $n$-dimensional manifold. (See Sect. 12.1.)

Remark 1 The object (the manifold $M$) introduced by Definition 1 obviously does not change if we replace $\mathbb{R}^n$ and $H^n$ by any parameter domains in $\mathbb{R}^n$ homeomorphic to them. For example, such a domain might be the open cube $I^n = \{x \in \mathbb{R}^n \mid 0 < x^i < 1,\ i = 1, \ldots, n\}$ and the cube with a face attached $\tilde I^n = \{x \in \mathbb{R}^n \mid 0 < x^1 \le 1,\ 0 < x^i < 1,\ i = 2, \ldots, n\}$. Such standard parameter domains are used quite often.

It is also not difficult to verify that the object introduced by Definition 1 does not change if we require only that each point $x \in M$ have a neighborhood $U$ in $M$ homeomorphic to some open subset of the half-space $H^n$.

Example 2 If $X$ is an $m$-dimensional manifold with an atlas of charts $\{(U_\alpha, \varphi_\alpha)\}$ and $Y$ is an $n$-dimensional manifold with atlas $\{(V_\beta, \psi_\beta)\}$, then $X \times Y$ can be regarded as an $(m + n)$-dimensional manifold with the atlas $\{(W_{\alpha\beta}, \chi_{\alpha\beta})\}$, where $W_{\alpha\beta} = U_\alpha \times V_\beta$ and the mapping $\chi_{\alpha\beta} = (\varphi_\alpha, \psi_\beta)$ maps the direct product of the domains of definition of $\varphi_\alpha$ and $\psi_\beta$ into $W_{\alpha\beta}$.

In particular, the two-dimensional torus $T^2 = S^1 \times S^1$ (Fig. 12.1) or the $n$-dimensional torus $T^n = \underbrace{S^1 \times \cdots \times S^1}_{n \text{ factors}}$ is a manifold of the corresponding dimension.

² See Sect. 9.2 and also Remarks 2 and 3 in the present section.

Fig. 15.1

If the ranges $U_i$ and $U_j$ of two charts $(U_i, \varphi_i)$ and $(U_j, \varphi_j)$ of a manifold $M$ intersect, that is, $U_i \cap U_j \ne \varnothing$, mutually inverse homeomorphisms $\varphi_{ij} : I_{ij} \to I_{ji}$ and $\varphi_{ji} : I_{ji} \to I_{ij}$ naturally arise between the sets $I_{ij} = \varphi_i^{-1}(U_j)$ and $I_{ji} = \varphi_j^{-1}(U_i)$. These homeomorphisms are given by $\varphi_{ij} = \varphi_j^{-1} \circ \varphi_i|_{I_{ij}}$ and $\varphi_{ji} = \varphi_i^{-1} \circ \varphi_j|_{I_{ji}}$. These homeomorphisms are often called changes of coordinates, since they effect a transition from one local coordinate system to another system of the same kind in their common range $U_i \cap U_j$ (Fig. 15.1).
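A concrete change of coordinates may help here. The Python sketch below is an illustration that is not taken from the text: it equips the circle $S^1$ with two charts given by inverse stereographic projection from the poles $(0, 1)$ and $(0, -1)$, and computes the transition $\varphi_S^{-1} \circ \varphi_N$, which turns out to be $t \mapsto 1/t$ on $t \ne 0$. The function names are hypothetical, and exact rational arithmetic is used so that the checks are not affected by rounding.

```python
from fractions import Fraction

def phi_N(t):
    """Chart R -> S^1 minus the pole (0, 1): inverse stereographic projection."""
    d = t * t + 1
    return (2 * t / d, (t * t - 1) / d)

def phi_S(s):
    """Chart R -> S^1 minus the pole (0, -1)."""
    d = s * s + 1
    return (2 * s / d, (1 - s * s) / d)

def phi_N_inv(p):
    x, y = p
    return x / (1 - y)

def phi_S_inv(p):
    x, y = p
    return x / (1 + y)

def transition_NS(t):
    """Change of coordinates phi_S^{-1} o phi_N, defined for t != 0."""
    return phi_S_inv(phi_N(t))

t = Fraction(3, 2)
p = phi_N(t)                 # a point of the circle lying in both chart ranges
print(p, transition_NS(t))
```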

Definition 4 The number n in Definition 1 is the dimension of the manifold M and 
is usually denoted dim M. 

Definition 5 If a point $\varphi^{-1}(x)$ on the boundary $\partial H^n$ of the half-space $H^n$ corresponds to a point $x \in U$ under the homeomorphism $\varphi : H^n \to U$, then $x$ is called a boundary point of the manifold $M$ (and of the neighborhood $U$). The set of all boundary points of a manifold $M$ is called the boundary of this manifold and is usually denoted $\partial M$.

By the topological invariance of interior points (Brouwer's theorem³) the concepts of dimension and boundary point of a manifold are unambiguously defined, that is, independent of the particular local charts used in Definitions 4 and 5. We have not proved Brouwer's theorem, but the invariance of interior points under diffeomorphisms is well-known to us (a consequence of the inverse function theorem). Since it is diffeomorphisms that we shall be dealing with, we shall not digress here to discuss Brouwer's theorem.

³ This theorem asserts that under a homeomorphism $\varphi : E \to \varphi(E)$ of a set $E \subset \mathbb{R}^n$ onto a set $\varphi(E) \subset \mathbb{R}^n$ the interior points of $E$ map to interior points of $\varphi(E)$.

Example 3 The closed ball $\bar B^n = \{x \in \mathbb{R}^n \mid |x| \le 1\}$ or, as we say, the $n$-dimensional disk, is an $n$-dimensional manifold whose boundary is the $(n - 1)$-dimensional sphere $S^{n-1} = \{x \in \mathbb{R}^n \mid |x| = 1\}$.

Remark 2 A manifold M having a nonempty set of boundary points is usually called 
a manifold with boundary, the term manifold (in the proper sense of the term) be- 
ing reserved for manifolds without boundary. In Definition 1 these cases are not 
distinguished. 

Proposition 1 The boundary $\partial M$ of an $n$-dimensional manifold with boundary $M$ is an $(n - 1)$-dimensional manifold without boundary.

Proof Indeed, $\partial H^n = \mathbb{R}^{n-1}$, and the restrictions to $\partial H^n$ of the charts of the form $\varphi_i : H^n \to U_i$ belonging to an atlas of $M$ generate an atlas of $\partial M$. □

Example 4 Consider the planar double pendulum (Fig. 15.2) with arm $a$ shorter than arm $b$, both being free to oscillate, except that the oscillations of $b$ are limited in range by barriers. The configuration of such a system is characterized at each instant of time by the two angles $\alpha$ and $\beta$. If there were no constraints, the configuration space of the double pendulum could be identified with the two-dimensional torus $T^2 = S^1_\alpha \times S^1_\beta$.

Under these constraints, the configuration space of the double pendulum is parametrized by the points of the cylinder $S^1_\alpha \times I^1_\beta$, where $S^1_\alpha$ is the circle, corresponding to all possible positions of the arm $a$, and $I^1_\beta = \{\beta \in \mathbb{R} \mid |\beta| \le \Delta\}$ is the interval within which the angle $\beta$ may vary, characterizing the position of the arm $b$.

In this case we obtain a manifold with boundary. The boundary of this manifold consists of the two circles $S^1_\alpha \times \{-\Delta\}$ and $S^1_\alpha \times \{\Delta\}$, which are the products of the circle $S^1_\alpha$ and the endpoints $\{-\Delta\}$ and $\{\Delta\}$ of the interval $I^1_\beta$.

Remark 3 It can be seen from Example 4 just considered that coordinates sometimes arise naturally on $M$ ($\alpha$ and $\beta$ in this example), and they themselves induce a topology on $M$. Hence, in Definition 1 of a manifold, it is not always necessary to require in advance that $M$ have a topology. The essence of the concept of a manifold is that the points of some set $M$ can be parametrized by the points of a set of subdomains of $\mathbb{R}^n$. A natural connection then arises between the coordinate systems that thereby arise on parts of $M$, expressed in the mappings of the corresponding domains of $\mathbb{R}^n$. Hence we can assume that $M$ is obtained from a collection of domains of $\mathbb{R}^n$ by exhibiting some rule for identifying their points or, figuratively speaking, exhibiting a rule for gluing them together. Thus defining a manifold essentially means giving a set of subdomains of $\mathbb{R}^n$ and a rule of correspondence for the points of these subdomains. We shall not take the time to make this any more precise by formalizing the concept of gluing or identifying points, introducing a topology on $M$, and the like.

Definition 6 A manifold is compact (resp. connected) if it is compact (resp. con- 
nected) as a topological space. 

The manifolds considered in Examples 1-4 are compact and connected. The boundary of the cylinder $S^1_\alpha \times I^1_\beta$ in Example 4 consists of two independent circles and is a one-dimensional compact, but not connected, manifold. The boundary $S^{n-1} = \partial \bar B^n$ of the $n$-dimensional disk of Example 3 is a compact manifold, which is connected for $n > 1$ and disconnected (it consists of two points) if $n = 1$.

Example 5 The space $\mathbb{R}^n$ itself is obviously a connected noncompact manifold without boundary, and the half-space $H^n$ provides the simplest example of a connected noncompact manifold with boundary. (In both cases the atlas can be taken to consist of the single chart corresponding to the identity mapping.)

Proposition 2 If a manifold $M$ is connected, it is path connected.

Proof After fixing a point $x_0 \in M$, consider the set $E_{x_0}$ of points of $M$ that can be joined to $x_0$ by a path in $M$. The set $E_{x_0}$, as one can easily verify from the definition of a manifold, is both open and closed in $M$. But that means that $E_{x_0} = M$. □

Example 6 If to each real $n \times n$ matrix we assign the point of $\mathbb{R}^{n^2}$ whose coordinates are obtained by writing out the elements of the matrix in some fixed order, then the group $GL(n, \mathbb{R})$ of nonsingular $n \times n$ matrices becomes a manifold of dimension $n^2$. This manifold is noncompact (the elements of the matrices are not bounded) and nonconnected. This last fact follows from the fact that $GL(n, \mathbb{R})$ contains matrices with both positive and negative determinants. The points of $GL(n, \mathbb{R})$ corresponding to two such matrices cannot be joined by a path. (On such a path there would have to be a point corresponding to a matrix whose determinant is zero.)

Example 7 The group $SO(2, \mathbb{R})$ of orthogonal mappings of the plane $\mathbb{R}^2$ having determinant equal to 1 consists of matrices of the form $\begin{pmatrix} \cos\alpha & \sin\alpha \\ -\sin\alpha & \cos\alpha \end{pmatrix}$ and hence can be regarded as a manifold that is identified with the circle - the domain of variation of the angular parameter $\alpha$. Thus $SO(2, \mathbb{R})$ is a one-dimensional compact connected manifold. If we also allow reflections about lines in the plane $\mathbb{R}^2$, we obtain the group $O(2, \mathbb{R})$ of all real orthogonal $2 \times 2$ matrices. It can be naturally identified with two different circles, corresponding to matrices with determinants $+1$ and $-1$ respectively. That is, $O(2, \mathbb{R})$ is a one-dimensional compact, but not connected manifold.

Example 8 Let $a$ be a vector in $\mathbb{R}^2$ and $T_a$ the group of rigid motions of the plane generated by $a$. The elements of $T_a$ are translations by vectors of the form $na$, where $n \in \mathbb{Z}$. Under the action of the elements $g$ of the group $T_a$ each point $x$ of the plane is displaced to a point $g(x)$ of the form $x + na$. The set of all points to which a given point $x \in \mathbb{R}^2$ passes under the action of the elements of this group of transformations is called its orbit. The property of points of $\mathbb{R}^2$ of belonging to the same orbit is obviously an equivalence relation on $\mathbb{R}^2$, and the orbits are the equivalence classes of this relation. A domain in $\mathbb{R}^2$ containing one point from each equivalence class is called a fundamental domain of this group of automorphisms (for a more precise statement see Problem 5d)).

In the present case we can take as a fundamental domain a strip of width $|a|$ bounded by two parallel lines orthogonal to $a$. We need only take into account that these lines themselves are obtained from each other through translations by $a$ and $-a$ respectively. Inside a strip of width less than $|a|$ and orthogonal to $a$ there are no equivalent points, so that all orbits having representatives in that strip are endowed uniquely with the coordinates of their representatives. Thus the quotient set $\mathbb{R}^2 / T_a$ consisting of orbits of the group $T_a$ becomes a manifold. From what was said above about a fundamental domain, one can easily see that this manifold is homeomorphic to the cylinder obtained by gluing the boundary lines of a strip of width $|a|$ together at equivalent points.

Example 9 Now let $a$ and $b$ be a pair of orthogonal vectors of the plane $\mathbb{R}^2$ and $T_{a,b}$ the group of translations generated by these vectors. In this case a fundamental domain is the rectangle with sides $a$ and $b$. Inside this rectangle the only equivalent points are those that lie on opposite sides. After gluing the sides of this fundamental rectangle together, we verify that the resulting manifold $\mathbb{R}^2 / T_{a,b}$ is homeomorphic to the two-dimensional torus.

Example 10 Now consider the group $G_{a,b}$ of rigid motions of the plane $\mathbb{R}^2$ generated by the transformations $a(x, y) = (x + 1, 1 - y)$ and $b(x, y) = (x, y + 1)$.

A fundamental domain for the group $G_{a,b}$ is the unit square whose horizontal sides are identified at points lying on the same vertical line, but whose vertical sides are identified at points symmetric about the center. Thus the resulting manifold $\mathbb{R}^2 / G_{a,b}$ turns out to be homeomorphic to the Klein bottle (see Sect. 12.1).
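The gluing rule of Example 10 can be made concrete by computing, for each point of the plane, a representative of its orbit in the half-open unit square. The Python sketch below is an illustration under stated assumptions: the function name `representative` is hypothetical, points are pairs of exact rationals, and the reduction uses the observation that $a^n(x, y) = (x + n, y)$ for even $n$ and $(x + n, 1 - y)$ for odd $n$.

```python
from fractions import Fraction
from math import floor

def a(p):
    """Generator a(x, y) = (x + 1, 1 - y)."""
    x, y = p
    return (x + 1, 1 - y)

def b(p):
    """Generator b(x, y) = (x, y + 1)."""
    x, y = p
    return (x, y + 1)

def representative(p):
    """Point of the orbit of p lying in the half-open square [0, 1) x [0, 1)."""
    x, y = p
    n = floor(x)
    # Apply a^{-n}: x is shifted by -n, and y is reflected when n is odd.
    x -= n
    if n % 2:
        y = 1 - y
    # Apply a power of b to bring y into [0, 1).
    y -= floor(y)
    return (x, y)

p = (Fraction(1, 4), Fraction(3, 10))
print(representative(a(p)), representative(a(b(a(p)))))
# A point on the right vertical side is glued to the point on the left side
# that is symmetric to it about the center of the square.
print(representative((Fraction(1), Fraction(3, 10))))
```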

We shall not take time to discuss here the useful and important examples studied 
in Sect. 12.1. 




15.2.2 Smooth Manifolds and Smooth Mappings 

Definition 7 An atlas of a manifold is smooth (of class $C^{(k)}$ or analytic) if all the coordinate-changing functions for the atlas are smooth mappings (diffeomorphisms) of the corresponding smoothness class.

Two atlases of a given smoothness (the same smoothness for both) are equivalent 
if their union is an atlas of this smoothness. 

Example 11 An atlas consisting of a single chart can be regarded as having any desired smoothness. Consider in this connection the atlas on the line $\mathbb{R}^1$ generated by the identity mapping $\mathbb{R}^1 \ni x \mapsto \varphi(x) = x \in \mathbb{R}^1$ and a second atlas - generated by any strictly monotonic function $\mathbb{R}^1 \ni x \mapsto \tilde\varphi(x) \in \mathbb{R}^1$, mapping $\mathbb{R}^1$ onto $\mathbb{R}^1$. The union of these atlases is an atlas having smoothness equal to the smaller of the smoothnesses of $\tilde\varphi$ and $\tilde\varphi^{-1}$.

In particular, if $\tilde\varphi(x) = x^3$, then the atlas consisting of the two charts $\{x, x^3\}$ is not smooth, since $\tilde\varphi^{-1}(x) = x^{1/3}$. Using what has just been said, we can construct infinitely smooth atlases in $\mathbb{R}^1$ whose union is an atlas of a preassigned smoothness class $C^{(k)}$.
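The failure of smoothness in Example 11 comes from the point $0$, where the coordinate change $x \mapsto x^{1/3}$ has no finite derivative. A small numerical experiment makes this plausible, although of course it proves nothing; the helper names in this Python sketch are hypothetical.

```python
def cube_root(x):
    """Real cube root, the inverse of the chart x -> x**3."""
    magnitude = abs(x) ** (1.0 / 3.0)
    return magnitude if x >= 0 else -magnitude

def difference_quotient(h):
    """Difference quotient of the cube root at 0 with step h."""
    return (cube_root(h) - cube_root(0.0)) / h

# The quotient equals h**(-2/3), so it grows without bound as h -> 0.
quotients = [difference_quotient(10.0 ** (-k)) for k in (3, 6, 9)]
print(quotients)
```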

Definition 8 A smooth manifold (of class $C^{(k)}$ or analytic) is a manifold $M$ with an equivalence class of atlases of the given smoothness.

After this definition the following terminology is comprehensible: topological manifold (of class $C^{(0)}$), $C^{(k)}$-manifold, analytic manifold.

To give the entire equivalence class of atlases of a given smoothness on a mani- 
fold M it suffices to give any atlas A of this equivalence class. Thus we can assume 
that a smooth manifold is a pair (M, A), where M is a manifold and A an atlas of 
the given smoothness on M. 

The set of equivalent atlases of a given smoothness on a manifold is often called a structure of this smoothness on the manifold. There may be different smooth structures of even the same smoothness on a given topological manifold (see Example 11 and Problem 3).

Let us consider some more examples in which our main attention is directed to 
the smoothness of the coordinate changes. 

Example 12 The one-dimensional manifold $\mathbb{RP}^1$, called the real projective line, is the pencil of lines in $\mathbb{R}^2$ passing through the origin, with the natural notion of distance between two lines (measured, for example, by the magnitude of the smaller angle between them). Each line of the pencil is uniquely determined by a nonzero direction vector $(x^1, x^2)$, and two such vectors give the same line if and only if they are collinear. Hence $\mathbb{RP}^1$ can be regarded as a set of equivalence classes of ordered pairs $(x^1, x^2)$ of real numbers. Here at least one of the numbers in the pair must be nonzero, and two pairs are considered equivalent (identified) if they are proportional. The pairs $(x^1, x^2)$ are usually called homogeneous coordinates on $\mathbb{RP}^1$. Using the interpretation of $\mathbb{RP}^1$ in homogeneous coordinates, it is easy to construct an atlas of two charts on $\mathbb{RP}^1$. Let $U_i$, $i = 1, 2$, be the lines (classes of pairs $(x^1, x^2)$) in $\mathbb{RP}^1$ for which $x^i \ne 0$. To each point (line) $p \in U_1$ there corresponds a unique pair $\bigl(1, \frac{x^2}{x^1}\bigr)$ determined by the number $t_1^2 = \frac{x^2}{x^1}$. Similarly the points of the region $U_2$ are in one-to-one correspondence with pairs of the form $\bigl(\frac{x^1}{x^2}, 1\bigr)$ and are determined by the number $t_2^1 = \frac{x^1}{x^2}$. Thus local coordinates arise in $U_1$ and $U_2$, which obviously correspond to the topology introduced above on $\mathbb{RP}^1$. In the common range $U_1 \cap U_2$ of these local charts the coordinates they introduce are connected by the relations $t_2^1 = (t_1^2)^{-1}$ and $t_1^2 = (t_2^1)^{-1}$, which shows that the atlas is not only $C^{(\infty)}$ but even analytic.

It is useful to keep in mind the following interpretation of the manifold $\mathbb{RP}^1$. Each line of the original pencil of lines is completely determined by its intersection with the unit circle. But there are exactly two such points, diametrically opposite to each other. Lines are near if and only if the corresponding points of the circle are near. Hence $\mathbb{RP}^1$ can be interpreted as a circle with diametrically opposite points identified (glued together). If we take only a semicircle, there is only one pair of identified points on it, the end-points. Gluing them together, we again obtain a topological circle. Thus $\mathbb{RP}^1$ is homeomorphic to the circle as a topological space.


Example 13 If we now consider the pencil of lines passing through the origin in $\mathbb{R}^3$, or, what is the same, the set of equivalence classes of ordered triples $(x^1, x^2, x^3)$ of real numbers that are not all three zero, we obtain the real projective plane $\mathbb{RP}^2$. In the regions $U_1$, $U_2$, and $U_3$ where $x^1 \ne 0$, $x^2 \ne 0$, $x^3 \ne 0$ respectively, we introduce local coordinate systems $\bigl(1, \frac{x^2}{x^1}, \frac{x^3}{x^1}\bigr) = (1, t_1^2, t_1^3) \sim (t_1^2, t_1^3)$, $\bigl(\frac{x^1}{x^2}, 1, \frac{x^3}{x^2}\bigr) = (t_2^1, 1, t_2^3) \sim (t_2^1, t_2^3)$, and $\bigl(\frac{x^1}{x^3}, \frac{x^2}{x^3}, 1\bigr) = (t_3^1, t_3^2, 1) \sim (t_3^1, t_3^2)$, which are obviously connected by the relations $t_i^j = (t_j^i)^{-1}$, $t_i^j = t_k^j (t_k^i)^{-1}$, which apply in the common portions of the ranges of these charts.

For example, the transition from $(t_1^2, t_1^3)$ to $(t_2^1, t_2^3)$ in the domain $U_1 \cap U_2$ is given by the formulas

$$t_2^1 = (t_1^2)^{-1}, \qquad t_2^3 = t_1^3 \cdot (t_1^2)^{-1}.$$

The Jacobian of this transformation is $-(t_1^2)^{-3}$, and since $t_1^2 = \frac{x^2}{x^1}$, it is defined and nonzero at points of the set $U_1 \cap U_2$ under consideration.
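For the reader who wishes to see where the value of the Jacobian comes from, here is the computation written out, with the abbreviations $u = t_1^2$ and $v = t_1^3$ introduced only for this calculation. The transition is $(u, v) \mapsto (u^{-1}, v u^{-1})$, and its Jacobi matrix is lower triangular:

```latex
\frac{\partial (t_2^1, t_2^3)}{\partial (t_1^2, t_1^3)}
  = \begin{pmatrix}
      \dfrac{\partial (u^{-1})}{\partial u} & \dfrac{\partial (u^{-1})}{\partial v} \\[2ex]
      \dfrac{\partial (v u^{-1})}{\partial u} & \dfrac{\partial (v u^{-1})}{\partial v}
    \end{pmatrix}
  = \begin{pmatrix}
      -u^{-2} & 0 \\
      -v u^{-2} & u^{-1}
    \end{pmatrix},
\qquad
\det \begin{pmatrix}
      -u^{-2} & 0 \\
      -v u^{-2} & u^{-1}
    \end{pmatrix}
  = -u^{-2} \cdot u^{-1} = -u^{-3} = -(t_1^2)^{-3}.
```

The determinant does not depend on $v$, and it is finite and nonzero precisely because $u = t_1^2 \ne 0$ on $U_1 \cap U_2$.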

Thus RP 2 is a two-dimensional manifold having an analytic atlas consisting of 
three charts. 

By the same considerations as in Example 12, where we studied the projective line $\mathbb{RP}^1$, we can interpret the projective plane $\mathbb{RP}^2$ as the two-dimensional sphere $S^2 \subset \mathbb{R}^3$ with antipodal points identified, or as a hemisphere, with diametrically opposite points of its boundary circle identified. Projecting the hemisphere into the plane, we obtain the possibility of interpreting $\mathbb{RP}^2$ as a (two-dimensional) disk with diametrically opposite points of its boundary circle identified.

Example 14 The set of lines in the plane $\mathbb{R}^2$ can be covered by two sets: $U$, the nonvertical lines, and $V$, the nonhorizontal lines. Each line in $U$ has an equation of the form $y = u_1 x + u_2$, and hence is characterized by the coordinates $(u_1, u_2)$, while each line in $V$ has an equation $x = v_1 y + v_2$ and is determined by coordinates $(v_1, v_2)$. For lines in the intersection $U \cap V$ we have the coordinate transformation $v_1 = u_1^{-1}$, $v_2 = -u_2 u_1^{-1}$ and $u_1 = v_1^{-1}$, $u_2 = -v_2 v_1^{-1}$. Thus this set is endowed with an analytic atlas consisting of two charts.
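The coordinate change of Example 14 can be verified on a sample line. In the Python sketch below the function names `u_to_v` and `v_to_u` and the chosen line are hypothetical; exact rational arithmetic is used so that the equalities hold without rounding.

```python
from fractions import Fraction

def u_to_v(u1, u2):
    """Coordinates (v1, v2) in V of the line y = u1*x + u2, valid when u1 != 0."""
    return (1 / u1, -u2 / u1)

def v_to_u(v1, v2):
    """Coordinates (u1, u2) in U of the line x = v1*y + v2, valid when v1 != 0."""
    return (1 / v1, -v2 / v1)

u1, u2 = Fraction(2), Fraction(-3)          # the line y = 2x - 3
v1, v2 = u_to_v(u1, u2)

# Sample points on the line, computed from the equation y = u1*x + u2.
points = [(x, u1 * x + u2) for x in (Fraction(0), Fraction(1), Fraction(5, 2))]
# Each of them should also satisfy x = v1*y + v2.
on_same_line = all(x == v1 * y + v2 for x, y in points)
print((v1, v2), on_same_line)
```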

Every line in the plane has an equation $ax + by + c = 0$ and is characterized by a triple of numbers $(a, b, c)$, proportional triples defining the same line. For that reason, it might appear that we are again dealing with the projective plane $\mathbb{RP}^2$ considered in Example 13. However, whereas in $\mathbb{RP}^2$ we admitted any triples of numbers not all zero, now we do not admit triples of the form $(0, 0, c)$ where $c \ne 0$. A single point in $\mathbb{RP}^2$ corresponds to the set of all such triples. Hence the manifold obtained in our present example is homeomorphic to the one obtained from $\mathbb{RP}^2$ by removing one point. If we interpret $\mathbb{RP}^2$ as a disk with diametrically opposite points of the boundary circle identified, then, deleting the center of the circle, we obtain, up to homeomorphism, an annulus whose outer circle is glued together at diametrically opposite points. By a simple incision one can easily show that the result is none other than the familiar Möbius band.

Definition 9 Let $M$ and $N$ be $C^{(k)}$-manifolds. A mapping $f : M \to N$ is $l$-smooth (a $C^{(l)}$-mapping) if the local coordinates of the point $f(x) \in N$ are $C^{(l)}$-functions of the local coordinates of $x \in M$.

This definition has an unambiguous meaning (one that is independent of the choice of local coordinates) if $l \le k$.

In particular, the smooth mappings of $M$ into $\mathbb{R}^1$ are smooth functions on $M$, and the smooth mappings of $\mathbb{R}^1$ (or an interval of $\mathbb{R}^1$) into $M$ are smooth paths on $M$.

Thus the degree of smoothness of a function $f : M \to N$ on a manifold $M$ cannot exceed the degree of smoothness of the manifold itself.


15.2.3 Orientation of a Manifold and Its Boundary 

Definition 10 Two charts of a smooth manifold are consistent if the transition from 
the local coordinates in one to the other in their common range is a diffeomorphism 
whose Jacobian is everywhere positive. 

In particular, if the ranges of two local charts have empty intersection, they are 
considered consistent. 
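A small worked example, added here as an illustration and not taken from the text, shows how Definition 10 is used. Take two charts on the circle $S^1$ with local coordinates $t$ and $s$ (for instance, those obtained by stereographic projection from two opposite points) whose change of coordinates on the common range is $s = 1/t$, $t \ne 0$:

```latex
s = \frac{1}{t}, \qquad \frac{ds}{dt} = -\frac{1}{t^{2}} < 0
\quad\Longrightarrow\quad \text{the charts with coordinates } t \text{ and } s \text{ are not consistent};
\\[1ex]
\tilde s = -s = -\frac{1}{t}, \qquad \frac{d\tilde s}{dt} = \frac{1}{t^{2}} > 0
\quad\Longrightarrow\quad \text{the charts with coordinates } t \text{ and } \tilde s \text{ are consistent}.
```

Thus a pair of charts that fails the test of Definition 10 can sometimes be repaired by composing one chart with a reflection of its parameter domain; the circle admits an atlas of two pairwise consistent charts.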

Definition 11 An atlas $A$ of a smooth manifold $(M, A)$ is an orienting atlas of $M$ if it consists of pairwise consistent charts.

Definition 12 A manifold is orientable if it has an orienting atlas. Otherwise it is nonorientable.

Two orienting atlases of a manifold will be regarded as equivalent (in the sense 
of the question of orientation of the manifold considered just now) if their union is 
also an orienting atlas of the manifold. It is easy to see that this relation really is an 
equivalence relation. 

Definition 13 An equivalence class of orienting atlases of a manifold in the relation 
just defined is called an orientation class of atlases of the manifold or an orientation 
of the manifold. 

Definition 14 An oriented manifold is a manifold with this class of orientations of 
its atlases, that is, with a fixed orientation on the manifold. 

Thus orienting the manifold means exhibiting (by some means or other) a certain 
orientation class of atlases on it. To do this, for example, it suffices to exhibit any 
specific orienting atlas from the orientation class. 

Various methods used in practice to define an orientation of manifolds embedded in $\mathbb{R}^n$ are described in Sects. 12.2 and 12.3.

Proposition 3 A connected manifold is either nonorientable or admits exactly two 
orientations. 

Proof Let $A$ and $\tilde A$ be two orienting atlases of the manifold M with diffeomorphic
transitions from the local coordinates of charts of one to charts of the other. Assume
that there is a point $p_0 \in M$ and two charts of these atlases whose ranges $U_{i_0}$ and
$\tilde U_{j_0}$ contain $p_0$, and suppose the Jacobian of the change of coordinates of the charts
at points of the parameter space corresponding to the point $p_0$ is positive. We shall
show that then for every point $p \in M$ and any charts of the atlases $A$ and $\tilde A$ whose
ranges contain p the Jacobian of the coordinate transformation at corresponding
coordinate points is also positive.

We begin by making the obvious observation that if the Jacobian of the transformation
is positive (resp. negative) at the point p for some pair of charts containing
p in the atlases $A$ and $\tilde A$, then it is positive (resp. negative) at p for every such pair
of charts, since inside each given atlas the coordinate transformations occur with
positive Jacobian, and the Jacobian of a composition of two mappings is the product
of the Jacobians of the individual mappings.

Now let E be the subset of M consisting of the points $p \in M$ at which the coordinate
transformations from the charts of one atlas to those of the other have positive
Jacobian.

The set E is nonempty, since $p_0 \in E$. The set E is open in M. Indeed, for every
point $p \in E$ there exist ranges $U_i$ and $\tilde U_j$ of certain charts of the atlases $A$ and $\tilde A$
containing p. The sets $U_i$ and $\tilde U_j$ are open in M, so that the set $U_i \cap \tilde U_j$ is open
in M. On the connected component of the set $U_i \cap \tilde U_j$ containing p, which is open
in $U_i \cap \tilde U_j$ and in M, the Jacobian of the transformation cannot change sign without
vanishing at some point. That is, in some neighborhood of p the Jacobian remains
positive, which proves that E is open. But E is also closed in M. This follows from
the continuity of the Jacobian of a diffeomorphism and the fact that the Jacobian of
a diffeomorphism never vanishes.

Thus E is a nonempty open-closed subset of the connected set M. Hence $E = M$,
and the atlases $A$ and $\tilde A$ define the same orientation on M.

Replacing one coordinate, say $t^1$ by $-t^1$, in every chart of the atlas A, we obtain
the orienting atlas $-A$ belonging to a different orientation class. Since the Jacobians
of the coordinate transformations from an arbitrary chart to the charts of $A$ and $-A$
have opposite signs, every atlas that orients M is equivalent either to $A$ or to $-A$. □

Definition 15 A finite sequence of charts of a given atlas will be called a chain of
charts if the ranges of any pair of charts having adjacent indices have a nonempty
intersection ($U_i \cap U_{i+1} \neq \varnothing$).

Definition 16 A chain of charts is contradictory or disorienting if the Jacobian of 
the coordinate transformation from each chart in the chain to the next is positive 
and the ranges of the first and last charts of the chain intersect, but the coordinate 
transformation from the last to the first has negative Jacobian. 

Proposition 4 A manifold is orientable if and only if there does not exist a contra- 
dictory chain of charts on it. 

Proof Since every manifold decomposes into connected components whose orienta- 
tions can be defined independently, it suffices to prove Proposition 4 for a connected 
manifold M. 

Necessity. Suppose the connected manifold M is orientable and A is an atlas defin- 
ing an orientation. From what has been said and Proposition 3, every smooth local 
chart of the manifold M connected with the charts of the atlas A is either consistent 
with all the charts of A or consistent with all the charts of $-A$. This can easily be
seen from Proposition 3 itself, if we restrict charts of A to the range of the chart we 
have taken, which can be regarded as a connected manifold oriented by one chart. It 
follows from this that there is no contradictory chain of charts on M. 

Sufficiency. It follows from Definition 1 that there exists an atlas on the manifold
consisting of a finite or countable number of charts. We take such an atlas A and
number its charts. Consider the chart $(U_1, \varphi_1)$ and any chart $(U_i, \varphi_i)$ such that
$U_1 \cap U_i \neq \varnothing$. Then the Jacobians of the coordinate transformations $\varphi_{1i}$ and $\varphi_{i1}$
are either everywhere negative or everywhere positive in their domains of definition.
The Jacobians cannot have values of different signs, since otherwise one could
exhibit connected subsets $U_-$ and $U_+$ in $U_1 \cap U_i$ where the Jacobian is negative and
positive respectively, and the chain of charts $(U_1, \varphi_1)$, $(U_+, \varphi_1)$, $(U_i, \varphi_i)$, $(U_-, \varphi_i)$
would be contradictory.

Thus, changing the sign of one coordinate if necessary in the chart $(U_i, \varphi_i)$, we
could obtain a chart with the same range $U_i$ and consistent with $(U_1, \varphi_1)$. After that
procedure, two charts $(U_i, \varphi_i)$ and $(U_j, \varphi_j)$ such that $U_1 \cap U_i \neq \varnothing$, $U_1 \cap U_j \neq \varnothing$,
$U_i \cap U_j \neq \varnothing$ are themselves consistent: otherwise we would have constructed a
contradictory chain of three charts.

Thus, all the charts of an atlas whose ranges intersect $U_1$ can now be considered
consistent with one another. Taking each of those charts now as the standard, one 
can adjust the charts of the atlas not covered in the first stage so that they are con- 
sistent. No contradictions arise when we do this, since by hypothesis, there are no 
contradictory chains on the manifold. Continuing this process and taking account of 
the connectedness of the manifold, we construct on it an atlas consisting of pairwise 
consistent charts, which proves the orientability of the manifold. □ 
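The adjustment procedure in the sufficiency argument can be sketched as a small graph search. The sketch below is only a combinatorial model and makes assumptions not in the text: charts are numbered from 0, each overlap of two charts is assumed connected so that the Jacobian has one sign on it, and the names `orient_atlas` and `overlap_sign` are hypothetical.

```python
from collections import deque


def orient_atlas(num_charts, overlap_sign):
    """Try to make all charts pairwise consistent by flipping one coordinate.

    overlap_sign maps a pair (i, j) of chart indices to +1 or -1, the sign of
    the Jacobian of the transition between charts i and j on their overlap
    (assumed constant there). Returns a list flip with flip[i] in {+1, -1},
    where -1 means "reverse one coordinate of chart i", or None if a
    contradictory chain of charts exists.
    """
    neighbours = {i: [] for i in range(num_charts)}
    for (i, j), sign in overlap_sign.items():
        neighbours[i].append((j, sign))
        neighbours[j].append((i, sign))

    flip = [None] * num_charts
    for start in range(num_charts):  # one pass per connected component
        if flip[start] is not None:
            continue
        flip[start] = 1  # take this chart as the standard
        queue = deque([start])
        while queue:
            i = queue.popleft()
            for j, sign in neighbours[i]:
                # After flipping, the Jacobian sign is flip[i] * flip[j] * sign,
                # so consistency with chart i forces this value of flip[j].
                required = flip[i] * sign
                if flip[j] is None:
                    flip[j] = required
                    queue.append(j)
                elif flip[j] != required:
                    return None  # a contradictory chain has been found
    return flip


# Three charts arranged in a ring, as on a band.
cylinder = {(0, 1): 1, (1, 2): 1, (0, 2): 1}
twisted = {(0, 1): 1, (1, 2): 1, (0, 2): -1}
print(orient_atlas(3, cylinder))
print(orient_atlas(3, twisted))
```

In this model a ring of three charts with one negative transition plays the role of a contradictory chain, so the search fails, while a ring with all transitions positive is oriented without any flips.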

This criterion for orientability of the manifold, like the considerations used in its
proof, can be applied to the study of specific manifolds. Thus, the manifold $\mathbb{RP}^1$
studied in Example 12 is orientable. From the atlas shown there it is easy to obtain
an orienting atlas of $\mathbb{RP}^1$. To do this, it suffices to reverse the sign of the local
coordinates of one of the two charts constructed there. However, the orientability of
the projective line $\mathbb{RP}^1$ obviously also follows from the fact that the manifold $\mathbb{RP}^1$
is homeomorphic to a circle.

The projective plane $\mathbb{RP}^2$ is nonorientable: every pair of charts in the atlas
constructed in Example 13 is such that the coordinate transformations have domains
of positivity and domains of negativity of the Jacobian. As we saw in the proof of
Proposition 4, it follows from this that a contradictory chain of charts on $\mathbb{RP}^2$ exists.
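The sign change of the Jacobian can be seen in a direct computation. The following is only an illustration under an assumption: it takes the atlas of Example 13 to consist of the standard affine charts of the projective plane in homogeneous coordinates $(X^0 : X^1 : X^2)$, and the coordinate names $u$, $v$ are introduced here for the purpose.

```latex
% Assumed charts (standard affine charts, notation introduced for this sketch):
%   on {X^0 \neq 0}:  (u^1, u^2) = (X^1/X^0, X^2/X^0),
%   on {X^1 \neq 0}:  (v^1, v^2) = (X^0/X^1, X^2/X^1).
% Transition on the common range u^1 \neq 0:
\[
  v^1 = \frac{1}{u^1}, \qquad v^2 = \frac{u^2}{u^1},
\]
\[
  \det \frac{\partial(v^1, v^2)}{\partial(u^1, u^2)}
  = \det
  \begin{pmatrix}
    -\dfrac{1}{(u^1)^2} & 0 \\[2ex]
    -\dfrac{u^2}{(u^1)^2} & \dfrac{1}{u^1}
  \end{pmatrix}
  = -\frac{1}{(u^1)^3}.
\]
% The Jacobian is negative where u^1 > 0 and positive where u^1 < 0.
```

The common range of the two charts thus splits into a domain of negativity ($u^1 > 0$) and a domain of positivity ($u^1 < 0$) of the Jacobian, which is the situation described above.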

For the same reason the manifold considered in Example 14 is nonorientable, 
which, as was noted, is homeomorphic to a Mobius band. 

Proposition 5 The boundary of an orientable smooth n-dimensional manifold is an
orientable $(n-1)$-dimensional manifold admitting a structure of the same smoothness
as the original manifold.

Proof The proof of Proposition 5 is a verbatim repetition of the proof of the analogous
Proposition 2 of Sect. 12.3.2 for surfaces embedded in $\mathbb{R}^n$. □

Definition 17 If $A(M) = \{(H^n, \varphi_i, U_i)\} \cup \{(\mathbb{R}^n, \varphi_j, U_j)\}$ is an atlas that orients
the manifold M, then the charts $A(\partial M) = \{(\mathbb{R}^{n-1}, \varphi_i|_{\partial H^n = \mathbb{R}^{n-1}}, \partial U_i)\}$ provide an
orienting atlas for the boundary $\partial M$ of M. The orientation of the boundary defined
by this atlas is called the orientation of the boundary induced by the orientation of
the manifold.

Important techniques for defining the orientation of a surface embedded in $\mathbb{R}^n$
and the induced orientation of its boundary, which are frequently used in practice, 
were described in detail in Sects. 12.2 and 12.3. 


15.2.4 Partitions of Unity and the Realization of Manifolds
as Surfaces in $\mathbb{R}^n$

In this subsection we shall describe a special construction called a partition of unity. 
This construction is often the basic device for reducing global problems to local 
ones. Later on we shall demonstrate it in deriving Stokes’ formula on a manifold, 
but here we shall use the partition of unity to clarify the possibility of realizing any 
manifold as a surface in $\mathbb{R}^n$ of sufficiently high dimension.

Lemma One can construct a function $f \in C^{(\infty)}(\mathbb{R}, \mathbb{R})$ on $\mathbb{R}$ such that $f(x) \equiv 0$
for $|x| \ge 3$, $f(x) \equiv 1$ for $|x| \le 1$, and $0 < f(x) < 1$ for $1 < |x| < 3$.

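Proof We shall construct one such function using the familiar function

$$g(x) = \begin{cases} e^{-1/x^2} & \text{for } x \neq 0, \\ 0 & \text{for } x = 0. \end{cases}$$

Previously (see Exercise 2 of Sect. 5.2) we verified that $g \in C^{(\infty)}(\mathbb{R}, \mathbb{R})$
by showing that $g^{(n)}(0) = 0$ for every value $n \in \mathbb{N}$.

In such a case the nonnegative function

$$G(x) = \begin{cases} e^{-(x-1)^{-2}} \cdot e^{-(x+1)^{-2}} & \text{for } |x| < 1, \\ 0 & \text{for } |x| \ge 1 \end{cases}$$

also belongs to $C^{(\infty)}(\mathbb{R}, \mathbb{R})$, and along with it the function

$$F(x) = \int_{-\infty}^{x} G(t)\,dt \Big/ \int_{-\infty}^{+\infty} G(t)\,dt$$

belongs to this class, since $F'(x) = G(x) \big/ \int_{-\infty}^{+\infty} G(t)\,dt$.

The function F is strictly increasing on $[-1, 1]$, $F(x) \equiv 0$ for $x \le -1$, and
$F(x) \equiv 1$ for $x \ge 1$.

We can now take the required function to be

$$f(x) = F(x + 2) + F(2 - x) - 1.$$

Indeed, $F(x + 2)$ equals 0 for $x \le -3$ and 1 for $x \ge -1$, while $F(2 - x)$ equals 1 for
$x \le 1$ and 0 for $x \ge 3$, so that $f \equiv 1$ on $[-1, 1]$, $f \equiv 0$ for $|x| \ge 3$, and $0 < f < 1$ in between. □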
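The construction in the proof can be checked numerically. The sketch below is an illustration only: it evaluates G in closed form, approximates F by a composite Simpson rule (the helper `integrate` and the step count are choices made here, not part of the text), and uses the combination $f(x) = F(x + 2) + F(2 - x) - 1$, which equals 1 on $[-1, 1]$ and 0 for $|x| \ge 3$.

```python
import math


def G(x):
    """Smooth nonnegative function supported on [-1, 1]."""
    if abs(x) >= 1.0:
        return 0.0
    return math.exp(-(x - 1.0) ** -2 - (x + 1.0) ** -2)


def integrate(func, a, b, steps=2000):
    """Composite Simpson rule on [a, b]; steps must be even."""
    h = (b - a) / steps
    total = func(a) + func(b)
    for k in range(1, steps):
        total += (4 if k % 2 else 2) * func(a + k * h)
    return total * h / 3.0


TOTAL = integrate(G, -1.0, 1.0)  # approximates the integral of G over the line


def F(x):
    """Smooth step: 0 for x <= -1, 1 for x >= 1, increasing in between."""
    if x <= -1.0:
        return 0.0
    if x >= 1.0:
        return 1.0
    return integrate(G, -1.0, x) / TOTAL


def f(x):
    """Plateau function: 1 on [-1, 1], 0 outside (-3, 3)."""
    return F(x + 2.0) + F(2.0 - x) - 1.0


for x in (-3.5, -2.0, 0.0, 1.0, 2.5, 3.0):
    print(x, f(x))
```

Since G is even, $F(0) = 1/2$, so the printed values at $x = \pm 2$ should be close to one half, which gives a quick sanity check on the quadrature.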

Remark If $f : \mathbb{R} \to \mathbb{R}$ is the function constructed in the proof of the lemma, then
the function

$$\theta(x^1, \ldots, x^n) = f(x^1 - a^1) \cdot \ldots \cdot f(x^n - a^n)$$

defined in $\mathbb{R}^n$ is such that $\theta \in C^{(\infty)}(\mathbb{R}^n, \mathbb{R})$, $0 \le \theta \le 1$ at every point $x \in \mathbb{R}^n$,
$\theta(x) \equiv 1$ on the interval $I(a) = \{x \in \mathbb{R}^n \mid |x^i - a^i| \le 1,\ i = 1, \ldots, n\}$, and the
support $\operatorname{supp}\theta$ of the function $\theta$ is contained in the interval $\tilde I(a) = \{x \in \mathbb{R}^n \mid
|x^i - a^i| \le 3,\ i = 1, \ldots, n\}$.



Definition 18 Let M be a $C^{(k)}$-manifold and X a subset of M. The system $E =
\{e_\alpha, \alpha \in A\}$ of functions $e_\alpha \in C^{(k)}(M, \mathbb{R})$ is a $C^{(k)}$ partition of unity on X if

1° $0 \le e_\alpha(x) \le 1$ for every function $e_\alpha \in E$ and every $x \in M$;

2° each point $x \in X$ has a neighborhood $U(x)$ in M such that all but a finite
number of functions of E are identically zero on $U(x)$;

3° $\sum_{e_\alpha \in E} e_\alpha(x) \equiv 1$ on X.

We remark that by condition 2° only a finite number of terms in this last sum are
nonzero at each point $x \in X$.

Definition 19 Let $O = \{o_\beta, \beta \in B\}$ be an open covering of $X \subset M$. We say that
the partition of unity $E = \{e_\alpha, \alpha \in A\}$ on X is subordinate to the covering O if the
support of each function in the system E is contained in at least one of the sets of
the system O.

Proposition 6 Let $\{(U_i, \varphi_i),\ i = 1, \ldots, m\}$ be a finite set of charts of some $C^{(k)}$
atlas of the manifold M, whose ranges $U_i$, $i = 1, \ldots, m$, form a covering of a compact
set $K \subset M$. Then there exists a $C^{(k)}$ partition of unity on K subordinate to the
covering $\{U_i,\ i = 1, \ldots, m\}$.

Proof For any point $x_0 \in K$ we first carry out the following construction. We choose
successively a domain $U_i$ containing $x_0$ corresponding to a chart $\varphi_i : \mathbb{R}^n \to U_i$ (or
$\varphi_i : H^n \to U_i$), the point $t_0 = \varphi_i^{-1}(x_0) \in \mathbb{R}^n$ (or $H^n$), the function $\theta(t - t_0)$ (where
$\theta(t)$ is the function shown in the remark to the lemma), and the restriction $\theta_{t_0}$ of
$\theta(t - t_0)$ to the parameter domain of $\varphi_i$.

Let $I_{t_0}$ be the intersection of the unit cube centered at $t_0 \in \mathbb{R}^n$ with the parameter
domain of $\varphi_i$. Actually $\theta_{t_0}$ differs from $\theta(t - t_0)$ and $I_{t_0}$ differs from the
corresponding unit cube only when the parameter domain of the chart $\varphi_i$ is the
half-space $H^n$. The open sets $\varphi_i(I_t)$ constructed at each point $x \in K$ and the point
$t = \varphi_i^{-1}(x)$, taken for all admissible values of $i = 1, 2, \ldots, m$, form an open covering
of the compact set K. Let $\{\varphi_{i_j}(I_{t_j}),\ j = 1, 2, \ldots, l\}$ be a finite covering of K
extracted from it. It is obvious that $\varphi_{i_j}(I_{t_j}) \subset U_{i_j}$. We define on $U_{i_j}$ the function
$\theta_j(x) = \theta_{t_j} \circ \varphi_{i_j}^{-1}(x)$. We then extend $\theta_j(x)$ to the entire manifold M by setting the
function equal to zero outside $U_{i_j}$. We retain the previous notation $\theta_j$ for this function
extended to M. By construction $\theta_j \in C^{(k)}(M, \mathbb{R})$, $\operatorname{supp}\theta_j \subset U_{i_j}$, $0 \le \theta_j(x) \le 1$
on M, and $\theta_j(x) \equiv 1$ on $\varphi_{i_j}(I_{t_j}) \subset U_{i_j}$. Then the functions $e_1(x) = \theta_1(x)$, $e_2(x) =
\theta_2(x)(1 - \theta_1(x))$, ..., $e_l(x) = \theta_l(x) \cdot (1 - \theta_{l-1}(x)) \cdot \ldots \cdot (1 - \theta_1(x))$ form the required
partition of unity. We shall verify only that $\sum_{j=1}^{l} e_j(x) \equiv 1$ on K, since the system
of functions $\{e_1, \ldots, e_l\}$ obviously satisfies the other conditions required of a partition
of unity on K subordinate to the covering $\{U_{i_1}, \ldots, U_{i_l}\} \subset \{U_i,\ i = 1, \ldots, m\}$.
But

$$1 - \sum_{j=1}^{l} e_j(x) = (1 - \theta_1(x)) \cdot \ldots \cdot (1 - \theta_l(x)) \equiv 0 \quad \text{on } K,$$

since each point $x \in K$ is covered by some set $\varphi_{i_j}(I_{t_j})$ on which the corresponding
function $\theta_j$ is identically equal to 1. □
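The telescoping construction $e_1 = \theta_1$, $e_j = \theta_j (1 - \theta_{j-1}) \cdots (1 - \theta_1)$ can be tried out on the real line. The following is a minimal sketch under stated assumptions: the manifold is $\mathbb{R}$ with a single chart, the bump `theta` is built from a standard smooth step rather than from the function F of the lemma (a substitution made here for brevity), and the names `smooth_step`, `theta`, `partition_of_unity` are hypothetical.

```python
import math


def smooth_step(t):
    """C-infinity step: 0 for t <= 0, 1 for t >= 1, strictly between otherwise."""
    if t <= 0.0:
        return 0.0
    if t >= 1.0:
        return 1.0
    a = math.exp(-1.0 / t)
    b = math.exp(-1.0 / (1.0 - t))
    return a / (a + b)


def theta(x, centre):
    """Bump equal to 1 for |x - centre| <= 1 and 0 for |x - centre| >= 3."""
    return smooth_step((3.0 - abs(x - centre)) / 2.0)


def partition_of_unity(centres):
    """Return e_1, ..., e_l with e_j = theta_j * prod_{i<j} (1 - theta_i)."""
    def make(j):
        def e(x):
            value = theta(x, centres[j])
            for i in range(j):
                value *= 1.0 - theta(x, centres[i])
            return value
        return e
    return [make(j) for j in range(len(centres))]


# The sets where the bumps equal 1 are [-1, 1], [1, 3], [3, 5]; they cover K = [-1, 5].
functions = partition_of_unity([0.0, 2.0, 4.0])
for x in (-1.0, 0.5, 1.5, 3.7, 5.0, 9.0):
    print(x, sum(e(x) for e in functions))
```

On K the printed sums should equal 1 up to rounding, in agreement with the identity $1 - \sum_j e_j = \prod_j (1 - \theta_j)$, while far from K all the functions vanish.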

Corollary 1 If M is a compact manifold and A a $C^{(k)}$ atlas on M, then there exists
a finite partition of unity $\{e_1, \ldots, e_l\}$ on M subordinate to a covering of the manifold
by the ranges of the charts of A.

Proof Since M is compact, the atlas A can be regarded as finite. We now have the 
hypotheses of Proposition 6, if we set $K = M$ in it. □

Corollary 2 For every compact set K contained in a manifold M and every open
set $G \subset M$ containing K, there exists a function $f : M \to \mathbb{R}$ with smoothness equal
to that of the manifold and such that $f(x) \equiv 1$ on K and $\operatorname{supp} f \subset G$.

Proof Cover each point $x \in K$ by a neighborhood $U(x)$ contained in G and inside
the range of some chart of the manifold M. From the open covering $\{U(x),\ x \in K\}$
of the compact set K extract a finite covering, and construct a partition of unity
$\{e_1, \ldots, e_l\}$ on K subordinate to it. The function $f = \sum_{i=1}^{l} e_i$ is the one required. □

Corollary 3 Every (abstractly defined) compact smooth n-dimensional manifold M
is diffeomorphic to some compact smooth surface contained in $\mathbb{R}^N$ of sufficiently
large dimension N.

Proof So as not to complicate the idea of the proof with inessential details, we carry
it out for the case of a compact manifold M without boundary. In that case there is
a finite smooth atlas $A = \{\varphi_i : I \to U_i,\ i = 1, \ldots, m\}$ on M, where I is an open
n-dimensional cube in $\mathbb{R}^n$. We take a slightly smaller cube $I'$ such that $\bar I' \subset I$ and the
set $\{U_i' = \varphi_i(I'),\ i = 1, \ldots, m\}$ still forms a covering of M. Setting $K = \bar I'$, $G = I$,
and $M = \mathbb{R}^n$ in Corollary 2, we construct a function $f \in C^{(\infty)}(\mathbb{R}^n, \mathbb{R})$ such that
$f(t) \equiv 1$ for $t \in I'$ and $\operatorname{supp} f \subset I$.

We now consider the coordinate functions $t_i^1(x), \ldots, t_i^n(x)$ of the mappings $\varphi_i^{-1} :
U_i \to I$, $i = 1, \ldots, m$, and use them to introduce the following functions on M:

$$y_i^k(x) = \begin{cases} (f \circ \varphi_i^{-1})(x) \cdot t_i^k(x) & \text{for } x \in U_i, \\ 0 & \text{for } x \notin U_i, \end{cases}
\qquad i = 1, \ldots, m;\ k = 1, \ldots, n.$$

At every point $x \in M$ the rank of the mapping $M \ni x \mapsto y(x) = (y_1^1, \ldots, y_1^n, \ldots,
y_m^1, \ldots, y_m^n)(x) \in \mathbb{R}^{m \cdot n}$ is maximal and equal to n. Indeed, if $x \in U_i'$, then $\varphi_i^{-1}(x) =
t \in I'$, $f \circ \varphi_i^{-1}(x) = 1$, and $y_i^k(\varphi_i(t)) = t^k$, $k = 1, \ldots, n$.

If finally, we consider the mapping $M \ni x \mapsto Y(x) = (y(x), f \circ \varphi_1^{-1}(x), \ldots, f \circ
\varphi_m^{-1}(x)) \in \mathbb{R}^{m \cdot n + m}$, setting $f \circ \varphi_i^{-1}(x) \equiv 0$ outside $U_i$, $i = 1, \ldots, m$, then this mapping,
on the one hand will obviously have the same rank n as the mapping $x \mapsto y(x)$;
on the other hand it will be demonstrably a one-to-one mapping of M onto the image
of M in $\mathbb{R}^{m \cdot n + m}$. Let us verify this last assertion. Let p, q be different points
of M. We find a domain $U_i'$ from the system $\{U_i',\ i = 1, \ldots, m\}$ covering M that
contains the point p. Then $f \circ \varphi_i^{-1}(p) = 1$. If $f \circ \varphi_i^{-1}(q) < 1$, then $Y(p) \neq Y(q)$.
If $f \circ \varphi_i^{-1}(q) = 1$, then $p, q \in U_i$, $y_i^k(p) = t_i^k(p)$, $y_i^k(q) = t_i^k(q)$, and $t_i^k(p) \neq t_i^k(q)$
for at least one value of $k \in \{1, \ldots, n\}$. That is, $Y(p) \neq Y(q)$ in this case. □

For information on the general Whitney embedding theorem for an arbitrary
manifold as a surface in $\mathbb{R}^n$ the reader may consult the specialized geometric
literature.


15.2.5 Problems and Exercises 

1. Verify that the object (a manifold) introduced by Definition 1 does not change if
we require only that each point $x \in M$ have a neighborhood $U(x) \subset M$ homeomorphic
to an open subset of the half-space $H^n$.

2. Show that 

a) the manifold $GL(n, \mathbb{R})$ of Example 6 is noncompact and has exactly two
connected components;

b) the manifold $SO(n, \mathbb{R})$ (see Example 7) is connected;

c) the manifold $O(n, \mathbb{R})$ is compact and has exactly two connected components.

3. Let $(M, A)$ and $(\tilde M, \tilde A)$ be manifolds with smooth structures of the same degree
of smoothness $C^{(k)}$ on them. The smooth manifolds $(M, A)$ and $(\tilde M, \tilde A)$ (smooth
structures) are considered isomorphic if there exists a $C^{(k)}$ mapping $f : M \to \tilde M$
having a $C^{(k)}$ inverse $f^{-1} : \tilde M \to M$ in the atlases $A$, $\tilde A$.

a) Show that all structures of the same smoothness on $\mathbb{R}^1$ are isomorphic.

b) Verify the assertions made in Example 11, and determine whether they con- 
tradict a). 

c) Show that on the circle $S^1$ (the one-dimensional sphere) any two $C^{(\infty)}$ structures
are isomorphic. We note that this assertion remains valid for spheres of dimension
not larger than 6, but on $S^7$, as Milnor$^4$ has shown, there exist nonisomorphic
$C^{(\infty)}$ structures.

4. Let S be a subset of an n-dimensional manifold M such that for every point
$x_0 \in S$ there exists a chart $x = \varphi(t)$ of the manifold M whose range U contains $x_0$,
and the k-dimensional surface defined by the relations $t^{k+1} = 0, \ldots, t^n = 0$ corresponds
to the set $S \cap U$ in the parameter domain $t = (t^1, \ldots, t^n)$ of $\varphi$. In this case
S is called a k-dimensional submanifold of M.

a) Show that a k-dimensional manifold structure naturally arises on S, induced
by the structure of M and having the same smoothness as the manifold M.

b) Verify that the k-dimensional surfaces S in $\mathbb{R}^n$ are precisely the k-dimensional
submanifolds of $\mathbb{R}^n$.


4 J. Milnor (b. 1931) - one of the most outstanding modern American mathematicians; his main
works are in algebraic topology and the topology of manifolds.



c) Show that under a smooth homeomorphic mapping $f : \mathbb{R}^1 \to T^2$ of the line
$\mathbb{R}^1$ into the torus $T^2$ the image $f(\mathbb{R}^1)$ may be an everywhere dense subset of $T^2$
and in that case will not be a one-dimensional submanifold of the torus, although it
will be an abstract one-dimensional manifold.

d) Verify that the extent of the concept “submanifold” does not change if we
consider $S \subset M$ a k-dimensional submanifold of the n-dimensional manifold M
when there exists a local chart of the manifold M whose range U contains $x_0$ for every
point $x_0 \in S$ and some k-dimensional surface of the space $\mathbb{R}^n$ corresponds to the set
$S \cap U$ in the parameter domain of the chart.

5. Let X be a Hausdorff topological space (manifold) and G the group of homeomorphic
transformations of X. The group G is a discrete group of transformations
of X if for every two (possibly equal) points $x_1, x_2 \in X$ there exist neighborhoods
$U_1$ and $U_2$ of them respectively, such that the set $\{g \in G \mid g(U_1) \cap U_2 \neq \varnothing\}$ is finite.

a) It follows from this that the orbit $\{g(x) \in X \mid g \in G\}$ of every point $x \in X$ is
discrete, and the stabilizer $G_x = \{g \in G \mid g(x) = x\}$ of every point $x \in X$ is finite.

b) Verify that if G is a group of isometries of a metric space, having the two 
properties in a), then G is a discrete group of transformations of X. 

c) Introduce the natural topological space (manifold) structure on the set X/G 
of orbits of the discrete group G . 

d) A closed subset F of the topological space (manifold) X with a discrete group 
G of transformations is a fundamental domain of the group G if it is the closure of 
an open subset of X and the sets $g(F)$, where $g \in G$, have no interior points in
common and form a locally finite covering of X. Show using Examples 8-10 how 
the quotient space X/G (of orbits) of the group G can be obtained from F by 
“gluing” certain boundary points. 

6. a) Using the construction of Examples 12 and 13, construct n-dimensional projective
space $\mathbb{RP}^n$.

b) Show that $\mathbb{RP}^n$ is orientable if n is odd and nonorientable if n is even.

c) Verify that the manifolds $SO(3, \mathbb{R})$ and $\mathbb{RP}^3$ are homeomorphic.

7. Verify that the manifold constructed in Example 14 is indeed homeomorphic to 
the Mobius band. 

8. a) A Lie group$^5$ is a group G endowed with the structure of an analytic manifold
such that the mappings $(g_1, g_2) \mapsto g_1 \cdot g_2$ and $g \mapsto g^{-1}$ are analytic mappings of
$G \times G$ and G into G. Show that the manifolds in Examples 6 and 7 are Lie groups.

b) A topological group (or continuous group) is a group G endowed with the 
structure of a topological space such that the group operations of multiplication and 
inversion are continuous as mappings $G \times G \to G$ and $G \to G$ in the topology
of G. Using the example of the group Q of rational numbers show that not every 
topological group is a Lie group. 


5 S. Lie (1842-1899) - outstanding Norwegian mathematician, creator of the theory of continuous 
groups (Lie groups), which is now of fundamental importance in geometry, topology, and the math- 
ematical methods of physics; one of the winners of the International Lobachevskii Prize (awarded 
in 1897 for his work in applying group theory to the foundations of geometry). 

c) Show that every Lie group is a topological group in the sense of the definition
given in b).

d) It has been proved$^6$ that every topological group G that is a manifold is a
Lie group (that is, as a manifold G admits an analytic structure in which the group 
becomes a Lie group). Show that every group manifold (that is, every Lie group) is 
an orientable manifold. 

9. A system of subsets of a topological space is locally finite if each point of the 
space has a neighborhood intersecting only a finite number of sets in the system. In 
particular, one may speak of a locally finite covering of a space. 

A system of sets is said to be a refinement of a second system if every set of the 
first system is contained in at least one of the sets of the second system. In particular 
it makes sense to speak of one covering of a set being a refinement of another. 

a) Show that every open covering of $\mathbb{R}^n$ has a locally finite refinement.

b) Solve problem a) with $\mathbb{R}^n$ replaced by an arbitrary manifold M.

c) Show that there exists a partition of unity on $\mathbb{R}^n$ subordinate to any preassigned
open covering of $\mathbb{R}^n$.

d) Verify that assertion c) remains valid for an arbitrary manifold. 


15.3 Differential Forms and Integration on Manifolds 
15.3.1 The Tangent Space to a Manifold at a Point 

We recall that to each smooth path $\mathbb{R} \ni t \overset{\gamma}{\longmapsto} x(t) \in \mathbb{R}^n$ (a motion in $\mathbb{R}^n$) passing
through the point $x_0 = x(t_0) \in \mathbb{R}^n$ at time $t_0$ we have assigned the instantaneous
velocity vector $\xi = (\xi^1, \ldots, \xi^n)$: $\xi = \dot x(t_0) = (\dot x^1, \ldots, \dot x^n)(t_0)$. The set of all such
vectors $\xi$ attached to the point $x_0 \in \mathbb{R}^n$ is naturally identified with the arithmetic
space $\mathbb{R}^n$ and is denoted $T\mathbb{R}^n_{x_0}$ (or $T_{x_0}(\mathbb{R}^n)$). In $T\mathbb{R}^n_{x_0}$ one introduces the same vector
operations on elements $\xi \in T\mathbb{R}^n_{x_0}$ as on the corresponding elements of the vector
space $\mathbb{R}^n$. In this way a vector space $T\mathbb{R}^n_{x_0}$ arises, called the tangent space to $\mathbb{R}^n$ at
the point $x_0 \in \mathbb{R}^n$.

Forgetting about motivation and introductory considerations, we can now say
that formally $T\mathbb{R}^n_{x_0}$ is a pair $(x_0, \mathbb{R}^n)$ consisting of a point $x_0 \in \mathbb{R}^n$ and a copy of the
vector space $\mathbb{R}^n$ attached to it.

Now let M be a smooth n-dimensional manifold with an atlas A of at least $C^{(1)}$
smoothness. We wish to define a tangent vector $\xi$ and a tangent space $TM_{p_0}$ to the
manifold M at a point $p_0 \in M$.

To do this we use the interpretation of the tangent vector as the instantaneous velocity
of a motion. We take a smooth path $\mathbb{R} \ni t \overset{\gamma}{\longmapsto} p(t) \in M$ on the manifold M
passing through the point $p_0 = p(t_0) \in M$ at time $t_0$. The parameters of charts (that
is, local coordinates) of the manifold M will be denoted by the letter x here, with the
subscript of the corresponding chart and a superscript giving the number of the coordinate.
Thus, in the parameter domain of each chart $(U_i, \varphi_i)$ whose range $U_i$ contains
$p_0$, the path $t \overset{\gamma_i}{\longmapsto} \varphi_i^{-1} \circ p(t) = x_i(t) \in \mathbb{R}^n$ (or $H^n$) corresponds to the path $\gamma$.
This path is smooth by definition of the smooth mapping $\mathbb{R} \ni t \overset{\gamma}{\longmapsto} p(t) \in M$.


6 This is the solution to Hilbert's fifth problem.

Thus, in the parameter domain of the chart $(U_i, \varphi_i)$, where $\varphi_i$ is a mapping $p =
\varphi_i(x_i)$, there arises a point $x_i(t_0) = \varphi_i^{-1}(p_0)$ and a vector $\xi_i = \dot x_i(t_0) \in T\mathbb{R}^n_{x_i(t_0)}$.
In another such chart $(U_j, \varphi_j)$ these objects will be respectively the point $x_j(t_0) =
\varphi_j^{-1}(p_0)$ and the vector $\xi_j = \dot x_j(t_0) \in T\mathbb{R}^n_{x_j(t_0)}$. It is natural to regard these as the
coordinate expressions in different charts of what we would like to call a tangent
vector $\xi$ to the manifold M at the point $p_0 \in M$.

Between the coordinates $x_i$ and $x_j$ there are smooth mutually inverse transition
mappings

$$x_i = \varphi_{ji}(x_j), \qquad x_j = \varphi_{ij}(x_i), \tag{15.30}$$

as a result of which the pairs $(x_i(t_0), \xi_i)$, $(x_j(t_0), \xi_j)$ turn out to be connected by the
relations

$$x_i(t_0) = \varphi_{ji}(x_j(t_0)), \qquad x_j(t_0) = \varphi_{ij}(x_i(t_0)), \tag{15.31}$$

$$\xi_i = \varphi'_{ji}(x_j(t_0))\,\xi_j, \qquad \xi_j = \varphi'_{ij}(x_i(t_0))\,\xi_i. \tag{15.32}$$

Equality (15.32) obviously follows from the formulas

$$\dot x_i(t) = \varphi'_{ji}(x_j(t))\,\dot x_j(t), \qquad \dot x_j(t) = \varphi'_{ij}(x_i(t))\,\dot x_i(t),$$

obtained from (15.30) by differentiation.

Definition 1 We shall say that a tangent vector $\xi$ to the manifold M at the point
$p \in M$ is defined if a vector $\xi_i$ is fixed in each space $T\mathbb{R}^n_{x_i}$ tangent to $\mathbb{R}^n$ at the point
$x_i$ corresponding to p in the parameter domain of a chart $(U_i, \varphi_i)$, where $U_i \ni p$, in
such a way that (15.32) holds.

If the elements of the Jacobian matrix $\varphi'_{ji}$ of the mapping $\varphi_{ji}$ are written out
explicitly as $\frac{\partial x_i^k}{\partial x_j^m}$, we find the following explicit form for the connection between
the two coordinate representations of a given vector $\xi$:

$$\xi_i^k = \sum_{m=1}^{n} \frac{\partial x_i^k}{\partial x_j^m}\,\xi_j^m, \qquad k = 1, 2, \ldots, n, \tag{15.33}$$

where the partial derivatives are computed at the point $x_j = \varphi_j^{-1}(p)$ corresponding
to p.
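The transformation law (15.33) can be illustrated numerically. The sketch below is an example chosen here, not taken from the text: the role of chart j is played by polar coordinates $(r, \varphi)$ on the punctured plane, chart i is Cartesian, and the path and all function names are hypothetical. The velocity in polar coordinates is carried over by the Jacobian matrix and compared with a finite-difference velocity of the same path in Cartesian coordinates.

```python
import math


def transition(r, phi):
    """Transition mapping x_i = phi_ji(x_j): polar -> Cartesian coordinates."""
    return (r * math.cos(phi), r * math.sin(phi))


def jacobian(r, phi):
    """Jacobian matrix of the transition, entries d x_i^k / d x_j^m."""
    return [[math.cos(phi), -r * math.sin(phi)],
            [math.sin(phi), r * math.cos(phi)]]


def apply_matrix(matrix, vector):
    """xi_i^k = sum over m of (d x_i^k / d x_j^m) * xi_j^m, as in (15.33)."""
    return [sum(matrix[k][m] * vector[m] for m in range(len(vector)))
            for k in range(len(matrix))]


def polar_path(t):
    """A sample path written in the polar chart: r(t) = 1 + t, phi(t) = 2t."""
    return (1.0 + t, 2.0 * t)


t0 = 0.3
r0, phi0 = polar_path(t0)
xi_polar = [1.0, 2.0]  # velocity of the sample path in polar coordinates
xi_cartesian = apply_matrix(jacobian(r0, phi0), xi_polar)

# Independent check: central difference of the same path in Cartesian coordinates.
h = 1e-6
ahead = transition(*polar_path(t0 + h))
behind = transition(*polar_path(t0 - h))
xi_numeric = [(a - b) / (2.0 * h) for a, b in zip(ahead, behind)]

print(xi_cartesian)
print(xi_numeric)
```

The two printed vectors should agree to within the accuracy of the finite difference, which is the content of (15.32) for this pair of charts.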

We denote by $TM_p$ the set of all tangent vectors to the manifold M at the point
$p \in M$.

Definition 2 If we introduce a vector-space structure on the set $TM_p$ by identifying
$TM_p$ with the corresponding space $T\mathbb{R}^n_{x_i}$ (or $TH^n_{x_i}$), that is, the sum of vectors in
$TM_p$ is regarded as the vector whose coordinate representation in $T\mathbb{R}^n_{x_i}$ (or $TH^n_{x_i}$)
corresponds to the sum of the coordinate representations of the terms, and multiplication
of a vector by a scalar is defined analogously, the vector space so obtained is
usually denoted either $TM_p$ or $T_pM$, and is called the tangent space to the manifold
M at the point $p \in M$.

It can be seen from formulas (15.32) and (15.33) that the vector-space structure
introduced in $TM_p$ is independent of the choice of individual chart, that is,
Definition 2 is unambiguous in that sense.

Thus we have now defined the tangent space to a manifold. There are various 
interpretations of a tangent vector and the tangent space (see Problem 1). For exam- 
ple, one such interpretation is to identify a tangent vector with a linear functional. 
This identification is based on the following observation, which we make in $\mathbb{R}^n$.

Each vector $\xi \in T\mathbb{R}^n_{x_0}$ is the velocity vector corresponding to some smooth path
$x = x(t)$, that is, $\xi = \dot x(t)|_{t=t_0}$ with $x_0 = x(t_0)$. This makes it possible to define the
derivative $D_\xi f(x_0)$ of a smooth function f defined on $\mathbb{R}^n$ (or in a neighborhood of
$x_0$) with respect to the vector $\xi \in T\mathbb{R}^n_{x_0}$. To be specific,

$$D_\xi f(x_0) := \frac{d}{dt}(f \circ x)(t)\Big|_{t=t_0}, \tag{15.34}$$

that is,

$$D_\xi f(x_0) = f'(x_0)\,\xi, \tag{15.35}$$

where $f'(x_0)$ is the tangent mapping to f (the differential of f) at a point $x_0$.

The functional $D_\xi : C^{(1)}(\mathbb{R}^n, \mathbb{R}) \to \mathbb{R}$ assigned to the vector $\xi \in T\mathbb{R}^n_{x_0}$ by the
formulas (15.34) and (15.35) is obviously linear with respect to f. It is also clear
from (15.35) that for a fixed function f the quantity $D_\xi f(x_0)$ is a linear function
of $\xi$, that is, the sum of the corresponding linear functionals corresponds to a sum
of vectors, and multiplication of a functional $D_\xi$ by a number corresponds to multiplying
the vector $\xi$ by the same number. Thus there is an isomorphism between the
vector space $T\mathbb{R}^n_{x_0}$ and the vector space of corresponding linear functionals $D_\xi$. It
remains only to define the linear functional $D_\xi$ by exhibiting a set of characteristic
properties of it, in order to obtain a new interpretation of the tangent space $T\mathbb{R}^n_{x_0}$,
which is of course isomorphic to the previous one.

We remark that, in addition to the linearity indicated above, the functional $D_\xi$
possesses the following property:

$$D_\xi(f \cdot g)(x_0) = D_\xi f(x_0) \cdot g(x_0) + f(x_0) \cdot D_\xi g(x_0). \tag{15.36}$$

This is the law for differentiating a product. 

In differential algebra an additive mapping $a \mapsto a'$ of a ring A satisfying
the relation $(a \cdot b)' = a' \cdot b + a \cdot b'$ is called a derivation (more precisely, a derivation
of the ring A). Thus the functional $D_\xi : C^{(1)}(\mathbb{R}^n, \mathbb{R}) \to \mathbb{R}$ is a derivation of the
ring $C^{(1)}(\mathbb{R}^n, \mathbb{R})$. But $D_\xi$ is also linear relative to the vector-space structure of
$C^{(1)}(\mathbb{R}^n, \mathbb{R})$.

One can verify that a linear functional $l : C^{(\infty)}(\mathbb{R}^n, \mathbb{R}) \to \mathbb{R}$ possessing the
properties

$$l(\alpha f + \beta g) = \alpha l(f) + \beta l(g), \tag{15.37}$$

$$l(f \cdot g) = l(f)\,g(x_0) + f(x_0)\,l(g), \tag{15.38}$$

has the form $D_\xi$, where $\xi \in T\mathbb{R}^n_{x_0}$. Thus the tangent space $T\mathbb{R}^n_{x_0}$ to $\mathbb{R}^n$ at $x_0$ can be
interpreted as a vector space of functionals (derivations) on $C^{(\infty)}(\mathbb{R}^n, \mathbb{R})$ satisfying
conditions (15.37) and (15.38).
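A brief sketch of why such a functional must have the form $D_\xi$ may be helpful. It is the standard argument based on an integral form of the first-order Taylor expansion, written out here as an aid and not quoted from the text; the auxiliary functions $g_k$ are notation introduced for this sketch.

```latex
\begin{align*}
  % Step 1: constants are annihilated, by (15.38) applied to 1 = 1 * 1 and by (15.37).
  l(1) &= l(1 \cdot 1) = l(1)\cdot 1 + 1 \cdot l(1) = 2\,l(1)
        \;\Longrightarrow\; l(1) = 0, \quad l(c) = c\,l(1) = 0 . \\
  % Step 2: write f with smooth coefficients g_k (f is assumed infinitely smooth).
  f(x) &= f(x_0) + \sum_{k=1}^{n} (x^k - x_0^k)\, g_k(x), \qquad
  g_k(x) = \int_0^1 \frac{\partial f}{\partial x^k}\bigl(x_0 + s(x - x_0)\bigr)\,ds, \qquad
  g_k(x_0) = \frac{\partial f}{\partial x^k}(x_0) . \\
  % Step 3: apply l, using (15.37), (15.38) and Step 1.
  l(f) &= \sum_{k=1}^{n} \Bigl( l(x^k - x_0^k)\, g_k(x_0) + (x_0^k - x_0^k)\, l(g_k) \Bigr)
        = \sum_{k=1}^{n} l(x^k)\, \frac{\partial f}{\partial x^k}(x_0)
        = D_\xi f(x_0), \qquad \xi^k := l(x^k) .
\end{align*}
```

The functions $g_k$ are one derivative less smooth than $f$, which is one reason the statement is formulated for the class $C^{(\infty)}$, where no smoothness is lost.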

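The functionals $D_{e_k} f(x_0) = \frac{\partial}{\partial x^k} f(x)\big|_{x=x_0}$ that compute the corresponding partial
derivative of the function f at $x_0$ correspond to the basis vectors $e_1, \ldots, e_n$ of the
space $T\mathbb{R}^n_{x_0}$. Thus, under the functional interpretation of $T\mathbb{R}^n_{x_0}$ one can say that the
functionals $\bigl\{\frac{\partial}{\partial x^1}, \ldots, \frac{\partial}{\partial x^n}\bigr\}\big|_{x=x_0}$ form a basis of $T\mathbb{R}^n_{x_0}$.

If $\xi = (\xi^1, \ldots, \xi^n) \in T\mathbb{R}^n_{x_0}$, then the operator $D_\xi$ corresponding to the vector $\xi$
has the form $D_\xi = \xi^k \frac{\partial}{\partial x^k}$.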

In a completely analogous manner the tangent vector $\xi$ to an n-dimensional $C^{(\infty)}$
manifold M at a point $p_0 \in M$ can be interpreted (or defined) as the element of
the space of derivations l on $C^{(\infty)}(M, \mathbb{R})$ having properties (15.37) and (15.38),
$x_0$ of course being replaced by $p_0$ in relation (15.38), so that the functional l is
connected with precisely the point $p_0 \in M$. Such a definition of the tangent vector
$\xi$ and the tangent space $TM_{p_0}$ does not formally require the invocation of any local
coordinates, and in that sense it is obviously invariant. In coordinates $(x_i^1, \ldots, x_i^n)$
of a local chart $(U_i, \varphi_i)$ the operator l has the form $\xi_i^1 \frac{\partial}{\partial x_i^1} + \cdots + \xi_i^n \frac{\partial}{\partial x_i^n} = D_{\xi_i}$.
The numbers $(\xi_i^1, \ldots, \xi_i^n)$ are naturally called the coordinates of the tangent vector
$l \in TM_{p_0}$ in coordinates of the chart $(U_i, \varphi_i)$. By the laws of differentiation, the
coordinate representations of the same functional $l \in TM_{p_0}$ in the charts $(U_i, \varphi_i)$,
$(U_j, \varphi_j)$ are connected by the relations

$$\sum_{k=1}^{n} \xi_i^k \frac{\partial}{\partial x_i^k} = \sum_{m=1}^{n} \xi_j^m \frac{\partial}{\partial x_j^m}
= \sum_{k=1}^{n} \Bigl( \sum_{m=1}^{n} \frac{\partial x_i^k}{\partial x_j^m}\,\xi_j^m \Bigr) \frac{\partial}{\partial x_i^k}, \tag{15.33'}$$

which of course duplicate (15.33).


15.3.2 Differential Forms on a Manifold 

Let us now consider the space $T^*M_p$ conjugate to the tangent space $TM_p$, that is,
$T^*M_p$ is the space of real-valued linear functionals on $TM_p$.

Definition 3 The space $T^*M_p$ conjugate to the tangent space $TM_p$ to the manifold
M at the point $p \in M$ is called the cotangent space to M at p.

If the manifold M is a $C^{(\infty)}$ manifold, $f \in C^{(\infty)}(M, \mathbb{R})$, and $l_\xi$ is the derivation
corresponding to the vector $\xi \in TM_p$, then for a fixed $f \in C^{(\infty)}(M, \mathbb{R})$ the mapping
$\xi \mapsto l_\xi f$ will obviously be an element of the space $T^*M_p$. In the case $M = \mathbb{R}^n$ we
obtain $\xi \mapsto D_\xi f(p) = f'(p)\xi$, so that the resulting mapping $\xi \mapsto l_\xi f$ is naturally
called the differential of the function f at p, and is denoted by the usual symbol
$df(p)$.

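If $T\mathbb{R}^n_{\varphi_\alpha^{-1}(p)}$ (or $TH^n_{\varphi_\alpha^{-1}(p)}$ when $p \in \partial M$) is the space corresponding to the
tangent space $TM_p$ in the chart $(U_\alpha, \varphi_\alpha)$ on the manifold M, it is natural to regard
the space $T^*\mathbb{R}^n_{\varphi_\alpha^{-1}(p)}$ conjugate to $T\mathbb{R}^n_{\varphi_\alpha^{-1}(p)}$ as the representative of the space
$T^*M_p$ in this local chart. In coordinates $(x_\alpha^1, \ldots, x_\alpha^n)$ of a local chart $(U_\alpha, \varphi_\alpha)$
the dual basis $\{dx^1, \ldots, dx^n\}$ in the conjugate space corresponds to the basis
$\bigl\{\frac{\partial}{\partial x_\alpha^1}, \ldots, \frac{\partial}{\partial x_\alpha^n}\bigr\}$ of $T\mathbb{R}^n_{\varphi_\alpha^{-1}(p)}$ (or $TH^n_{\varphi_\alpha^{-1}(p)}$ if $p \in \partial M$). We recall that $dx^i(\xi) = \xi^i$, so
that $dx^i\bigl(\frac{\partial}{\partial x^j}\bigr) = \delta^i_j$. The expressions for these dual bases in another chart $(U_\beta, \varphi_\beta)$
may turn out to be not so simple, for $\frac{\partial}{\partial x_\beta^j} = \frac{\partial x_\alpha^i}{\partial x_\beta^j} \frac{\partial}{\partial x_\alpha^i}$, $dx_\alpha^i = \frac{\partial x_\alpha^i}{\partial x_\beta^j}\,dx_\beta^j$.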

Definition 4 We say that a differential form $\omega^m$ of degree $m$ is defined on an
$n$-dimensional manifold $M$ if a skew-symmetric form $\omega^m(p) : (TM_p)^m \to \mathbb{R}$ is
defined on each tangent space $TM_p$ to $M$, $p \in M$.

In practice this means only that a corresponding $m$-form $\omega_\alpha(x_\alpha)$, where
$x_\alpha = \varphi_\alpha^{-1}(p)$, is defined on each space $T\mathbb{R}^n_{\varphi_\alpha^{-1}(p)}$ (or $TH^n_{\varphi_\alpha^{-1}(p)}$) corresponding
to $TM_p$ in the chart $(U_\alpha, \varphi_\alpha)$ of the manifold $M$. The fact that two such forms
$\omega_\alpha(x_\alpha)$ and $\omega_\beta(x_\beta)$ are representatives of the same form $\omega(p)$ can be expressed
by the relation

$$\omega_\alpha(x_\alpha)\bigl((\xi_1)_\alpha, \dots, (\xi_m)_\alpha\bigr) = \omega_\beta(x_\beta)\bigl((\xi_1)_\beta, \dots, (\xi_m)_\beta\bigr), \tag{15.39}$$

in which $x_\alpha$ and $x_\beta$ are the representatives of the point $p \in M$, and $(\xi_1)_\alpha, \dots, (\xi_m)_\alpha$
and $(\xi_1)_\beta, \dots, (\xi_m)_\beta$ are the coordinate representations of the vectors
$\xi_1, \dots, \xi_m \in TM_p$ in the charts $(U_\alpha, \varphi_\alpha)$, $(U_\beta, \varphi_\beta)$ respectively.

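In more formal notation this means that

$$x_\alpha = \varphi_{\beta\alpha}(x_\beta), \qquad x_\beta = \varphi_{\alpha\beta}(x_\alpha), \tag{15.31'}$$

$$\xi_\alpha = \varphi'_{\beta\alpha}(x_\beta)\, \xi_\beta, \qquad \xi_\beta = \varphi'_{\alpha\beta}(x_\alpha)\, \xi_\alpha, \tag{15.32'}$$

where, as usual, $\varphi_{\beta\alpha}$ and $\varphi_{\alpha\beta}$ are respectively the functions $\varphi_\alpha^{-1} \circ \varphi_\beta$ and
$\varphi_\beta^{-1} \circ \varphi_\alpha$ for the coordinate transitions, and the tangent mappings to them
$\varphi'_{\beta\alpha} =: (\varphi_{\beta\alpha})_*$, $\varphi'_{\alpha\beta} =: (\varphi_{\alpha\beta})_*$ provide an isomorphism of the tangent spaces
to $\mathbb{R}^n$ (or $H^n$) at the corresponding points $x_\alpha$ and $x_\beta$. As stated in Sect. 15.1.3,
the adjoint mappings $(\varphi'_{\beta\alpha})^* =: \varphi^*_{\beta\alpha}$ and $(\varphi'_{\alpha\beta})^* =: \varphi^*_{\alpha\beta}$ provide the transfer of
the forms, and the relation (15.39) means precisely that

$$\omega_\alpha(x_\alpha) = \varphi^*_{\alpha\beta}(x_\alpha)\, \omega_\beta(x_\beta), \tag{15.39'}$$

where $\alpha$ and $\beta$ are indices (which can be interchanged).

The matrix $(c_i^j)$ of the mapping $\varphi'_{\alpha\beta}(x_\alpha)$ is known: $(c_i^j) = \bigl(\frac{\partial x_\beta^j}{\partial x_\alpha^i}\bigr)(x_\alpha)$. Thus, if

$$\omega_\alpha(x_\alpha) = \sum_{1 \le i_1 < \cdots < i_m \le n} a_{i_1 \dots i_m}\, dx_\alpha^{i_1} \wedge \cdots \wedge dx_\alpha^{i_m} \tag{15.40}$$

and

$$\omega_\beta(x_\beta) = \sum_{1 \le j_1 < \cdots < j_m \le n} b_{j_1 \dots j_m}\, dx_\beta^{j_1} \wedge \cdots \wedge dx_\beta^{j_m}, \tag{15.41}$$

then according to Example 7 of Sect. 15.1 we find that

$$\sum_{1 \le i_1 < \cdots < i_m \le n} a_{i_1 \dots i_m}\, dx_\alpha^{i_1} \wedge \cdots \wedge dx_\alpha^{i_m} = \sum_{\substack{1 \le i_1 < \cdots < i_m \le n \\ 1 \le j_1 < \cdots < j_m \le n}} b_{j_1 \dots j_m}\, \frac{\partial(x_\beta^{j_1}, \dots, x_\beta^{j_m})}{\partial(x_\alpha^{i_1}, \dots, x_\alpha^{i_m})}(x_\alpha)\, dx_\alpha^{i_1} \wedge \cdots \wedge dx_\alpha^{i_m}, \tag{15.42}$$

where $\frac{\partial(\,\cdot\,)}{\partial(\,\cdot\,)}$, as always, denotes the determinant of the matrix of corresponding partial
derivatives.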

Thus different coordinate expressions for the same form $\omega$ can be obtained from
each other by direct substitution of the variables (expanding the corresponding
differentials of the coordinates followed by algebraic transformations in accordance
with the laws of exterior products).
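
As a small numerical illustration of this substitution rule (a sketch, not part of the original text), consider the familiar change of variables $x = r\cos\theta$, $y = r\sin\theta$ in the plane. Expanding $dx$ and $dy$ and applying the laws of exterior products gives $dx \wedge dy = r\, dr \wedge d\theta$, so the coefficient of a 2-form is multiplied by the Jacobian determinant, in agreement with (15.42). The function names below are illustrative assumptions, and the derivatives are approximated by central differences.

```python
import math

def polar(r, theta):
    """Coordinate change (r, theta) -> (x, y)."""
    return r * math.cos(theta), r * math.sin(theta)

def jacobian_det(phi, u, v, h=1e-6):
    """Determinant of the 2x2 matrix of partial derivatives of phi at (u, v),
    approximated by central differences."""
    xu = (phi(u + h, v)[0] - phi(u - h, v)[0]) / (2 * h)
    yu = (phi(u + h, v)[1] - phi(u - h, v)[1]) / (2 * h)
    xv = (phi(u, v + h)[0] - phi(u, v - h)[0]) / (2 * h)
    yv = (phi(u, v + h)[1] - phi(u, v - h)[1]) / (2 * h)
    return xu * yv - xv * yu

def pullback_coefficient(a, phi, u, v):
    """Coefficient of du ^ dv in phi^*(a(x, y) dx ^ dy)."""
    x, y = phi(u, v)
    return a(x, y) * jacobian_det(phi, u, v)

r, theta = 1.7, 0.4
print(jacobian_det(polar, r, theta))  # approximately r
print(pullback_coefficient(lambda x, y: x * x + y * y, polar, r, theta))  # approximately r**3
```

For the coefficient $a(x, y) = x^2 + y^2$ the transferred form is $r^3\, dr \wedge d\theta$, which is what the second printed value approximates.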

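If we agree to regard the form $\omega_\alpha$ as the transfer of a form $\omega$ defined on a manifold
to the parameter domain of the chart $(U_\alpha, \varphi_\alpha)$, it is natural to write $\omega_\alpha = \varphi_\alpha^* \omega$
and consider that $\omega_\alpha = \varphi_\alpha^* \circ (\varphi_\beta^{-1})^* \omega_\beta = \varphi^*_{\alpha\beta}\, \omega_\beta$, where the composition
$\varphi_\alpha^* \circ (\varphi_\beta^{-1})^*$ in this case plays the role of a formal elaboration of the mapping
$\varphi^*_{\alpha\beta} = (\varphi_\beta^{-1} \circ \varphi_\alpha)^*$.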


Definition 5 A differential $m$-form $\omega$ on an $n$-dimensional manifold $M$ is a $C^{(k)}$
form if the coefficients $a_{i_1 \dots i_m}(x_\alpha)$ of its coordinate representation

$$\omega_\alpha = \varphi_\alpha^* \omega = \sum_{1 \le i_1 < \cdots < i_m \le n} a_{i_1 \dots i_m}(x_\alpha)\, dx_\alpha^{i_1} \wedge \cdots \wedge dx_\alpha^{i_m}$$

are $C^{(k)}$ functions in every chart $(U_\alpha, \varphi_\alpha)$ of an atlas that defines a smooth structure
on $M$.


It is clear from (15.42) that Definition 5 is unambiguous if the manifold $M$ itself
is a $C^{(k+1)}$ manifold, for example if $M$ is a $C^{(\infty)}$ manifold.

For differential forms defined on a manifold the operations of addition, multiplication
by a scalar, and exterior multiplication are naturally defined pointwise. (In
particular, multiplication by a function $f : M \to \mathbb{R}$, which by definition is regarded
as a form of degree zero, is defined.) The first two of these operations turn the set
$\Omega_k^m$ of $m$-forms of class $C^{(k)}$ on $M$ into a vector space. In the case $k = \infty$ this
vector space is usually denoted $\Omega^m$. It is clear that exterior multiplication of forms
$\omega^{m_1} \in \Omega_k^{m_1}$ and $\omega^{m_2} \in \Omega_k^{m_2}$ yields a form $\omega^{m_1+m_2} = \omega^{m_1} \wedge \omega^{m_2} \in \Omega_k^{m_1+m_2}$.


15.3.3 The Exterior Derivative 

Definition 6 The exterior differential is the linear operator $d : \Omega_k^m \to \Omega_{k-1}^{m+1}$
possessing the following properties:

1° On every function $f \in \Omega_k^0$ the differential $d : \Omega_k^0 \to \Omega_{k-1}^1$ equals the usual
differential $df$ of this function.

2° $d(\omega^{m_1} \wedge \omega^{m_2}) = d\omega^{m_1} \wedge \omega^{m_2} + (-1)^{m_1} \omega^{m_1} \wedge d\omega^{m_2}$, where $\omega^{m_1} \in \Omega_k^{m_1}$ and
$\omega^{m_2} \in \Omega_k^{m_2}$.

3° $d^2 := d \circ d = 0$.

This last equality means that $d(d\omega)$ is zero for every form $\omega$.

Requirement 3° thus presumes that we are talking about forms whose smoothness
is at least $C^{(2)}$.

In practice this means that we are considering a $C^{(\infty)}$ manifold $M$ and the operator
$d$ mapping $\Omega^m$ to $\Omega^{m+1}$.

A formula for computing the operator $d$ in local coordinates of a specific chart
(and at the same time the uniqueness of the operator $d$) follows from the relation

$$d\Bigl(\sum_{1 \le i_1 < \cdots < i_m \le n} c_{i_1 \dots i_m}(x)\, dx^{i_1} \wedge \cdots \wedge dx^{i_m}\Bigr) = \sum_{1 \le i_1 < \cdots < i_m \le n} dc_{i_1 \dots i_m}(x) \wedge dx^{i_1} \wedge \cdots \wedge dx^{i_m} + \Bigl(\sum_{1 \le i_1 < \cdots < i_m \le n} c_{i_1 \dots i_m}\, d(dx^{i_1} \wedge \cdots \wedge dx^{i_m}) = 0\Bigr). \tag{15.43}$$

The existence of the operator $d$ now follows from the fact that the operator defined
by (15.43) in a local coordinate system satisfies conditions 1°, 2°, and 3° of
Definition 6.
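
The coordinate formula (15.43) is easy to mechanize. The following is a minimal sketch, assuming forms on $\mathbb{R}^n$ with integer polynomial coefficients; the data layout and the names `poly_diff`, `poly_add`, and `d` are hypothetical choices made for this illustration. A polynomial is stored as a dictionary from exponent tuples to coefficients, a form as a dictionary from increasing index tuples to polynomials, and the variables $x^1, \dots, x^n$ are indexed from 0 in the code.

```python
def poly_diff(poly, j):
    """Partial derivative in the j-th variable of a polynomial stored as
    {exponent tuple: coefficient}."""
    out = {}
    for exps, c in poly.items():
        if exps[j] > 0:
            e = list(exps)
            e[j] -= 1
            key = tuple(e)
            out[key] = out.get(key, 0) + c * exps[j]
    return {k: v for k, v in out.items() if v != 0}

def poly_add(p, q, sign=1):
    """Return p + sign * q, dropping zero coefficients."""
    out = dict(p)
    for k, v in q.items():
        out[k] = out.get(k, 0) + sign * v
        if out[k] == 0:
            del out[k]
    return out

def d(form, n):
    """Exterior derivative of a form {increasing index tuple: polynomial} on R^n,
    computed term by term as in (15.43)."""
    out = {}
    for idx, poly in form.items():
        for j in range(n):
            if j in idx:
                continue  # dx^j ^ dx^j = 0
            dp = poly_diff(poly, j)
            if not dp:
                continue
            # Moving dx^j into its sorted position costs one sign change
            # for every index of idx that is smaller than j.
            pos = sum(1 for i in idx if i < j)
            new_idx = idx[:pos] + (j,) + idx[pos:]
            out[new_idx] = poly_add(out.get(new_idx, {}), dp, -1 if pos % 2 else 1)
    return {k: v for k, v in out.items() if v}

# omega = x y z^2 dx + x^2 y dy on R^3
omega = {(0,): {(1, 1, 2): 1}, (1,): {(2, 1, 0): 1}}
d_omega = d(omega, 3)
print(d_omega)        # coefficients of dx^dy and dx^dz
print(d(d_omega, 3))  # {}
```

Here $d\omega = (2xy - xz^2)\, dx \wedge dy - 2xyz\, dx \wedge dz$, and applying `d` once more returns the empty dictionary, in agreement with property 3°.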

It follows in particular from what has been said that if $\omega_\alpha = \varphi_\alpha^* \omega$ and $\omega_\beta = \varphi_\beta^* \omega$
are the coordinate representations of the same form $\omega$, that is, $\omega_\alpha = \varphi^*_{\alpha\beta}\, \omega_\beta$, then
$d\omega_\alpha$ and $d\omega_\beta$ will also be the coordinate representations of the same form ($d\omega$),
that is, $d\omega_\alpha = \varphi^*_{\alpha\beta}\, d\omega_\beta$. Thus the relation $d(\varphi^*_{\alpha\beta}\, \omega_\beta) = \varphi^*_{\alpha\beta}(d\omega_\beta)$ holds, which in
abstract form asserts the commutativity

$$d\varphi^* = \varphi^* d \tag{15.44}$$

of the operator $d$ and the operation $\varphi^*$ that transfers forms.


15.3.4 The Integral of a Form over a Manifold

Definition 7 Let $M$ be an $n$-dimensional smooth oriented manifold on which the
coordinates $x^1, \dots, x^n$ and the orientation are defined by a single chart $\varphi_x : D_x \to M$
with parameter domain $D_x \subset \mathbb{R}^n$, and let $a(x)\, dx^1 \wedge \cdots \wedge dx^n$ be the coordinate
representation of the $n$-form $\omega$ in this chart. Then

$$\int_M \omega := \int_{D_x} a(x)\, dx^1 \wedge \cdots \wedge dx^n, \tag{15.45}$$

where the left-hand side is the integral of the form $\omega$ over the oriented manifold $M$
(the quantity being defined) and the right-hand side is the usual integral of the
function $a(x)$ over the domain $D_x$.

If $\varphi_t : D_t \to M$ is another atlas of $M$ consisting of a single chart defining the
same orientation on $M$ as $\varphi_x : D_x \to M$, then the Jacobian $\det \varphi'(t)$ of the function
$x = \varphi(t)$ of the coordinate change is everywhere positive in $D_t$. The form

$$\varphi^*\bigl(a(x)\, dx^1 \wedge \cdots \wedge dx^n\bigr) = a(x(t)) \det \varphi'(t)\, dt^1 \wedge \cdots \wedge dt^n$$

in $D_t$ corresponds to the form $\omega$. By the theorem on change of variables in a multiple
integral we have the equality

$$\int_{D_x} a(x)\, dx^1 \cdots dx^n = \int_{D_t} a(x(t)) \det \varphi'(t)\, dt^1 \cdots dt^n,$$

which shows that the left-hand side of (15.45) is independent of the coordinate system
chosen in $M$.

Thus, Definition 7 is unambiguous. 

Definition 8 The support of a form $\omega$ defined on a manifold $M$ is the closure of the
set of points $x \in M$ where $\omega(x) \ne 0$.

The support of a form $\omega$ is denoted by $\operatorname{supp} \omega$. In the case of 0-forms, that is,
functions, we have already encountered this concept. Outside the support the coordinate
representation of the form in any local coordinate system is the zero form of
the corresponding degree.

Definition 9 A form $\omega$ defined on a manifold $M$ is of compact support if $\operatorname{supp} \omega$ is
a compact subset of $M$.

Definition 10 Let $\omega$ be a form of degree $n$ and compact support on an
$n$-dimensional smooth manifold $M$ oriented by the atlas $A$. Let $\varphi_i : D_i \to U_i$,
$\{(U_i, \varphi_i),\ i = 1, \dots, m\}$ be a finite set of charts of the atlas $A$ whose ranges
$U_1, \dots, U_m$ cover $\operatorname{supp} \omega$, and let $e_1, \dots, e_k$ be a partition of unity subordinate to
that covering on $\operatorname{supp} \omega$. Repeating some charts several times if necessary, we can
assume that $m = k$, and that $\operatorname{supp} e_i \subset U_i$, $i = 1, \dots, m$.

The integral of a form $\omega$ of compact support over the oriented manifold $M$ is the
quantity

$$\int_M \omega := \sum_{i=1}^m \int_{D_i} \varphi_i^*(e_i \omega), \tag{15.46}$$

where $\varphi_i^*(e_i \omega)$ is the coordinate representation of the form $e_i \omega|_{U_i}$ in the domain $D_i$
of variation of the coordinates of the corresponding local chart.


Let us prove that this definition is unambiguous. 

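Proof Let $\tilde{A} = \{\tilde\varphi_j : \tilde{D}_j \to \tilde{U}_j\}$ be a second atlas defining the same smooth structure
and orientation on $M$ as the atlas $A$, let $\tilde{U}_1, \dots, \tilde{U}_{\tilde m}$ be the corresponding covering
of $\operatorname{supp} \omega$, and let $\tilde e_1, \dots, \tilde e_{\tilde m}$ be a partition of unity on $\operatorname{supp} \omega$ subordinate to this
covering. We introduce the functions $f_{ij} = e_i \tilde e_j$, $i = 1, \dots, m$, $j = 1, \dots, \tilde m$, and
we set $\omega_{ij} = f_{ij}\, \omega$.

We remark that $\operatorname{supp} \omega_{ij} \subset W_{ij} = U_i \cap \tilde{U}_j$. From this and from the fact that
Definition 7 of the integral over an oriented manifold given by a single chart is
unambiguous it follows that

$$\int_{D_i} \varphi_i^*(\omega_{ij}) = \int_{\varphi_i^{-1}(W_{ij})} \varphi_i^*(\omega_{ij}) = \int_{\tilde\varphi_j^{-1}(W_{ij})} \tilde\varphi_j^*(\omega_{ij}) = \int_{\tilde{D}_j} \tilde\varphi_j^*(\omega_{ij}).$$

Summing these equalities on $i$ from 1 to $m$ and on $j$ from 1 to $\tilde m$, taking account
of the relations $\sum_{i=1}^m f_{ij} = \tilde e_j$, $\sum_{j=1}^{\tilde m} f_{ij} = e_i$, we find the identities we are interested
in. □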



15.3.5 Stokes’ Formula 


Theorem Let $M$ be an oriented smooth $n$-dimensional manifold and $\omega$ a smooth
differential form of degree $n - 1$ and compact support on $M$. Then

$$\int_{\partial M} \omega = \int_M d\omega, \tag{15.47}$$

where the orientation of the boundary $\partial M$ of the manifold $M$ is induced by the
orientation of the manifold $M$. If $\partial M = \varnothing$, then $\int_M d\omega = 0$.


Proof Without loss of generality we may assume that the domains of variation of
the coordinates (parameters) of all local charts of the manifold $M$ are either the open
cube $I = \{x \in \mathbb{R}^n \mid 0 < x^i < 1,\ i = 1, \dots, n\}$, or the cube
$\tilde{I} = \{x \in \mathbb{R}^n \mid 0 < x^1 \le 1 \wedge 0 < x^i < 1,\ i = 2, \dots, n\}$ with one (definite!) face adjoined to the cube $I$.

By the partition of unity the assertion of the theorem reduces to the case when
$\operatorname{supp} \omega$ is contained in the range $U$ of a single chart of the form $\varphi : I \to U$ or
$\varphi : \tilde{I} \to U$. In the coordinates of this chart the form $\omega$ has the form

$$\omega = \sum_{i=1}^n a_i(x)\, dx^1 \wedge \cdots \wedge \overset{\frown}{dx^i} \wedge \cdots \wedge dx^n,$$

where the frown $\frown$, as usual, means that the corresponding factor is omitted.

By the linearity of the integral, it suffices to prove the assertion for one term of
the sum:

$$\omega_i = a_i(x)\, dx^1 \wedge \cdots \wedge \overset{\frown}{dx^i} \wedge \cdots \wedge dx^n. \tag{15.48}$$

The differential of such a form is the $n$-form

$$d\omega_i = (-1)^{i-1} \frac{\partial a_i}{\partial x^i}(x)\, dx^1 \wedge \cdots \wedge dx^n. \tag{15.49}$$

For a chart of the form $\varphi : I \to U$ both integrals in (15.47) of the corresponding
forms (15.48) and (15.49) are zero: the first because $\operatorname{supp} a_i \subset I$ and the second
for the same reason, if we take into account Fubini's theorem and the relation
$\int_0^1 \frac{\partial a_i}{\partial x^i}\, dx^i = a_i(1) - a_i(0) = 0$. This argument also covers the case when $\partial M = \varnothing$.

Thus it remains to verify (15.47) for a chart $\varphi : \tilde{I} \to U$.

If $i > 1$, both integrals are also zero for such a chart, which follows from the
reasoning given above.

And if $i = 1$, then

$$\int_M d\omega_1 = \int_U d\omega_1 = \int_{\tilde{I}} \frac{\partial a_1}{\partial x^1}(x)\, dx^1 \cdots dx^n = \int_0^1 \cdots \int_0^1 \Bigl( \int_0^1 \frac{\partial a_1}{\partial x^1}(x)\, dx^1 \Bigr) dx^2 \cdots dx^n = \int_0^1 \cdots \int_0^1 a_1(1, x^2, \dots, x^n)\, dx^2 \cdots dx^n = \int_{\partial U} \omega_1 = \int_{\partial M} \omega_1.$$

Thus formula (15.47) is proved for $n > 1$.

The case $n = 1$ is merely the Newton-Leibniz formula (the fundamental theorem
of calculus), if we assume that the endpoints $\alpha$ and $\beta$ of the oriented interval $[\alpha, \beta]$
are denoted $\alpha_-$ and $\beta_+$ and the integral of a 0-form $g(x)$ over such an oriented point
is equal to $-g(\alpha)$ and $+g(\beta)$ respectively. □
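
Formula (15.47) can be checked numerically in a simple case. The sketch below is an illustration under stated assumptions, not part of the proof: $M$ is taken to be the closed unit disk in $\mathbb{R}^2$, the 1-form is the hypothetical example $\omega = P\, dx + Q\, dy$ with $P = -xy^2$ and $Q = x^2 y + x$, so that $d\omega = (4xy + 1)\, dx \wedge dy$, and both integrals are approximated by midpoint sums. The integral over the disk is computed in polar coordinates, where $dx \wedge dy = r\, dr \wedge d\theta$; these coordinates miss only a set of measure zero.

```python
import math

def boundary_integral(P, Q, n=2000):
    """Integral of P dx + Q dy over the unit circle, traversed counterclockwise
    (midpoint rule in the angle t)."""
    total = 0.0
    h = 2 * math.pi / n
    for k in range(n):
        t = (k + 0.5) * h
        x, y = math.cos(t), math.sin(t)
        dx, dy = -math.sin(t), math.cos(t)  # derivatives of x(t), y(t)
        total += (P(x, y) * dx + Q(x, y) * dy) * h
    return total

def interior_integral(coeff, nr=400, nt=400):
    """Integral of coeff(x, y) dx ^ dy over the unit disk, computed in polar
    coordinates with the factor r coming from dx ^ dy = r dr ^ dtheta."""
    total = 0.0
    hr, ht = 1.0 / nr, 2 * math.pi / nt
    for i in range(nr):
        r = (i + 0.5) * hr
        for j in range(nt):
            t = (j + 0.5) * ht
            total += coeff(r * math.cos(t), r * math.sin(t)) * r * hr * ht
    return total

P = lambda x, y: -x * y * y
Q = lambda x, y: x * x * y + x
d_omega = lambda x, y: 4 * x * y + 1  # dQ/dx - dP/dy, computed by hand

lhs = boundary_integral(P, Q)
rhs = interior_integral(d_omega)
print(lhs, rhs)  # both approximate pi
```

Both sides equal $\pi$ here: the terms containing $xy$ integrate to zero by symmetry, and what remains is the area of the disk.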


We now make some remarks on this theorem. 


Remark 1 Nothing is said in the statement of the theorem about the smoothness
of the manifold $M$ and the form $\omega$. In such cases one usually assumes that each
of them is $C^{(\infty)}$. It is clear from the proof of the theorem, however, that formula
(15.47) is also true for forms of class $C^{(2)}$ on a manifold $M$ admitting a form of this
smoothness.

Remark 2 It is also clear from the proof of the theorem, as in fact it was already from
the formula (15.47), that if $\operatorname{supp} \omega$ is a compact set contained strictly inside $M$, that
is, $\operatorname{supp} \omega \cap \partial M = \varnothing$, then $\int_M d\omega = 0$.

Remark 3 If $M$ is a compact manifold, then for every form $\omega$ on $M$ the support
$\operatorname{supp} \omega$, being a closed subset of the compact set $M$, is compact. Consequently in this
case every form $\omega$ on $M$ is of compact support and Eq. (15.47) holds. In particular,
if $M$ is a compact manifold without boundary, then the equality $\int_M d\omega = 0$ holds
for every smooth form on $M$.


Remark 4 For arbitrary forms $\omega$ (not of compact support) on a manifold that is not
itself compact, formula (15.47) is in general not true.

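Let us consider, for example, the form $\omega = \frac{x\, dy - y\, dx}{x^2 + y^2}$ in a circular annulus
$M = \{(x, y) \in \mathbb{R}^2 \mid 1 \le x^2 + y^2 \le 2\}$, endowed with standard Cartesian coordinates. In
this case $M$ is a compact two-dimensional oriented manifold, whose boundary $\partial M$
consists of the two circles $C_i = \{(x, y) \in \mathbb{R}^2 \mid x^2 + y^2 = i\}$, $i = 1, 2$. Since $d\omega = 0$,
we find by formula (15.47) that

$$0 = \int_M d\omega = \int_{C_2} \omega - \int_{C_1} \omega,$$

where both circles $C_1$ and $C_2$ are traversed counterclockwise. We know that

$$\int_{C_1} \omega = \int_{C_2} \omega = 2\pi \ne 0.$$

Hence, if we consider the manifold $\tilde{M} = M \setminus C_1$, then $\partial \tilde{M} = C_2$ and

$$\int_{\tilde{M}} d\omega = 0 \ne 2\pi = \int_{\partial \tilde{M}} \omega.$$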
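
The two circulation values in this example are easy to confirm numerically. The following sketch assumes that the form in question is $\omega = \frac{x\, dy - y\, dx}{x^2 + y^2}$, the form studied in Sect. 14.3, and that the circles $x^2 + y^2 = 1$ and $x^2 + y^2 = 2$ are parametrized by the polar angle; the function name is an illustrative assumption.

```python
import math

def winding_integral(radius, n=1000):
    """Integral of (x dy - y dx) / (x^2 + y^2) over the circle of the given
    radius centred at the origin, traversed counterclockwise."""
    total = 0.0
    h = 2 * math.pi / n
    for k in range(n):
        t = (k + 0.5) * h
        x, y = radius * math.cos(t), radius * math.sin(t)
        dx = -radius * math.sin(t) * h  # increment of x along the circle
        dy = radius * math.cos(t) * h   # increment of y along the circle
        total += (x * dy - y * dx) / (x * x + y * y)
    return total

inner = winding_integral(1.0)             # circle x^2 + y^2 = 1
outer = winding_integral(math.sqrt(2.0))  # circle x^2 + y^2 = 2
print(inner, outer, outer - inner)
```

Both integrals come out as $2\pi$ up to rounding, so their difference vanishes, as Stokes' formula on the compact annulus requires, while the integral over the outer circle alone does not vanish.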



15.3.6 Problems and Exercises 

1. a) We call two smooth paths $\gamma_i : \mathbb{R} \to M$, $i = 1, 2$ on a smooth manifold $M$
tangent at a point $p \in M$ if $\gamma_1(0) = \gamma_2(0) = p$ and the relation

$$\bigl|\varphi^{-1} \circ \gamma_1(t) - \varphi^{-1} \circ \gamma_2(t)\bigr| = o(t) \quad \text{as } t \to 0 \tag{15.50}$$

holds in each local coordinate system $\varphi : \mathbb{R}^n \to U$ (or $\varphi : H^n \to U$) whose range
$U$ contains $p$. Show that if (15.50) holds in one of these coordinate systems, then
it holds in any other local coordinate system of the same type on the smooth manifold $M$.

b) The property of being tangent at a point $p \in M$ is an equivalence relation on
the set of smooth paths on $M$ passing through $p$. We call an equivalence class a bundle
of tangent paths at $p \in M$. Establish the one-to-one correspondence exhibited
in Sect. 15.3.1 between vectors of $TM_p$ and bundles of tangent paths at the point
$p \in M$.

c) Show that if the paths $\gamma_1$ and $\gamma_2$ are tangent at $p \in M$ and $f \in C^{(1)}(M, \mathbb{R})$,
then

$$\frac{d(f \circ \gamma_1)}{dt}(0) = \frac{d(f \circ \gamma_2)}{dt}(0).$$

d) Show how to assign a functional $l = l_\xi\ (= D_\xi) : C^{(\infty)}(M, \mathbb{R}) \to \mathbb{R}$ possessing
properties (15.37) and (15.38), where $x_0 = p$, to each vector $\xi \in TM_p$. A functional
possessing these properties is called a derivation at the point $p \in M$.

Verify that differentiation $l$ at the point $p$ is a local operation, that is, if $f_1, f_2 \in
C^{(\infty)}$ and $f_1(x) \equiv f_2(x)$ in some neighborhood of $p$, then $lf_1 = lf_2$.

e) Show that if $x^1, \dots, x^n$ are local coordinates in a neighborhood of the point
$p$, then $l = \sum_{i=1}^n (lx^i) \frac{\partial}{\partial x^i}$, where $\frac{\partial}{\partial x^i}$ is the operation of computing the partial
derivative with respect to $x^i$ at the point $x$ corresponding to $p$. (Hint. Write
the function $f|_{U(p)} : M \to \mathbb{R}$ in local coordinates; remember that the expansion
$f(x) = f(0) + \sum_{i=1}^n x^i g_i(x)$ holds for the function $f \in C^{(\infty)}(\mathbb{R}^n, \mathbb{R})$, where
$g_i \in C^{(\infty)}(\mathbb{R}^n, \mathbb{R})$ and $g_i(0) = \frac{\partial f}{\partial x^i}(0)$, $i = 1, \dots, n$.)

f) Verify that if $M$ is a $C^{(\infty)}$ manifold, then the vector space of derivations at
the point $p \in M$ is isomorphic to the space $TM_p$ tangent to $M$ at $p$ constructed in
Sect. 15.3.1.

2. a) If we fix a vector $\xi(p) \in TM_p$ at each point $p \in M$ of a smooth manifold $M$,
we say that a vector field is defined on the manifold $M$. Let $X$ be a vector field on $M$.
Since by the preceding problem every vector $X(p) = \xi \in TM_p$ can be interpreted as
differentiation at the corresponding point $p$, from any function $f \in C^{(\infty)}(M, \mathbb{R})$ one
can construct a function $Xf(p)$ whose value at every point $p \in M$ can be computed
by applying $X(p)$ to $f$, that is, by differentiating $f$ with respect to the vector $X(p)$
in the field $X$. A field $X$ on $M$ is smooth (of class $C^{(\infty)}$) if for every function
$f \in C^{(\infty)}(M, \mathbb{R})$ the function $Xf$ also belongs to $C^{(\infty)}(M, \mathbb{R})$.

Give a local coordinate expression for a vector field and the coordinate definition
of a smooth ($C^{(\infty)}$) vector field on a smooth manifold equivalent to the one just
given.

b) Let $X$ and $Y$ be two smooth vector fields on the manifold $M$. For functions
$f \in C^{(\infty)}(M, \mathbb{R})$ we construct the functional $[X, Y]f = X(Yf) - Y(Xf)$. Verify
that $[X, Y]$ is also a smooth vector field on $M$. It is called the Poisson bracket of the
vector fields $X$ and $Y$.

c) Give a Lie algebra structure to the smooth vector fields on a manifold.


3. a) Let $X$ and $\omega$ be respectively a smooth vector field and a smooth 1-form on a
smooth manifold $M$. Let $\omega X$ denote the application of $\omega$ to the vector of the field
$X$ at corresponding points of $M$. Show that $\omega X$ is a smooth function on $M$.

b) Taking account of Problem 2, show that the following relation holds:

$$d\omega^1(X, Y) = X(\omega^1 Y) - Y(\omega^1 X) - \omega^1([X, Y]),$$

where $X$ and $Y$ are smooth vector fields, $d\omega^1$ is the differential of the form $\omega^1$,
and $d\omega^1(X, Y)$ is the application of $d\omega^1$ to pairs of vectors of the fields $X$ and $Y$
attached at the same point.

c) Verify that the relation

$$d\omega(X_1, \dots, X_{m+1}) = \sum_{i=1}^{m+1} (-1)^{i+1} X_i\, \omega(X_1, \dots, \overset{\frown}{X_i}, \dots, X_{m+1}) + \sum_{1 \le i < j \le m+1} (-1)^{i+j} \omega([X_i, X_j], X_1, \dots, \overset{\frown}{X_i}, \dots, \overset{\frown}{X_j}, \dots, X_{m+1})$$

holds for the general case of a form $\omega$ of order $m$. Here the frown $\frown$ denotes an
omitted term, $[X_i, X_j]$ is the Poisson bracket of the fields $X_i$ and $X_j$, and $X_i \omega$
represents differentiation of the function $\omega(X_1, \dots, \overset{\frown}{X_i}, \dots, X_{m+1})$ with respect to
the vectors of the field $X_i$. Since the Poisson bracket is invariantly defined, the
resulting relation can be thought of as a rather complicated but invariant definition
of the exterior differential operator $d : \Omega \to \Omega$.

d) Let $\omega$ be a smooth $m$-form on a smooth $n$-dimensional manifold $M$. Let
$(\xi_1, \dots, \xi_{m+1})_i$ be vectors in $\mathbb{R}^n$ corresponding to the vectors $\xi_1, \dots, \xi_{m+1} \in TM_p$
in the chart $\varphi_i : \mathbb{R}^n \to U \subset M$. We denote by $\Pi_i$ the parallelepiped formed by the
vectors $(\xi_1, \dots, \xi_{m+1})_i$ in $\mathbb{R}^n$, and let $\lambda\Pi_i$ be the parallelepiped spanned by the
vectors $(\lambda\xi_1, \dots, \lambda\xi_{m+1})_i$. We denote the images $\varphi_i(\Pi_i)$ and $\varphi_i(\lambda\Pi_i)$ of these
parallelepipeds in $M$ by $\Pi$ and $\lambda\Pi$ respectively. Show that

$$d\omega(p)(\xi_1, \dots, \xi_{m+1}) = \lim_{\lambda \to 0} \frac{1}{\lambda^{m+1}} \int_{\partial(\lambda\Pi)} \omega.$$

4. a) Let $f : M \to N$ be a smooth mapping of a smooth $m$-dimensional manifold
$M$ into a smooth $n$-dimensional manifold $N$. Using the interpretation of a tangent
vector to a manifold as a bundle of tangent paths (see Problem 1), construct the
mapping $f_*(p) : TM_p \to TN_{f(p)}$ induced by $f$.

b) Show that the mapping $f_*$ is linear and write it in corresponding local coordinates
on the manifolds $M$ and $N$. Explain why $f_*(p)$ is called the differential of
$f$ at $p$ or the mapping tangent to $f$ at that point.

Let $f$ be a diffeomorphism. Verify that $f_*[X, Y] = [f_*X, f_*Y]$. Here $X$ and $Y$
are vector fields on $M$ and $[\cdot, \cdot]$ is their Poisson bracket (see Problem 2).

c) As is known from Sect. 15.1, the tangent mapping $f_*(p) : TM_p \to TN_{q = f(p)}$
of tangent spaces generates the adjoint mapping $f^*(p)$ of the conjugate spaces and
in general a mapping of $k$-forms defined on $TN_{f(p)}$ and $TM_p$.

Let $\omega$ be a $k$-form on $N$. The $k$-form $f^*\omega$ on $M$ is defined by the relation

$$(f^*\omega)(p)(\xi_1, \dots, \xi_k) := \omega(f(p))(f_*\xi_1, \dots, f_*\xi_k),$$

where $\xi_1, \dots, \xi_k \in TM_p$. In this way a mapping $f^* : \Omega^k(N) \to \Omega^k(M)$ arises from
the space $\Omega^k(N)$ of $k$-forms defined on $N$ into the space $\Omega^k(M)$ of $k$-forms defined
on $M$.

Verify the following properties of the mapping $f^*$, assuming $M$ and $N$ are $C^{(\infty)}$
manifolds:

1° $f^*$ is a linear mapping;

2° $f^*(\omega_1 \wedge \omega_2) = f^*\omega_1 \wedge f^*\omega_2$;

3° $d \circ f^* = f^* \circ d$, that is $d(f^*\omega) = f^*(d\omega)$;

4° $(f_2 \circ f_1)^* = f_1^* \circ f_2^*$.

d) Let $M$ and $N$ be smooth $n$-dimensional oriented manifolds and $\varphi : M \to N$
a diffeomorphism of $M$ onto $N$. Show that if $\omega$ is an $n$-form on $N$ with compact
support, then

$$\int_{\varphi(M)} \omega = \varepsilon \int_M \varphi^* \omega,$$

where $\varepsilon = 1$ if $\varphi$ preserves orientation and $\varepsilon = -1$ if $\varphi$ reverses orientation.

e) Suppose $A \supset B$. The mapping $i : B \to A$ that assigns to each point $x \in B$
that same point as an element of $A$ is called the canonical embedding of $B$ in $A$.

If $\omega$ is a form on a manifold $M$ and $M'$ is a submanifold of $M$, the canonical
embedding $i : M' \to M$ generates a form $i^*\omega$ on $M'$ called the restriction of $\omega$
to $M'$. Show that the proper expression of Stokes' formula (15.47) should be

$$\int_M d\omega = \int_{\partial M} i^*\omega,$$

where $i : \partial M \to M$ is the canonical embedding of $\partial M$ in $M$, and the orientation of
$\partial M$ is induced from $M$.

5. a) Let $M$ be a smooth ($C^{(\infty)}$) oriented $n$-dimensional manifold and $\Omega_c^n(M)$ the
space of smooth ($C^{(\infty)}$) $n$-forms with compact support on $M$. Show that there exists
a unique mapping $\int_M : \Omega_c^n(M) \to \mathbb{R}$ having the following properties:

1° the mapping $\int_M$ is linear;

2° if $\varphi : I^n \to U \subset M$ (or $\varphi : \tilde{I}^n \to U \subset M$) is a chart of an atlas defining the
orientation of $M$, $\operatorname{supp} \omega \subset U$, and $\omega = a(x)\, dx^1 \wedge \cdots \wedge dx^n$ in the local coordinates
$x^1, \dots, x^n$ of this chart, then

$$\int_M \omega = \int_{I^n} a(x)\, dx^1 \cdots dx^n \quad \Bigl(\text{or } \int_M \omega = \int_{\tilde{I}^n} a(x)\, dx^1 \cdots dx^n\Bigr),$$

where the right-hand side contains the Riemann integral of the function $a$ over the
corresponding cube $I^n$ (or $\tilde{I}^n$).

b) Can the mapping just exhibited always be extended to a mapping $\int_M :
\Omega^n(M) \to \mathbb{R}$ of all smooth $n$-forms on $M$, retaining both of these properties?

c) Using the fact that every open covering of the manifold $M$ has an at most
countable locally finite refinement and the fact that there exists a partition of unity
subordinate to any such covering (see Problem 9), define the integral of an $n$-form
over an oriented smooth $n$-dimensional (not necessarily compact) manifold so that
it has properties 1° and 2° above when applied to the forms for which the integral is
finite. Show that for this integral formula (15.47) does not hold in general, and give
conditions on $\omega$ that are sufficient for (15.47) in the case when $M = \mathbb{R}^n$ and in the
case when $M = H^n$.

6. a) Using the theorem on existence and uniqueness of the solution of the differential
equation $\dot{x} = v(x)$ and also the smooth dependence of the solution on the initial
data, show that a smooth bounded vector field $v(x)$ in $\mathbb{R}^n$ can be regarded as the
velocity field of a steady-state flow. More precisely, show that there exists a family of
diffeomorphisms $\varphi_t : \mathbb{R}^n \to \mathbb{R}^n$ depending smoothly on the parameter $t$ (time) such
that $\varphi_t(x)$ is an integral curve of the equation for each fixed value of $x \in \mathbb{R}^n$, that is,
$\frac{\partial \varphi_t(x)}{\partial t} = v(\varphi_t(x))$ and $\varphi_0(x) = x$. The mapping $\varphi_t : \mathbb{R}^n \to \mathbb{R}^n$ obviously characterizes
the displacement of the particles of the medium at time $t$. Verify that the family
of mappings $\varphi_t : \mathbb{R}^n \to \mathbb{R}^n$ is a one-parameter group of diffeomorphisms, that is,
$(\varphi_t)^{-1} = \varphi_{-t}$, and $\varphi_{t_2} \circ \varphi_{t_1} = \varphi_{t_1 + t_2}$.

b) Let $v$ be a vector field on $\mathbb{R}^n$ and $\varphi_t$ a one-parameter group of diffeomorphisms
of $\mathbb{R}^n$ generated by $v$. Verify that the relation

$$\lim_{t \to 0} \frac{1}{t}\bigl(f(\varphi_t(x)) - f(x)\bigr) = D_{v(x)} f$$

holds for every smooth function $f \in C^{(\infty)}(\mathbb{R}^n, \mathbb{R})$.

If we introduce the notation $v(f) := D_v f$, in consistency with the notation of
Problem 2, and recall that $f \circ \varphi_t =: \varphi_t^* f$, we can write

$$\lim_{t \to 0} \frac{1}{t}(\varphi_t^* f - f)(x) = v(f)(x).$$

c) Differentiation of a smooth form $\omega$ of any degree defined in $\mathbb{R}^n$ along the
field $v$ is now naturally defined. To be specific, we set

$$v(\omega)(x) := \lim_{t \to 0} \frac{1}{t}(\varphi_t^* \omega - \omega)(x).$$

The form $v(\omega)$ is called the Lie derivative of the form $\omega$ along the field $v$ and
usually denoted $L_v \omega$. Define the Lie derivative $L_X \omega$ of a form $\omega$ along the field $X$
on an arbitrary smooth manifold $M$.

d) Show that the Lie derivative on a $C^{(\infty)}$ manifold $M$ has the following properties.

1° $L_X$ is a local operation, that is, if the fields $X_1$ and $X_2$ and the forms $\omega_1$ and $\omega_2$
are equal in a neighborhood $U \subset M$ of the point $x$, then $(L_{X_1}\omega_1)(x) = (L_{X_2}\omega_2)(x)$.

2° $L_X \Omega^k(M) \subset \Omega^k(M)$.

3° $L_X : \Omega^k(M) \to \Omega^k(M)$ is a linear mapping for every $k = 0, 1, 2, \dots$

4° $L_X(\omega_1 \wedge \omega_2) = (L_X \omega_1) \wedge \omega_2 + \omega_1 \wedge L_X \omega_2$.

5° If $f \in \Omega^0(M)$, then $L_X f = df(X) =: Xf$.

6° If $f \in \Omega^0(M)$, then $L_X\, df = d(Xf)$.

e) Verify that the properties 1°-6° determine the operation $L_X$ uniquely.

7. Let $X$ be a vector field and $\omega$ a form of degree $k$ on the smooth manifold $M$.
The inner product of the field $X$ and the form $\omega$ is the $(k - 1)$-form denoted
by $i_X \omega$ or $X \lrcorner\, \omega$ and defined by the relation
$(i_X \omega)(X_1, \dots, X_{k-1}) := \omega(X, X_1, \dots, X_{k-1})$, where $X_1, \dots, X_{k-1}$ are vector fields on $M$.
For 0-forms, that is, functions on $M$, we set $X \lrcorner\, f = 0$.

a) Show that if the form $\omega$ (more precisely, $\omega|_U$) has the form

$$\sum_{1 \le i_1 < \cdots < i_k \le n} a_{i_1 \dots i_k}(x)\, dx^{i_1} \wedge \cdots \wedge dx^{i_k} = \frac{1}{k!}\, a_{i_1 \dots i_k}\, dx^{i_1} \wedge \cdots \wedge dx^{i_k}$$

in the local coordinates $x^1, \dots, x^n$ of the chart $\varphi : \mathbb{R}^n \to U \subset M$, and $X = X^i \frac{\partial}{\partial x^i}$,
then

$$i_X \omega = \frac{1}{(k-1)!}\, X^i a_{i i_2 \dots i_k}\, dx^{i_2} \wedge \cdots \wedge dx^{i_k}.$$

b) Verify further that if $df = \frac{\partial f}{\partial x^i}\, dx^i$, then $i_X\, df = X^i \frac{\partial f}{\partial x^i} = X(f) \equiv D_X f$.

c) Let $X(M)$ be the space of vector fields on the manifold $M$ and $\Omega(M)$ the
ring of skew-symmetric forms on $M$. Show that there exists only one mapping
$i : X(M) \times \Omega(M) \to \Omega(M)$ having the following properties:

1° $i$ is a local operation, that is, if the fields $X_1$ and $X_2$ and the forms $\omega_1$ and $\omega_2$
are equal in a neighborhood $U$ of $x \in M$, then $(i_{X_1}\omega_1)(x) = (i_{X_2}\omega_2)(x)$;

2° $i_X(\Omega^k(M)) \subset \Omega^{k-1}(M)$;

3° $i_X : \Omega^k(M) \to \Omega^{k-1}(M)$ is a linear mapping;

4° if $\omega_1 \in \Omega^{k_1}(M)$ and $\omega_2 \in \Omega^{k_2}(M)$, then
$i_X(\omega_1 \wedge \omega_2) = i_X \omega_1 \wedge \omega_2 + (-1)^{k_1} \omega_1 \wedge i_X \omega_2$;

5° if $\omega \in \Omega^1(M)$, then $i_X \omega = \omega(X)$, and if $f \in \Omega^0(M)$, then $i_X f = 0$.

8. Prove the following assertions.

a) The operators $d$, $i_X$, and $L_X$ (see Problems 6 and 7) satisfy the so-called
homotopy identity

$$L_X = i_X d + d\, i_X, \tag{15.51}$$

where $X$ is any smooth vector field on the manifold.

b) The Lie derivative commutes with $d$ and $i_X$, that is,

$$L_X \circ d = d \circ L_X, \qquad L_X \circ i_X = i_X \circ L_X.$$

c) $[L_X, i_Y] = i_{[X,Y]}$, $[L_X, L_Y] = L_{[X,Y]}$, where, as always, $[A, B] = A \circ B - B \circ A$
for any operators $A$ and $B$ for which the expression $A \circ B - B \circ A$ is defined.
In this case, all brackets $[\ ,\ ]$ are defined.

d) $L_{fX}\, \omega = f L_X \omega + df \wedge i_X \omega$, where $f \in \Omega^0(M)$ and $\omega \in \Omega^k(M)$.

(Hint. Part a) is the main part of the problem. It can be verified, for example, by
induction on the degree of the form on which the operators act.)


15.4 Closed and Exact Forms on Manifolds

15.4.1 Poincaré's Theorem

In this section we shall supplement what was said about closed and exact differential
forms in Sect. 14.3 in connection with the theory of vector fields in $\mathbb{R}^n$. As before,
$\Omega^p(M)$ denotes the space of smooth real-valued forms of degree $p$ on the smooth
manifold $M$ and $\Omega(M) = \bigcup_p \Omega^p(M)$.

Definition 1 The form $\omega \in \Omega^p(M)$ is closed if $d\omega = 0$.

Definition 2 The form $\omega \in \Omega^p(M)$, $p > 0$, is exact if there exists a form
$\alpha \in \Omega^{p-1}(M)$ such that $\omega = d\alpha$.

The set of closed $p$-forms on the manifold $M$ will be denoted $Z^p(M)$, and the
set of exact $p$-forms on $M$ will be denoted $B^p(M)$.

The relation$^7$ $d(d\omega) = 0$ holds for every form $\omega \in \Omega(M)$, which shows that
$Z^p(M) \supset B^p(M)$. We already know from Sect. 14.3 that this inclusion is generally
strict.

The important question of the solvability (for $\alpha$) of the equation $d\alpha = \omega$ given
the necessary condition $d\omega = 0$ on the form $\omega$ turns out to be closely connected with
the topological structure of the manifold $M$. This statement will be deciphered more
completely below.

Definition 3 We shall call a manifold $M$ contractible (to the point $x_0 \in M$) or
homotopic to a point if there exists a smooth mapping $h : M \times I \to M$, where
$I = \{t \in \mathbb{R} \mid 0 \le t \le 1\}$, such that $h(x, 1) = x$ and $h(x, 0) = x_0$.

Example 1 The space $\mathbb{R}^n$ can be contracted to a point by the mapping $h(x, t) = tx$.

Theorem 1 (Poincaré) Every closed $(p + 1)$-form ($p \ge 0$) on a manifold that is
contractible to a point is exact.

Proof The nontrivial part of the proof consists of the following “cylindrical” construction,
which remains valid for every manifold $M$.

Consider the “cylinder” $M \times I$, which is the direct product of $M$ and the closed
unit interval $I$, and the two mappings $j_i : M \to M \times I$, where $j_i(x) = (x, i)$, $i = 0, 1$,
which identify $M$ with the bases of the cylinder $M \times I$. Then there naturally
arise mappings $j_i^* : \Omega^p(M \times I) \to \Omega^p(M)$, reducing to the replacement of the
variable $t$ in a form of $\Omega^p(M \times I)$ by the value $i$ ($= 0, 1$), and, of course, $di = 0$.

$^7$ Depending on the way in which the operator $d$ is introduced this property is either proved, in
which case it is called the Poincaré lemma, or taken as part of the definition of the operator $d$.

We construct a linear operator $K : \Omega^{p+1}(M \times I) \to \Omega^p(M)$, which we define
on monomials as follows:

$$K\bigl(a(x, t)\, dx^{i_1} \wedge \cdots \wedge dx^{i_{p+1}}\bigr) := 0,$$

$$K\bigl(a(x, t)\, dt \wedge dx^{i_1} \wedge \cdots \wedge dx^{i_p}\bigr) := \Bigl(\int_0^1 a(x, t)\, dt\Bigr) dx^{i_1} \wedge \cdots \wedge dx^{i_p}.$$

The main property of the operator $K$ that we need is that the relation

$$K(d\omega) + d(K\omega) = j_1^* \omega - j_0^* \omega \tag{15.52}$$

holds for every form $\omega \in \Omega^{p+1}(M \times I)$.

It suffices to verify this relation for monomials, since all the operators $K$, $d$, $j_1^*$,
and $j_0^*$ are linear.

If $\omega = a(x, t)\, dx^{i_1} \wedge \cdots \wedge dx^{i_{p+1}}$, then $K\omega = 0$, $d(K\omega) = 0$, and

$$d\omega = \frac{\partial a}{\partial t}\, dt \wedge dx^{i_1} \wedge \cdots \wedge dx^{i_{p+1}} + [\text{terms not containing } dt],$$

$$K(d\omega) = \Bigl(\int_0^1 \frac{\partial a}{\partial t}\, dt\Bigr) dx^{i_1} \wedge \cdots \wedge dx^{i_{p+1}} = \bigl(a(x, 1) - a(x, 0)\bigr)\, dx^{i_1} \wedge \cdots \wedge dx^{i_{p+1}} = j_1^* \omega - j_0^* \omega,$$

and relation (15.52) is valid.

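If $\omega = a(x, t)\, dt \wedge dx^{i_1} \wedge \cdots \wedge dx^{i_p}$, then $j_1^* \omega = j_0^* \omega = 0$. Then

$$K(d\omega) = K\Bigl(-\sum_{i_0} \frac{\partial a}{\partial x^{i_0}}\, dt \wedge dx^{i_0} \wedge dx^{i_1} \wedge \cdots \wedge dx^{i_p}\Bigr) = -\sum_{i_0} \Bigl(\int_0^1 \frac{\partial a}{\partial x^{i_0}}\, dt\Bigr) dx^{i_0} \wedge dx^{i_1} \wedge \cdots \wedge dx^{i_p},$$

$$d(K\omega) = d\Bigl(\Bigl(\int_0^1 a(x, t)\, dt\Bigr) dx^{i_1} \wedge \cdots \wedge dx^{i_p}\Bigr) = \sum_{i_0} \frac{\partial}{\partial x^{i_0}} \Bigl(\int_0^1 a(x, t)\, dt\Bigr) dx^{i_0} \wedge dx^{i_1} \wedge \cdots \wedge dx^{i_p} = \sum_{i_0} \Bigl(\int_0^1 \frac{\partial a}{\partial x^{i_0}}\, dt\Bigr) dx^{i_0} \wedge dx^{i_1} \wedge \cdots \wedge dx^{i_p},$$

where the last equality is obtained by differentiating under the integral sign.$^8$

Thus relation (15.52) holds in this case also. Now let $M$ be a manifold that is
contractible to the point $x_0 \in M$, let $h : M \times I \to M$ be the mapping in Definition 3,
and let $\omega$ be a $(p + 1)$-form on $M$. Then obviously $h \circ j_1 : M \to M$ is the identity
mapping and $h \circ j_0 : M \to x_0$ is the mapping of $M$ to the point $x_0$, so that
$(j_1^* \circ h^*)\omega = \omega$ and $(j_0^* \circ h^*)\omega = 0$. Hence it follows from (15.52) that in this case

$$K(d(h^*\omega)) + d(K(h^*\omega)) = \omega. \tag{15.53}$$

If in addition $\omega$ is a closed form on $M$, then, since $d(h^*\omega) = h^*(d\omega) = 0$, we find
by (15.53) that

$$d(K(h^*\omega)) = \omega.$$

Thus the closed form $\omega$ is the exterior derivative of the form $\alpha = K(h^*\omega) \in
\Omega^p(M)$, that is, $\omega$ is an exact form on $M$. □

$^8$ For the justification of the differentiation of the integral with respect to $x^{i_0}$ in this last equality,
see, for example, Sect. 17.1.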


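Example 2 Let $A$, $B$, and $C$ be smooth real-valued functions of the variables
$(x, y, z) \in \mathbb{R}^3$. We ask how to solve the following system of equations for $P$, $Q$,
and $R$:

$$\begin{cases} \dfrac{\partial R}{\partial y} - \dfrac{\partial Q}{\partial z} = A, \\[2mm] \dfrac{\partial P}{\partial z} - \dfrac{\partial R}{\partial x} = B, \\[2mm] \dfrac{\partial Q}{\partial x} - \dfrac{\partial P}{\partial y} = C. \end{cases} \tag{15.54}$$

An obvious necessary condition for the consistency of the system (15.54) is that
the functions $A$, $B$, and $C$ satisfy the relation

$$\frac{\partial A}{\partial x} + \frac{\partial B}{\partial y} + \frac{\partial C}{\partial z} = 0,$$

which is equivalent to saying that the form

$$\omega = A\, dy \wedge dz + B\, dz \wedge dx + C\, dx \wedge dy$$

is closed in $\mathbb{R}^3$.

The system (15.54) will have been solved if we find a form

$$\alpha = P\, dx + Q\, dy + R\, dz$$

such that $d\alpha = \omega$.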

In accordance with the recipes explained in the proof of Theorem 1, and taking
account of the mapping $h$ constructed in Example 1, we find, after simple computations,

$$\alpha = K(h^*\omega) = \Bigl(\int_0^1 A(tx, ty, tz)\, t\, dt\Bigr)(y\, dz - z\, dy) + \Bigl(\int_0^1 B(tx, ty, tz)\, t\, dt\Bigr)(z\, dx - x\, dz) + \Bigl(\int_0^1 C(tx, ty, tz)\, t\, dt\Bigr)(x\, dy - y\, dx).$$

One can also verify directly that $d\alpha = \omega$.
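
The direct verification can also be carried out numerically. The sketch below is an illustration under stated assumptions: it takes the hypothetical data $A = y$, $B = z$, $C = x$ (for which the consistency condition holds trivially), evaluates the three integrals in the formula for $\alpha = K(h^*\omega)$ by Simpson's rule, and compares $d\alpha$, computed by central differences, with $\omega$. The function names are illustrative.

```python
def simpson(g, n=200):
    """Composite Simpson rule for the integral of g over [0, 1] (n even)."""
    h = 1.0 / n
    s = g(0.0) + g(1.0)
    for k in range(1, n):
        s += (4 if k % 2 else 2) * g(k * h)
    return s * h / 3

def primitive(A, B, C):
    """Return (x, y, z) -> (P, Q, R), the coefficients of alpha = K(h^* omega)
    for omega = A dy^dz + B dz^dx + C dx^dy and the contraction h(x, t) = t x."""
    def alpha(x, y, z):
        a = simpson(lambda t: A(t * x, t * y, t * z) * t)
        b = simpson(lambda t: B(t * x, t * y, t * z) * t)
        c = simpson(lambda t: C(t * x, t * y, t * z) * t)
        # a (y dz - z dy) + b (z dx - x dz) + c (x dy - y dx),
        # regrouped by dx, dy, dz
        return b * z - c * y, c * x - a * z, a * y - b * x
    return alpha

def d_alpha(alpha, x, y, z, h=1e-5):
    """Coefficients of d(alpha) at (x, y, z) by central differences,
    ordered as (dy^dz, dz^dx, dx^dy)."""
    def partial(component, axis):
        plus = [x, y, z]
        minus = [x, y, z]
        plus[axis] += h
        minus[axis] -= h
        return (alpha(*plus)[component] - alpha(*minus)[component]) / (2 * h)
    return (partial(2, 1) - partial(1, 2),
            partial(0, 2) - partial(2, 0),
            partial(1, 0) - partial(0, 1))

A = lambda x, y, z: y
B = lambda x, y, z: z
C = lambda x, y, z: x
alpha = primitive(A, B, C)
print(d_alpha(alpha, 0.3, -1.2, 0.7))  # approximately (A, B, C) = (-1.2, 0.7, 0.3)
```

For these data the integrals can be done by hand: $P = (z^2 - xy)/3$, $Q = (x^2 - yz)/3$, $R = (y^2 - xz)/3$, and substituting into (15.54) returns $A$, $B$, $C$.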


Remark The amount of arbitrariness in the choice of a form $\alpha$ satisfying the condition
$d\alpha = \omega$ is usually considerable. Thus, along with $\alpha$, any form $\alpha + d\eta$ will
obviously also satisfy the same equation.


By Theorem 1 any two forms $\alpha$ and $\beta$ on a contractible manifold $M$ satisfying
$d\alpha = d\beta = \omega$ differ by an exact form. Indeed, $d(\alpha - \beta) = 0$, that is, the form $(\alpha - \beta)$
is closed on $M$ and hence exact, by Theorem 1.


15.4.2 Homology and Cohomology 

By Poincaré’s theorem every closed form on a manifold is locally exact. But it is by no means always possible to glue these local primitives together to obtain a single form. Whether this can be done depends on the topological structure of the manifold. For example, the closed form in the punctured plane $\mathbb{R}^2 \setminus 0$ given by $\omega = \frac{-y\,dx + x\,dy}{x^2 + y^2}$, studied in Sect. 14.3, is locally the differential of a function $\varphi = \varphi(x, y)$ - the polar angle of the point $(x, y)$. However, extending that function to the domain $\mathbb{R}^2 \setminus 0$ leads to multivaluedness if the closed path over which the extension is carried out encloses the hole - the point 0. The situation is approximately the same with forms of other degrees. “Holes” in manifolds may be of different kinds, not only missing points, but also holes such as one finds in a torus or a pretzel. The structure of manifolds of higher dimensions can be rather complicated. The connection between the structure of a manifold as a topological space and the relationship between closed and exact forms on it is described by the so-called (co)homology groups of the manifold.
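The multivaluedness of the polar angle can be observed numerically. The sketch below is our own illustration (the function name `period` and the two test circles are assumptions, not part of the text): it approximates the integral of $\omega = \frac{-y\,dx + x\,dy}{x^2 + y^2}$ over a circle by the midpoint rule in the angle parameter, once for a circle that encloses the point 0 and once for a circle that does not.

```python
import math


def period(cx, cy, r, n=2000):
    """Approximate the integral of omega = (-y dx + x dy) / (x^2 + y^2)
    over the circle of radius r centred at (cx, cy), traversed once
    counterclockwise (midpoint rule in the angle parameter)."""
    total = 0.0
    dt = 2 * math.pi / n
    for k in range(n):
        t = (k + 0.5) * dt
        x = cx + r * math.cos(t)
        y = cy + r * math.sin(t)
        dx = -r * math.sin(t) * dt
        dy = r * math.cos(t) * dt
        total += (-y * dx + x * dy) / (x * x + y * y)
    return total


around_hole = period(0.0, 0.0, 1.0)      # the path encloses the point 0
away_from_hole = period(3.0, 0.0, 1.0)   # the path does not enclose 0
```

One should expect `around_hole` to be close to $2\pi$ (the increment of the polar angle after one circuit) and `away_from_hole` to be close to 0, in agreement with the fact that $\omega$ is closed but has no single-valued primitive in the punctured plane.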

The closed and exact real-valued forms on a manifold $M$ form the vector spaces $Z^p(M)$ and $B^p(M)$ respectively, and $Z^p(M) \supset B^p(M)$.

Definition 4 The quotient space

$$H^p(M) := Z^p(M)/B^p(M) \tag{15.55}$$

is called the p-dimensional cohomology group of the manifold $M$ (with real coefficients).

Thus, two closed forms $\omega_1, \omega_2 \in Z^p(M)$ lie in the same cohomology class, or are cohomologous, if $\omega_1 - \omega_2 \in B^p(M)$, that is, if they differ by an exact form. The cohomology class of the form $\omega \in Z^p(M)$ will be denoted $[\omega]$.

Since $Z^p(M)$ is the kernel of the operator $d^p : \Omega^p(M) \to \Omega^{p+1}(M)$, and $B^p(M)$ is the image of the operator $d^{p-1} : \Omega^{p-1}(M) \to \Omega^p(M)$, we often write

$$H^p(M) = \operatorname{Ker} d^p / \operatorname{Im} d^{p-1}.$$


Computing cohomologies, as a rule, is difficult. However, certain trivial general 
observations can be made. 

It follows from Definition 4 that if $p > \dim M$, then $H^p(M) = 0$.

It follows from Poincaré’s theorem that if $M$ is contractible then $H^p(M) = 0$ for $p > 0$.

On any connected manifold $M$ the group $H^0(M)$ is isomorphic to $\mathbb{R}$, since $H^0(M) = Z^0(M)$, and if $df = 0$ holds for the function $f : M \to \mathbb{R}$ on a connected manifold $M$, then $f = \mathrm{const}$.

Thus, for example, we find for $\mathbb{R}^n$ that $H^p(\mathbb{R}^n) = 0$ for $p > 0$ and $H^0(\mathbb{R}^n) \sim \mathbb{R}$. This assertion (up to the trivial last relation) is equivalent to Theorem 1 with $M = \mathbb{R}^n$ and is also called Poincaré’s theorem.

The so-called homology groups have a more visualizable geometrical relation to 
the manifold M. 

Definition 5 A smooth mapping $c : I^p \to M$ of the $p$-dimensional cube $I^p \subset \mathbb{R}^p$ into the manifold $M$ is called a singular p-cube on $M$.

This is a direct generalization of the concept of a smooth path to the case of an arbitrary dimension $p$. In particular, a singular cube may consist of a mapping of the cube $I^p$ to a single point.

Definition 6 A p-chain (of singular cubes) on a manifold $M$ is any finite formal linear combination $\sum_k \alpha_k c_k$ of singular $p$-cubes on $M$ with real coefficients.

Like paths, singular cubes that can be obtained from each other by a diffeomorphic change of the parametrization with positive Jacobian are regarded as equivalent and are identified. If such a change of parameter has negative Jacobian, then the corresponding oppositely oriented singular cubes $c$ and $c_-$ are regarded as negatives of each other, and we set $c_- = -c$.

The $p$-chains on $M$ obviously form a vector space with respect to the standard operations of addition and multiplication by a real number. We denote this space by $C_p(M)$.

Definition 7 The boundary $\partial I^p$ of the $p$-dimensional cube $I^p$ in $\mathbb{R}^p$ is the $(p-1)$-chain

$$\partial I^p := \sum_{i=0}^{1} \sum_{j=1}^{p} (-1)^{i+j} c_{ij} \tag{15.56}$$

in $\mathbb{R}^p$, where $c_{ij} : I^{p-1} \to \mathbb{R}^p$ is the mapping of the $(p-1)$-dimensional cube into $\mathbb{R}^p$ induced by the canonical embedding of the corresponding face of $I^p$ in $\mathbb{R}^p$. More precisely, if $I^{p-1} = \{\tilde{x} \in \mathbb{R}^{p-1} \mid 0 \le \tilde{x}^m \le 1,\ m = 1, \ldots, p-1\}$, then $c_{ij}(\tilde{x}) = (\tilde{x}^1, \ldots, \tilde{x}^{j-1}, i, \tilde{x}^{j}, \ldots, \tilde{x}^{p-1}) \in \mathbb{R}^p$.

It is easy to verify that this formal definition of the boundary of a cube agrees completely with the operation of taking the boundary of the standard oriented cube $I^p$ (see Sect. 12.3).

Definition 8 The boundary $\partial c$ of the singular $p$-cube $c$ is the $(p-1)$-chain

$$\partial c := \sum_{i=0}^{1} \sum_{j=1}^{p} (-1)^{i+j}\, c \circ c_{ij}.$$

Definition 9 The boundary of a $p$-chain $\sum_k \alpha_k c_k$ on the manifold $M$ is the $(p-1)$-chain

$$\partial \Bigl(\sum_k \alpha_k c_k\Bigr) := \sum_k \alpha_k\, \partial c_k.$$

Thus on any space of chains $C_p(M)$ we have defined a linear operator

$$\partial = \partial_p : C_p(M) \to C_{p-1}(M).$$

Using relation (15.56), one can verify the relation $\partial(\partial I) = 0$ for the cube. Consequently $\partial \circ \partial = \partial^2 = 0$ in general.
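The cancellation behind the relation $\partial \circ \partial = 0$ for the cube can be checked mechanically. In the sketch below (our own bookkeeping, not a construction from the text) a face of the cube is encoded as a tuple whose entries are 0 or 1 for a frozen coordinate and `None` for a free coordinate; composing with the face mapping $c_{ij}$ then amounts to freezing the $j$-th free coordinate at the value $i$, with the sign $(-1)^{i+j}$ of formula (15.56).

```python
from collections import defaultdict


def boundary(chain):
    """Apply formula (15.56) face by face to a formal chain.

    A face of the cube is a tuple whose entries are 0 or 1 (a frozen
    coordinate) or None (a free coordinate); a chain is a dict that maps
    faces to real coefficients."""
    result = defaultdict(float)
    for face, coeff in chain.items():
        free = [pos for pos, value in enumerate(face) if value is None]
        for j, pos in enumerate(free, start=1):   # j-th free coordinate
            for i in (0, 1):
                sub = list(face)
                sub[pos] = i                      # freeze it at the value i
                result[tuple(sub)] += coeff * (-1) ** (i + j)
    # Keep only the faces whose coefficients did not cancel.
    return {face: c for face, c in result.items() if c != 0}


def cube(p):
    """The cube I^p itself, regarded as a chain with a single term."""
    return {(None,) * p: 1.0}


faces_of_square = boundary(cube(2))
double_boundaries = [boundary(boundary(cube(p))) for p in range(1, 6)]
```

Here `faces_of_square` lists the four oriented sides of the square, and every entry of `double_boundaries` should be the empty chain. This checks only the combinatorics of the standard cube; the general statement follows by composing with a singular cube $c$.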

Definition 10 A p-cycle on a manifold is a $p$-chain $z$ for which $\partial z = 0$.

Definition 11 A boundary p-cycle on a manifold is a $p$-chain that is the boundary of some $(p+1)$-chain.

Let $Z_p(M)$ and $B_p(M)$ be the sets of $p$-cycles and boundary $p$-cycles on the manifold $M$. It is clear that $Z_p(M)$ and $B_p(M)$ are vector spaces over the field $\mathbb{R}$ and that $Z_p(M) \supset B_p(M)$.

Definition 12 The quotient space

$$H_p(M) := Z_p(M)/B_p(M) \tag{15.57}$$

is the p-dimensional homology group of the manifold $M$ (with real coefficients).

Thus, two cycles $z_1, z_2 \in Z_p(M)$ are in the same homology class, or are homologous, if $z_1 - z_2 \in B_p(M)$, that is, they differ by the boundary of some chain. We shall denote the homology class of a cycle $z \in Z_p(M)$ by $[z]$.




As in the case of cohomology, relation (15.57) can be rewritten as

$$H_p(M) = \operatorname{Ker} \partial_p / \operatorname{Im} \partial_{p+1}.$$


Definition 13 If $c : I^p \to M$ is a singular $p$-cube and $\omega$ is a $p$-form on the manifold $M$, then the integral of the form $\omega$ over this singular cube is

$$\int_c \omega := \int_{I^p} c^*\omega. \tag{15.58}$$

Definition 14 If $\sum_k \alpha_k c_k$ is a $p$-chain and $\omega$ is a $p$-form on the manifold $M$, the integral of the form over such a chain is interpreted as the linear combination $\sum_k \alpha_k \int_{c_k} \omega$ of the integrals over the corresponding singular cubes.

It follows from Definitions 5-8 and 13-14 that Stokes’ formula

$$\int_c d\omega = \int_{\partial c} \omega \tag{15.59}$$

holds for the integral over a singular cube, where $c$ and $\omega$ have dimension $p$ and degree $p - 1$ respectively. If we take account of Definition 9, we conclude that Stokes’ formula (15.59) is valid for integrals over chains.

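Theorem 2 a) The integral of an exact form over a cycle equals zero.

b) The integral of a closed form over the boundary of a chain equals zero.

c) The integral of a closed form over a cycle depends only on the cohomology class of the form.

d) If the closed $p$-forms $\omega_1$ and $\omega_2$ and the $p$-cycles $z_1$ and $z_2$ are such that $[\omega_1] = [\omega_2]$ and $[z_1] = [z_2]$, then

$$\int_{z_1} \omega_1 = \int_{z_2} \omega_2.$$

Proof a) By Stokes’ formula $\int_z d\omega = \int_{\partial z} \omega = 0$, since $\partial z = 0$.

b) By Stokes’ formula $\int_{\partial c} \omega = \int_c d\omega = 0$, since $d\omega = 0$.

c) follows from a), since cohomologous closed forms differ by an exact form.

d) follows from b) and c): by c) the form $\omega_1$ can be replaced by $\omega_2$, and by b) the cycle $z_1$ can be replaced by $z_2$, since $z_1 - z_2$ is a boundary. □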


Corollary The bilinear mapping $\Omega^p(M) \times C_p(M) \to \mathbb{R}$ defined by $(\omega, c) \mapsto \int_c \omega$ induces a bilinear mapping $Z^p(M) \times Z_p(M) \to \mathbb{R}$ and a bilinear mapping $H^p(M) \times H_p(M) \to \mathbb{R}$. The latter is given by the formula

$$([\omega], [z]) \mapsto \int_z \omega, \tag{15.60}$$

where $\omega \in Z^p(M)$ and $z \in Z_p(M)$.

Theorem 3 (de Rham 9 ) The bilinear mapping $H^p(M) \times H_p(M) \to \mathbb{R}$ given by (15.60) is nondegenerate. 10

We shall not take the time to prove this theorem here, but we shall find some 
reformulations of it that will enable us to present in explicit form some corollaries 
of it that are used in analysis. 

We remark first of all that by (15.60) each cohomology class $[\omega] \in H^p(M)$ can be interpreted as a linear function $[\omega]([z]) = \int_z \omega$. Thus a natural mapping $H^p(M) \to H_p^*(M)$ arises, where $H_p^*(M)$ is the vector space conjugate to $H_p(M)$. The theorem of de Rham asserts that this mapping is an isomorphism, and in this sense $H^p(M) = H_p^*(M)$.

Definition 15 If $\omega$ is a closed $p$-form and $z$ is a $p$-cycle on the manifold $M$, then the quantity $\operatorname{per}(z) := \int_z \omega$ is called the period (or cyclic constant) of the form $\omega$ over the cycle $z$.

In particular, if the cycle $z$ is homologous to zero, then, as follows from assertion b) of Theorem 2, we have $\operatorname{per}(z) = 0$. For that reason the following connection exists between periods:

$$\Bigl[\sum_k \alpha_k z_k\Bigr] = 0 \implies \sum_k \alpha_k \operatorname{per}(z_k) = 0, \tag{15.61}$$

that is, if a linear combination of cycles is a boundary cycle, or, what is the same, is homologous to zero, then the corresponding linear combination of periods is zero.

The following two theorems of de Rham hold; taken together, they are equivalent 
to Theorem 3. 

Theorem 4 (de Rham’s first theorem) A closed form is exact if and only if all its 
periods are zero. 

Theorem 5 (de Rham’s second theorem) If a number $\operatorname{per}(z)$ is assigned to each $p$-cycle $z \in Z_p(M)$ on the manifold $M$ in such a way that condition (15.61) holds, then there is a closed $p$-form $\omega$ on $M$ such that $\int_z \omega = \operatorname{per}(z)$ for every cycle $z \in Z_p(M)$.


15.4.3 Problems and Exercises 

1. Verify by direct computation that the form $\alpha$ obtained in Example 2 does indeed satisfy the equation $d\alpha = \omega$.


9 G. de Rham (1903-1990) - Swiss mathematician who worked mainly in algebraic topology.

10 We recall that a bilinear form L(x, y) is nondegenerate if for every fixed nonzero value of one of 
the variables the resulting linear function of the other variable is not identically zero. 




2. a) Prove that every simply-connected domain in $\mathbb{R}^2$ is contractible on itself to a point.

b) Show that the preceding assertion is generally not true in $\mathbb{R}^3$.

3. Analyze the proof of Poincaré’s theorem and show that if the smooth mapping $h : M \times I \to M$ is regarded as a family of mappings $h_t : M \to M$ depending on the parameter $t$, then for every closed form $\omega$ on $M$ all the forms $h_t^*\omega$, $t \in I$, will be in the same cohomology class.

4. a) Let $t \mapsto h_t \in C^{(\infty)}(M, N)$ be a family of mappings of the manifold $M$ into the manifold $N$ depending smoothly on the parameter $t \in I \subset \mathbb{R}$. Verify that for every form $\omega \in \Omega(N)$ the following homotopy formula holds:

$$\frac{\partial}{\partial t}(h_t^*\omega)(x) = d h_t^*(i_X \omega)(x) + h_t^*(i_X\, d\omega)(x). \tag{15.62}$$

Here $x \in M$, $X$ is a vector field on $N$ with $X(x, t) \in TN_{h_t(x)}$, $X(x, t)$ is the velocity vector for the path $t' \mapsto h_{t'}(x)$ at $t' = t$, and the operation $i_X$ of taking the inner product of a form and a vector field is defined in Problem 7 of the preceding section.

b) Obtain the assertion of Problem 3 from formula (15.62). 

c) Using formula (15.62), prove Poincare’s theorem (Theorem 1) again. 

d) Show that if $K$ is a manifold that is contractible to a point, then $H^p(K \times M) = H^p(M)$ for every manifold $M$ and any integer $p$.

e) Obtain relation (15.51) of Sect. 15.3 from formula (15.62). 

5. a) Show, using Theorem 4, and also by direct demonstration, that if a closed 2-form $\omega$ on the sphere $S^2$ is such that $\int_{S^2} \omega = 0$, then $\omega$ is exact.

b) Show that the group $H^2(S^2)$ is isomorphic to $\mathbb{R}$.

c) Show that $H^1(S^2) = 0$.

6. a) Let $\varphi : S^2 \to S^2$ be the mapping that assigns to each point $x \in S^2$ the antipodal point $-x \in S^2$. Show that there is a one-to-one correspondence between forms on the projective plane $\mathbb{R}P^2$ and forms on the sphere $S^2$ that are invariant under the mapping $\varphi$, that is, $\varphi^*\omega = \omega$.

b) Let us represent $\mathbb{R}P^2$ as the quotient space $S^2/\Gamma$, where $\Gamma$ is the group of transformations of the sphere consisting of the identity mapping and the antipodal mapping $\varphi$. Let $\pi : S^2 \to \mathbb{R}P^2 = S^2/\Gamma$ be the natural projection, that is $\pi(x) = \{x, -x\}$. Show that $\pi \circ \varphi = \pi$ and verify that

$$\forall \eta \in \Omega^p(S^2)\ \bigl((\varphi^*\eta = \eta) \Leftrightarrow \exists\, \omega \in \Omega^p(\mathbb{R}P^2)\ (\pi^*\omega = \eta)\bigr).$$

c) Now show, using the result of Problem 5a), that $H^2(\mathbb{R}P^2) = 0$.

d) Prove that if the function $f \in C(S^2, \mathbb{R})$ is such that $f(x) - f(-x) \equiv \mathrm{const}$, then this constant is zero, that is, $f(x) \equiv f(-x)$. Taking account of Problem 5c), deduce from this that $H^1(\mathbb{R}P^2) = 0$.

7. a) Representing $\mathbb{R}P^2$ as a standard rectangle $\Pi$ with opposite sides identified as shown by the orienting arrows in Fig. 15.3, show that $\partial \Pi = 2c' - 2c$, $\partial c = P - Q$, and $\partial c' = P - Q$.

Fig. 15.3 (the rectangle $\Pi$ with vertices labeled $P$ and $Q$ and a side labeled $c$; figure not reproduced)

b) Deduce from the observations in the preceding part of the problem that there are no nontrivial 2-cycles on $\mathbb{R}P^2$. Then show by de Rham’s theorem that $H^2(\mathbb{R}P^2) = 0$.

c) Show that the only nontrivial 1-cycle on $\mathbb{R}P^2$ (up to a constant factor) is the cycle $c' - c$, and since $c' - c = \frac{1}{2}\partial \Pi$, deduce from de Rham’s theorem that $H^1(\mathbb{R}P^2) = 0$.

8. Find the groups $H^0(M)$, $H^1(M)$, and $H^2(M)$ if

a) $M = S^1$ - the circle;

b) $M = T^2$ - the two-dimensional torus;

c) $M = K^2$ - the Klein bottle.

9. a) Prove that diffeomorphic manifolds have isomorphic (co)homology groups of the corresponding dimension.

b) Using the example of $\mathbb{R}^2$ and $\mathbb{R}P^2$, show that the converse is generally not true.

10. Let $X$ and $Y$ be vector spaces over the field $\mathbb{R}$ and $L(x, y)$ a nondegenerate bilinear form $L : X \times Y \to \mathbb{R}$. Consider the mapping $X \to Y^*$ given by the correspondence $X \ni x \mapsto L(x, \cdot) \in Y^*$.

a) Prove that this mapping is injective.

b) Show that for every system $y_1, \ldots, y_k$ of linearly independent vectors in $Y$ there exist vectors $x^1, \ldots, x^k$ such that $x^i(y_j) = L(x^i, y_j) = \delta^i_j$, where $\delta^i_j = 0$ if $i \ne j$ and $\delta^i_j = 1$ if $i = j$.

c) Verify that the mapping $X \to Y^*$ is an isomorphism of the vector spaces $X$ and $Y^*$.

d) Show that de Rham’s first and second theorems together mean that $H^p(M) = H_p^*(M)$ up to isomorphism.




Chapter 16 

Uniform Convergence and the Basic Operations 
of Analysis on Series and Families of Functions 


16.1 Pointwise and Uniform Convergence 

16.1.1 Pointwise Convergence 


Definition 1 We say that the sequence $\{f_n;\ n \in \mathbb{N}\}$ of functions $f_n : X \to \mathbb{R}$ converges at the point $x \in X$ if the sequence of values at $x$, $\{f_n(x);\ n \in \mathbb{N}\}$, converges.

Definition 2 The set of points $E \subset X$ at which the sequence $\{f_n;\ n \in \mathbb{N}\}$ of functions $f_n : X \to \mathbb{R}$ converges is called the convergence set of the sequence.

Definition 3 On the convergence set of the sequence of functions $\{f_n;\ n \in \mathbb{N}\}$ there naturally arises a function $f : E \to \mathbb{R}$ defined by the relation $f(x) := \lim_{n \to \infty} f_n(x)$. This function is called the limit function of the sequence $\{f_n;\ n \in \mathbb{N}\}$ or the limit of the sequence $\{f_n;\ n \in \mathbb{N}\}$.

Definition 4 If $f : E \to \mathbb{R}$ is the limit of the sequence $\{f_n;\ n \in \mathbb{N}\}$, we say that the sequence of functions converges (or converges pointwise) to $f$ on $E$.

In this case we write $f(x) = \lim_{n \to \infty} f_n(x)$ on $E$ or $f_n \to f$ on $E$ as $n \to \infty$.


Example 1 Let $X = \{x \in \mathbb{R} \mid x \ge 0\}$ and let the functions $f_n : X \to \mathbb{R}$ be given by the relation $f_n(x) = x^n$, $n \in \mathbb{N}$. The convergence set of this sequence of functions is obviously the closed interval $I = [0, 1]$ and the limit function $f : I \to \mathbb{R}$ is defined by

$$f(x) = \begin{cases} 0, & \text{if } 0 \le x < 1,\\ 1, & \text{if } x = 1. \end{cases}$$


Example 2 The sequence of functions $f_n(x) = \frac{\sin n^2 x}{n}$ on $\mathbb{R}$ converges on $\mathbb{R}$ to the function $f : \mathbb{R} \to 0$ that is identically 0.

Example 3 The sequence $f_n(x) = \frac{\sin nx}{n^2}$ also has the identically zero function $f : \mathbb{R} \to 0$ as its limit.

Example 4 Consider the sequence $f_n(x) = 2(n+1)x(1 - x^2)^n$ on the closed interval $I = [0, 1]$. Since $nq^n \to 0$ for $|q| < 1$, this sequence tends to zero on the entire closed interval $I$.


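Example 5 Let $m, n \in \mathbb{N}$, and let $f_m(x) := \lim_{n \to \infty} (\cos m!\,\pi x)^{2n}$. If $m!\,x$ is an integer, then $f_m(x) = 1$, and if $m!\,x \notin \mathbb{Z}$, obviously $f_m(x) = 0$.

We shall now consider the sequence $\{f_m;\ m \in \mathbb{N}\}$ and show that it converges on the entire real line to the Dirichlet function

$$\mathcal{D}(x) = \begin{cases} 0, & \text{if } x \notin \mathbb{Q},\\ 1, & \text{if } x \in \mathbb{Q}. \end{cases}$$

Indeed, if $x \notin \mathbb{Q}$, then $m!\,x \notin \mathbb{Z}$, and $f_m(x) = 0$ for every value of $m \in \mathbb{N}$, so that $f(x) = 0$. But if $x = \frac{p}{q}$, where $p \in \mathbb{Z}$ and $q \in \mathbb{N}$, then $m!\,x \in \mathbb{Z}$ for $m \ge q$, and $f_m(x) = 1$ for all such $m$, which implies $f(x) = 1$.

Thus $\lim_{m \to \infty} f_m(x) = \mathcal{D}(x)$.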
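For a rational point the stabilization of the values $f_m(x)$ can be traced in exact arithmetic. The sketch below is our own illustration: it does not compute the inner limit numerically, but uses the observation made in Example 5 that the inner limit equals 1 exactly when $m!\,x$ is an integer. The helper name `f_m` and the sample point $x = 3/7$ are assumptions of the sketch; irrational points cannot be represented this way and are not covered.

```python
from fractions import Fraction
from math import factorial


def f_m(m, x):
    """Value of f_m(x) = lim_{n -> oo} (cos(m! * pi * x))**(2n) for rational x.

    The inner limit is 1 when m! * x is an integer (the cosine is +1 or -1)
    and 0 otherwise (the absolute value of the cosine is less than 1)."""
    return 1 if (factorial(m) * x).denominator == 1 else 0


x = Fraction(3, 7)
values = [f_m(m, x) for m in range(1, 11)]
```

For $x = 3/7$ the values are 0 for $m < 7$ and 1 for $m \ge 7$, in agreement with the argument above with $q = 7$.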


16.1.2 Statement of the Fundamental Problems 

Limiting passages are encountered at every step in analysis, and it is often impor- 
tant to know what kind of functional properties the limit function has. The most 
important properties for analysis are continuity, differentiability, and integrability. 
Hence it is important to determine whether the limit is a continuous, differentiable, 
or integrable function if the prelimit functions all have the corresponding property. 
Here it is especially important to find conditions that are sufficiently convenient in 
practice and which guarantee that when the functions converge, their derivatives or 
integrals also converge to the derivative or integral of the limit function. 

As the simple examples examined above show, without some additional hypotheses the relation “$f_n \to f$ on $[a, b]$ as $n \to \infty$” does not in general imply either the continuity of the limit function, even when the functions $f_n$ are continuous, or the relation $f_n' \to f'$ or $\int_a^b f_n(x)\,dx \to \int_a^b f(x)\,dx$, even when all these derivatives and integrals are defined.

Indeed, 

in Example 1 the limit function is discontinuous on [0, 1] although all the prelimit 
functions are continuous there; 

in Example 2 the derivatives $n \cos n^2 x$ of the prelimit functions in general do not converge, and hence cannot converge to the derivative of the limit function, which in this case is identically zero;

in Example 4 we have $\int_0^1 f_n(x)\,dx = 1$ for every value of $n \in \mathbb{N}$, while $\int_0^1 f(x)\,dx = 0$;

in Example 5, on any closed interval $[a, b] \subset \mathbb{R}$ each of the functions $f_m$ equals zero except at a finite set of points, so that $\int_a^b f_m(x)\,dx = 0$, while the limit function $\mathcal{D}$ is not integrable on any closed interval of the real line.
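The behavior described for Example 4 is easy to observe numerically. The following sketch (our own illustration; the midpoint rule, the step count, and the sample point $x = 1/2$ are assumptions) evaluates the functions $f_n(x) = 2(n+1)x(1 - x^2)^n$ at a fixed point and approximates their integrals over $[0, 1]$.

```python
def f(n, x):
    """Prelimit function of Example 4."""
    return 2 * (n + 1) * x * (1 - x * x) ** n


def integral(n, steps=20000):
    """Midpoint-rule approximation of the integral of f_n over [0, 1]."""
    h = 1.0 / steps
    return h * sum(f(n, (k + 0.5) * h) for k in range(steps))


pointwise = [f(n, 0.5) for n in (10, 100, 1000)]     # values at x = 1/2
integrals = [integral(n) for n in (10, 100, 1000)]   # should stay near 1
```

The entries of `pointwise` decrease rapidly toward zero, while the entries of `integrals` stay close to 1: the mass of $f_n$ concentrates in a narrowing bump near $x = 0$, which pointwise convergence does not detect.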

At the same time: 

in Examples 2, 3, and 4 both the prelimit and the limit functions are continuous; 

in Example 3 the limit of the derivatives $\frac{\cos nx}{n}$ of the functions in the sequence $\frac{\sin nx}{n^2}$ does equal the derivative of the limit of that sequence;

in Example 1 we have $\int_0^1 f_n(x)\,dx \to \int_0^1 f(x)\,dx$ as $n \to \infty$.

Our main purpose is to determine the cases in which the limiting passage under 
the integral or derivative sign is legal. 

In this connection, let us consider some more examples. 


Example 6 We know that for any $x \in \mathbb{R}$

$$\sin x = x - \frac{1}{3!}x^3 + \frac{1}{5!}x^5 - \cdots + \frac{(-1)^m}{(2m+1)!}x^{2m+1} + \cdots, \tag{16.1}$$

but after the examples we have just considered, we understand that the relations

$$(\sin x)' = \sum_{m=0}^{\infty} \Bigl(\frac{(-1)^m x^{2m+1}}{(2m+1)!}\Bigr)', \tag{16.2}$$

$$\int_a^b \sin x\,dx = \sum_{m=0}^{\infty} \int_a^b \frac{(-1)^m x^{2m+1}}{(2m+1)!}\,dx \tag{16.3}$$

require verification in general.

Indeed, if the equality

$$S(x) = a_1(x) + a_2(x) + \cdots + a_m(x) + \cdots$$

is understood in the sense that $S(x) = \lim_{n \to \infty} S_n(x)$, where $S_n(x) = \sum_{m=1}^{n} a_m(x)$, then by the linearity of differentiation and integration, the relations

$$S'(x) = \sum_{m=1}^{\infty} a_m'(x),$$

$$\int_a^b S(x)\,dx = \sum_{m=1}^{\infty} \int_a^b a_m(x)\,dx$$

are equivalent to

$$S'(x) = \lim_{n \to \infty} S_n'(x),$$

$$\int_a^b S(x)\,dx = \lim_{n \to \infty} \int_a^b S_n(x)\,dx,$$

which we must now look upon with caution.

In this case both relations (16.2) and (16.3) can easily be verified, since it is known that

$$\cos x = 1 - \frac{1}{2!}x^2 + \frac{1}{4!}x^4 - \cdots + \frac{(-1)^m}{(2m)!}x^{2m} + \cdots.$$

However, suppose that Eq. (16.1) is the definition of the function $\sin x$. After all, that was exactly the situation with the definition of the functions $\sin z$, $\cos z$, and $e^z$ for complex values of the argument. At that time we had to get the properties of the new function (its continuity, differentiability, and integrability), as well as the legality of the equalities (16.2) and (16.3) directly from the fact that this function is the limit of the sequence of partial sums of this series.

The main concept by means of which sufficient conditions for the legality of the limiting passages will be derived in Sect. 16.3 is the concept of uniform convergence.


16.1.3 Convergence and Uniform Convergence of a Family 
of Functions Depending on a Parameter 

In our discussion of the statement of the problems above we confined ourselves to consideration of the limit of a sequence of functions. A sequence of functions is the most important special case of a family of functions $f_t(x)$ depending on a parameter $t$. It arises when $t \in \mathbb{N}$. Sequences of functions thus occupy the same place among families of functions that the theory of limits of sequences occupies in the theory of limits of functions. We shall discuss the limit of a sequence of functions and the connected theory of convergence of series of functions in Sect. 16.2. Here we shall discuss only those concepts involving functions depending on a parameter that are basic for everything that follows.

Definition 5 We call a function $(x, t) \mapsto F(x, t)$ of two variables $x$ and $t$ defined on the set $X \times T$ a family of functions depending on the parameter $t$ if one of the variables $t \in T$ is distinguished and called the parameter.

The set $T$ is called the parameter set or parameter domain, and the family itself is often written in the form $f_t(x)$ or $\{f_t;\ t \in T\}$, distinguishing the parameter explicitly.

As a rule, in this book we shall have to consider families of functions for which the parameter domain $T$ is one of the sets $\mathbb{N}$ or $\mathbb{R}$ or $\mathbb{C}$ of natural numbers, real numbers, or complex numbers or subsets of these. In general, however, the set $T$ may be a set of any nature. Thus in Examples 1-5 above we had $T = \mathbb{N}$. In Examples 1-4 we could have assumed without loss of content that the parameter $n$ is any positive number and the limit was taken over the base $n \to \infty$, $n \in \mathbb{R}_+$.

Definition 6 Let $\{f_t : X \to \mathbb{R};\ t \in T\}$ be a family of functions depending on a parameter and let $\mathcal{B}$ be a base in the set $T$ of parameter values.

If the limit $\lim_{\mathcal{B}} f_t(x)$ exists for a fixed value $x \in X$, we say that the family of functions converges at $x$.

The set of points of convergence is called the convergence set of the family of functions in a given base $\mathcal{B}$.

Definition 7 We say that the family of functions converges on the set $E \subset X$ over the base $\mathcal{B}$ if it converges over that base at each point $x \in E$.

The function $f(x) := \lim_{\mathcal{B}} f_t(x)$ on $E$ is called the limit function or the limit of the family of functions $f_t$ on the set $E$ over the base $\mathcal{B}$.


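Example 7 Let $f_t(x) = e^{-(x/t)^2}$, $x \in X = \mathbb{R}$, $t \in T = \mathbb{R} \setminus 0$, and let $\mathcal{B}$ be the base $t \to 0$. This family converges on the entire set $\mathbb{R}$, and

$$\lim_{t \to 0} f_t(x) = \begin{cases} 1, & \text{if } x = 0,\\ 0, & \text{if } x \ne 0. \end{cases}$$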


We now give two basic definitions. 


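Definition 8 We say that the family $\{f_t;\ t \in T\}$ of functions $f_t : X \to \mathbb{R}$ converges pointwise (or simply converges) on the set $E \subset X$ over the base $\mathcal{B}$ to the function $f : E \to \mathbb{R}$ if $\lim_{\mathcal{B}} f_t(x) = f(x)$ at every point $x \in E$.

In this case we shall often write $(f_t \underset{\mathcal{B}}{\longrightarrow} f \text{ on } E)$.

Definition 9 The family $\{f_t;\ t \in T\}$ of functions $f_t : X \to \mathbb{R}$ converges uniformly on the set $E \subset X$ over the base $\mathcal{B}$ to the function $f : E \to \mathbb{R}$ if for every $\varepsilon > 0$ there exists an element $B$ in the base $\mathcal{B}$ such that $|f(x) - f_t(x)| < \varepsilon$ at every value $t \in B$ and at every point $x \in E$.

In this case we shall frequently write $(f_t \underset{\mathcal{B}}{\rightrightarrows} f \text{ on } E)$.

We give also the formal expression of these important definitions:

$$(f_t \underset{\mathcal{B}}{\longrightarrow} f \text{ on } E) := \forall \varepsilon > 0\ \forall x \in E\ \exists B \in \mathcal{B}\ \forall t \in B\ \bigl(|f(x) - f_t(x)| < \varepsilon\bigr),$$

$$(f_t \underset{\mathcal{B}}{\rightrightarrows} f \text{ on } E) := \forall \varepsilon > 0\ \exists B \in \mathcal{B}\ \forall x \in E\ \forall t \in B\ \bigl(|f(x) - f_t(x)| < \varepsilon\bigr).$$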

The relation between convergence and uniform convergence resembles the rela- 
tion between continuity and uniform continuity on a set. 

To explain better the relationship between convergence and uniform convergence of a family of functions, we introduce the quantity $\Delta_t(x) = |f(x) - f_t(x)|$, which measures the deviation of the value of the function $f_t$ from the value of the function $f$ at the point $x \in E$. Let us consider also the quantity $\Delta_t = \sup_{x \in E} \Delta_t(x)$, which characterizes, roughly speaking, the maximum deviation (although there may not be a maximum) of the function $f_t$ from the corresponding values of $f$ over all $x \in E$. Thus, at every point $x \in E$ we have $\Delta_t(x) \le \Delta_t$.

In this notation these definitions obviously can be written as follows:

$$(f_t \underset{\mathcal{B}}{\longrightarrow} f \text{ on } E) := \forall x \in E\ \bigl(\Delta_t(x) \to 0 \text{ over } \mathcal{B}\bigr),$$

$$(f_t \underset{\mathcal{B}}{\rightrightarrows} f \text{ on } E) := \bigl(\Delta_t \to 0 \text{ over } \mathcal{B}\bigr).$$

It is now clear that

$$(f_t \underset{\mathcal{B}}{\rightrightarrows} f \text{ on } E) \implies (f_t \underset{\mathcal{B}}{\longrightarrow} f \text{ on } E),$$

that is, if the family $f_t$ converges uniformly to $f$ on the set $E$, it converges pointwise to $f$ on that set.

The converse is in general not true. 

Example 8 Let us consider the family of functions $f_t : I \to \mathbb{R}$ defined on the closed interval $I = \{x \in \mathbb{R} \mid 0 \le x \le 1\}$ and depending on the parameter $t \in\, ]0, 1]$. The graph of the functions $y = f_t(x)$ is shown in Fig. 16.1. It is clear that $\lim_{t \to 0} f_t(x) = 0$ at every point $x \in I$, that is, $f_t \to f \equiv 0$ as $t \to 0$. At the same time $\Delta_t = \sup_{x \in I} |f(x) - f_t(x)| = \sup_{x \in I} |f_t(x)| = 1$, that is, $\Delta_t \not\to 0$ as $t \to 0$, and hence the family converges, but not uniformly.

In such cases we shall say for convenience that the family converges nonuniformly to the limit function.

If the parameter $t$ is interpreted as time, then convergence of the family of functions $f_t$ on the set $E$ to the function $f$ means that for any preassigned precision $\varepsilon > 0$ one can exhibit a time $t_\varepsilon$ for each point $x \in E$ starting from which (that is, for $t > t_\varepsilon$) the values of all functions $f_t$ at $x$ will differ from $f(x)$ by less than $\varepsilon$.

Uniform convergence means that there is a time $t_\varepsilon$, starting from which (that is, for $t > t_\varepsilon$) the relation $|f(x) - f_t(x)| < \varepsilon$ holds for all $x \in E$.

The figure of a traveling bulge of large deviation depicted in Fig. 16.1 is typical for nonuniform convergence.


Example 9 The sequence of functions $f_n(x) = x^n - x^{2n}$ defined on the closed interval $0 \le x \le 1$, as one can see, converges to zero at each point as $n \to \infty$. To determine whether this convergence is uniform, we find the quantity $\Delta_n = \max_{0 \le x \le 1} |f_n(x)|$. Since $f_n'(x) = nx^{n-1}(1 - 2x^n) = 0$ for $x = 0$ and $x = 2^{-1/n}$, it is clear that $\Delta_n = f_n(2^{-1/n}) = 1/4$. Thus $\Delta_n \not\to 0$ as $n \to \infty$ and our sequence converges to the limit function $f(x) \equiv 0$ nonuniformly.
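The computation in Example 9 can be confirmed numerically. The sketch below is only an illustration under our own choices (the helper names, the values of $n$, and the uniform grid are assumptions): it evaluates $f_n$ at the critical point $x = 2^{-1/n}$ and, independently, takes the largest value of $|f_n|$ on a grid.

```python
def f(n, x):
    """Function of Example 9."""
    return x ** n - x ** (2 * n)


def deviation(n):
    """Value of f_n at the critical point x = 2**(-1/n) of the interval [0, 1]."""
    x_star = 2.0 ** (-1.0 / n)
    return f(n, x_star)


def grid_max(n, steps=10000):
    """Lower estimate of sup |f_n| on [0, 1] taken over a uniform grid."""
    return max(abs(f(n, k / steps)) for k in range(steps + 1))


ns = (1, 5, 50, 500)
deviations = [deviation(n) for n in ns]
grid_maxima = [grid_max(n) for n in ns]
```

Both lists stay at the level 1/4 for every $n$, although at each fixed point the values tend to zero: the bulge of height 1/4 moves toward the point $x = 1$ without decreasing.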


Example 10 The sequence of functions $f_n = x^n$ on the interval $0 \le x \le 1$ converges to the function

$$f(x) = \begin{cases} 0, & \text{if } 0 \le x < 1,\\ 1, & \text{if } x = 1 \end{cases}$$

nonuniformly, since for each $n \in \mathbb{N}$

$$\Delta_n = \sup_{0 \le x \le 1} |f(x) - f_n(x)| = \sup_{0 \le x < 1} |f(x) - f_n(x)| = \sup_{0 \le x < 1} |f_n(x)| = \sup_{0 \le x < 1} |x^n| = 1.$$

Example 11 The sequence of functions $f_n(x) = \frac{\sin n^2 x}{n}$ studied in Example 2 converges to zero uniformly on the entire set $\mathbb{R}$ as $n \to \infty$, since in this case

$$|f(x) - f_n(x)| = |f_n(x)| = \Bigl|\frac{\sin n^2 x}{n}\Bigr| \le \frac{1}{n},$$

that is, $\Delta_n \le 1/n$, and hence $\Delta_n \to 0$ as $n \to \infty$.

16.1.4 The Cauchy Criterion for Uniform Convergence 

In Definition 9 we stated what it means for a family of functions $f_t$ to converge uniformly on a set to a given function on that set. Usually, when the family of functions is defined the limit function is not yet known, so that it makes sense to adopt the following definition.

Definition 10 We shall say that the family $\{f_t;\ t \in T\}$ of functions $f_t : X \to \mathbb{R}$ converges on the set $E \subset X$ uniformly over the base $\mathcal{B}$ if it converges on that set and the convergence to the resulting limit function is uniform in the sense of Definition 9.

Theorem (Cauchy criterion for uniform convergence) Let $\{f_t;\ t \in T\}$ be a family of functions $f_t : X \to \mathbb{R}$ depending on a parameter $t \in T$, and $\mathcal{B}$ a base in $T$. A necessary and sufficient condition for the family $\{f_t;\ t \in T\}$ to converge uniformly on the set $E \subset X$ over the base $\mathcal{B}$ is that for every $\varepsilon > 0$ there exists an element $B$ of the base $\mathcal{B}$ such that $|f_{t_1}(x) - f_{t_2}(x)| < \varepsilon$ for every value of the parameters $t_1, t_2 \in B$ and every point $x \in E$.

In formal language this means that $f_t$ converges uniformly on $E$ over the base $\mathcal{B}$ $\Longleftrightarrow$ $\forall \varepsilon > 0\ \exists B \in \mathcal{B}\ \forall t_1, t_2 \in B\ \forall x \in E\ \bigl(|f_{t_1}(x) - f_{t_2}(x)| < \varepsilon\bigr)$.

Proof Necessity. The necessity of these conditions is obvious, since if $f : E \to \mathbb{R}$ is the limit function and $f_t \rightrightarrows f$ on $E$ over $\mathcal{B}$, there exists an element $B$ in the base $\mathcal{B}$ such that $|f(x) - f_t(x)| < \varepsilon/2$ for every $t \in B$ and every $x \in E$. Then for every $t_1, t_2 \in B$ and every $x \in E$ we have

$$|f_{t_1}(x) - f_{t_2}(x)| \le |f(x) - f_{t_1}(x)| + |f(x) - f_{t_2}(x)| < \varepsilon/2 + \varepsilon/2 = \varepsilon.$$

Sufficiency. For each fixed value of $x \in E$ we can regard $f_t(x)$ as a function of the variable $t \in T$. If the hypotheses of the theorem hold, then the hypotheses of the Cauchy convergence criterion for the existence of a limit over the base $\mathcal{B}$ are fulfilled.

Hence, the family $\{f_t;\ t \in T\}$ converges at least pointwise to some function $f : E \to \mathbb{R}$ on the set $E$ over the base $\mathcal{B}$.

If we now pass to the limit with respect to $t_1$ in the inequality $|f_{t_1}(x) - f_{t_2}(x)| < \varepsilon$, which is valid for any $t_1$ and $t_2 \in B$ and every $x \in E$, one can obtain the inequality $|f(x) - f_{t_2}(x)| \le \varepsilon$ for every $t_2 \in B$ and every $x \in E$, and this, up to an inessential relabeling and the change of the strict inequality to the nonstrict, coincides exactly with the definition of uniform convergence of the family $\{f_t;\ t \in T\}$ to the function $f : E \to \mathbb{R}$ on the set $E$ over the base $\mathcal{B}$. □

Remark 1 The definitions of convergence and uniform convergence that we have given for families of real-valued functions $f_t : X \to \mathbb{R}$ of course remain valid for families of functions $f_t : X \to Y$ with values in any metric space $Y$. The natural modification that one must make in the definitions in this case amounts to replacing $|f(x) - f_t(x)|$ by $d_Y(f(x), f_t(x))$, where $d_Y$ is the metric in $Y$.

For normed vector spaces $Y$, in particular for $Y = \mathbb{C}$ or $Y = \mathbb{R}^m$ or $Y = \mathbb{C}^m$, even these formal changes are not needed.

Remark 2 The Cauchy criterion of course remains valid for families of functions $f_t : X \to Y$ with values in a metric space $Y$ provided $Y$ is a complete metric space. As can be seen from the proofs, the hypothesis that $Y$ be complete is needed only for the sufficiency part of the criterion.


16.1.5 Problems and Exercises 

1. Determine whether the sequences of functions considered in Examples 3-5 con- 
verge uniformly. 

2. Prove Eqs. (16.2) and (16.3). 

3. a) Show that the sequence of functions considered in Example 1 converges uniformly on every closed interval $[0, 1 - \delta] \subset [0, 1]$, but converges nonuniformly on the interval $[0, 1[$.

b) Show that the same is true for the sequence considered in Example 9.

c) Show that the family of functions $f_t$ considered in Example 8 converges uniformly as $t \to 0$ on every closed interval $[\delta, 1] \subset [0, 1]$ but nonuniformly on $[0, 1]$.

d) Investigate the convergence and uniform convergence of the family of functions $f_t(x) = \sin(tx)$ as $t \to 0$ and then as $t \to \infty$.

e) Characterize the convergence of the family of functions $f_t(x) = e^{-tx}$ as $t \to +\infty$ on an arbitrary fixed set $E \subset \mathbb{R}$.

4. a) Verify that if a family of functions converges (resp. converges uniformly) on 
a set, then it also converges (resp. converges uniformly) on any subset of the set. 

b) Show that if the family of functions $f_t : X \to \mathbb{R}$ converges (resp. converges uniformly) on a set $E$ over a base $\mathcal{B}$ and $g : X \to \mathbb{R}$ is a bounded function, then the family $g \cdot f_t : X \to \mathbb{R}$ also converges (resp. converges uniformly) on $E$ over the base $\mathcal{B}$.

c) Prove that if the families of functions $f_t : X \to \mathbb{R}$, $g_t : X \to \mathbb{R}$ converge uniformly on $E \subset X$ over the base $\mathcal{B}$, then the family $h_t = \alpha f_t + \beta g_t$, where $\alpha, \beta \in \mathbb{R}$, also converges uniformly on $E$ over $\mathcal{B}$.

5. a) In the proof of the sufficiency of the Cauchy criterion we passed to the limit
$\lim_{\mathcal{B}} f_{t_1}(x) = f(x)$ over the base $\mathcal{B}$ in $T$. But $t_1 \in B$, and
$\mathcal{B}$ is a base in $T$, not in $B$. Can we pass to this limit in such a way that $t_1$
remains in $B$?

b) Explain where the completeness of $\mathbb{R}$ was used in the proof of the Cauchy
criterion for uniform convergence of a family of functions $f_t : X \to \mathbb{R}$.

c) Notice that if all the functions of the family $\{f_t : X \to \mathbb{R};\ t \in T\}$ are
constant, then the theorem proved above is precisely the Cauchy criterion for the existence
of the limit of the function $\varphi : T \to \mathbb{R}$ over the base $\mathcal{B}$ in $T$.

6. Prove that if the family of continuous functions $f_t \in C(I, \mathbb{R})$ on the closed
interval $I = \{x \in \mathbb{R} \mid a \le x \le b\}$ converges uniformly on the open interval
$]a, b[$, then it converges uniformly on the entire closed interval $[a, b]$.


16.2 Uniform Convergence of Series of Functions 

16.2.1 Basic Definitions and a Test for Uniform Convergence 
of a Series 

Definition 1 Let $\{a_n : X \to \mathbb{C};\ n \in \mathbb{N}\}$ be a sequence of complex-valued
(in particular real-valued) functions. The series $\sum_{n=1}^{\infty} a_n(x)$ converges or
converges uniformly on the set $E \subset X$ if the sequence
$\{s_m(x) = \sum_{n=1}^{m} a_n(x);\ m \in \mathbb{N}\}$ converges or converges uniformly on $E$.

Definition 2 The function $s_m(x) = \sum_{n=1}^{m} a_n(x)$, as in the case of numerical
series, is called the partial sum or, more precisely, the $m$th partial sum of the series
$\sum_{n=1}^{\infty} a_n(x)$.

Definition 3 The sum of the series is the limit of the sequence of its partial sums.
Thus, writing

$$ s(x) = \sum_{n=1}^{\infty} a_n(x) \quad \text{on } E $$

means that $s_m(x) \to s(x)$ on $E$ as $m \to \infty$, and writing

$$ \text{the series } \sum_{n=1}^{\infty} a_n(x) \text{ converges uniformly on } E $$

means that $s_m(x) \rightrightarrows s(x)$ on $E$ as $m \to \infty$.

Investigating the pointwise convergence of a series amounts to investigating the 
convergence of a numerical series, and we are already familiar with that. 

Example 1 Earlier we defined the function $\exp : \mathbb{C} \to \mathbb{C}$ by the relation

$$ \exp z := \sum_{n=0}^{\infty} \frac{1}{n!} z^n, \tag{16.4} $$

after first verifying that the series on the right converges for every value $z \in \mathbb{C}$.

In the language of Definitions 1-3 one can now say that the series (16.4) of
functions $a_n(z) = \frac{1}{n!} z^n$ converges on the entire complex plane, and the function
$\exp z$ is its sum.

By Definitions 1 and 2 just adopted a two-way connection is established between 
series and their sequences of partial sums: knowing the terms of the series, we obtain 
the sequence of partial sums, and knowing the sequence of partial sums, we can 
recover all the terms of the series: the nature of the convergence of the series is 
identified with the nature of the convergence of its sequence of partial sums. 

Example 2 In Example 5 of Sect. 16.1 we constructed a sequence $\{f_m;\ m \in \mathbb{N}\}$ of
functions that converge to the Dirichlet function $\mathcal{D}(x)$ on $\mathbb{R}$. If we set
$a_1(x) = f_1(x)$ and $a_n(x) = f_n(x) - f_{n-1}(x)$ for $n > 1$, we obtain a series
$\sum_{n=1}^{\infty} a_n(x)$ that will converge on the entire number line, and
$\sum_{n=1}^{\infty} a_n(x) = \mathcal{D}(x)$.

Example 3 It was shown in Example 9 of Sect. 16.1 that the sequence of functions
$f_n(x) = x^n - x^{2n}$ converges, but nonuniformly, to zero on the closed interval $[0, 1]$.
Hence, setting $a_1(x) = f_1(x)$ and $a_n(x) = f_n(x) - f_{n-1}(x)$ for $n > 1$, we obtain a
series $\sum_{n=1}^{\infty} a_n(x)$ that converges to zero on the closed interval $[0, 1]$, but
converges nonuniformly.
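
The nonuniformity in Example 3 can be checked numerically. The sketch below is only an illustration (the helper names `f` and `sup_on_grid` are ours, not the text's): at the point where $x^n = 1/2$ the value of $f_n$ is $1/4$ for every $n$, so $\sup_{[0,1]} |f_n|$ does not tend to zero, although $f_n(x) \to 0$ at each fixed $x$.

```python
def f(n: int, x: float) -> float:
    """n-th function of Example 3: f_n(x) = x^n - x^(2n)."""
    return x**n - x ** (2 * n)


def sup_on_grid(n: int, points: int = 20001) -> float:
    """Approximate sup |f_n| over [0, 1] on a uniform grid (illustrative helper)."""
    return max(abs(f(n, i / (points - 1))) for i in range(points))


if __name__ == "__main__":
    for n in (1, 5, 25, 125):
        x_star = 0.5 ** (1.0 / n)  # the point where x^n = 1/2
        # f_n(x_star) stays near 1/4, while f_n(0.9) shrinks towards 0
        print(n, f(n, x_star), sup_on_grid(n), f(n, 0.9))
```

The grid supremum is a lower estimate of the true supremum; it is used here only to make the behaviour visible.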

The direct connection between series and sequences of functions makes it pos- 
sible to restate every proposition about sequences of functions as a corresponding 
proposition about series of functions. 

Thus, in application to the sequence $\{s_n : X \to \mathbb{C};\ n \in \mathbb{N}\}$ the Cauchy
criterion proved in Sect. 16.1 for uniform convergence of a sequence on a set $E \subset X$
means that

$$ \forall \varepsilon > 0\ \exists N \in \mathbb{N}\ \forall n_1, n_2 > N\ \forall x \in E\
\bigl(|s_{n_1}(x) - s_{n_2}(x)| < \varepsilon\bigr). \tag{16.5} $$

From this, taking account of Definition 1, we obtain the following theorem.

Theorem 1 (Cauchy criterion for uniform convergence of a series) The series
$\sum_{n=1}^{\infty} a_n(x)$ converges uniformly on a set $E$ if and only if for every
$\varepsilon > 0$ there exists $N \in \mathbb{N}$ such that

$$ |a_n(x) + \cdots + a_m(x)| < \varepsilon, \tag{16.6} $$

for all natural numbers $m, n$ satisfying $m \ge n > N$ and every point $x \in E$.

Proof Indeed, setting $n_1 = m$, $n_2 = n - 1$ in (16.5) and assuming that $s_n(x)$ is the
partial sum of the series, we obtain inequality (16.6), from which relation (16.5) in
turn follows with the same notation and hypotheses of the theorem. □

Remark 1 We did not mention the range of values of the functions $a_n(x)$ in the
statement of Theorem 1, taking for granted that it was $\mathbb{R}$ or $\mathbb{C}$. But actually the
range of values could obviously be any normed vector space, for example, $\mathbb{R}^n$ or
$\mathbb{C}^n$, provided only that the space is complete.

Remark 2 If under the hypotheses of Theorem 1 all the functions $a_n(x)$ are constant,
we obtain the familiar Cauchy criterion for convergence of a numerical series
$\sum_{n=1}^{\infty} a_n$.

Corollary 1 (Necessary condition for uniform convergence of a series) A necessary
condition for the series $\sum_{n=1}^{\infty} a_n(x)$ to converge uniformly on a set $E$ is
that $a_n \rightrightarrows 0$ on $E$ as $n \to \infty$.

Proof This follows from the definition of uniform convergence of a sequence to
zero and inequality (16.6) if we set $m = n$ in it. □

Example 4 The series (16.4) converges on the complex plane $\mathbb{C}$ nonuniformly, since
$\sup_{z \in \mathbb{C}} \bigl|\frac{1}{n!} z^n\bigr| = \infty$ for every $n \in \mathbb{N}$, while by the
necessary condition for uniform convergence, the quantity $\sup_{x \in E} |a_n(x)|$ must tend
to zero when uniform convergence occurs.

Example 5 The series $\sum_{n=1}^{\infty} \frac{z^n}{n}$, as we know, converges in the unit disk
$K = \{z \in \mathbb{C} \mid |z| < 1\}$. Since $\bigl|\frac{z^n}{n}\bigr| < \frac{1}{n}$ for $z \in K$,
we have $\frac{z^n}{n} \rightrightarrows 0$ on $K$ as $n \to \infty$. The necessary condition for
uniform convergence is satisfied; however, this series converges nonuniformly on $K$.
In fact, for any fixed $n \in \mathbb{N}$, by the continuity of the terms of the series, if $z$
is sufficiently close to 1, we can get the inequalities

$$ \left|\frac{z^n}{n} + \cdots + \frac{z^{2n}}{2n}\right| >
\frac{1}{2}\left|\frac{1}{n} + \cdots + \frac{1}{2n}\right| > \frac{1}{4}. $$

From this we conclude by the Cauchy criterion that the series does not converge
uniformly on $K$.
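
The failure of the Cauchy criterion in Example 5 can be observed directly. In the sketch below the point $1 - 1/(10n)$ is our own choice of a point "sufficiently close to 1" (an assumption for illustration, not a value from the text); at that point the block $z^n/n + \cdots + z^{2n}/(2n)$ stays above $1/4$ for every $n$ tried.

```python
def block(n: int, z: complex) -> complex:
    """Cauchy block z^n/n + ... + z^(2n)/(2n) of the series sum z^k / k."""
    return sum(z**k / k for k in range(n, 2 * n + 1))


def witness(n: int) -> float:
    """A point of the unit disk close to 1; 1 - 1/(10 n) is an illustrative choice."""
    return 1.0 - 1.0 / (10 * n)


if __name__ == "__main__":
    for n in (1, 10, 100, 1000):
        # the block does not become small, however large n is
        print(n, witness(n), abs(block(n, witness(n))))
    # at a fixed point of the disk the same block does tend to zero
    print(abs(block(50, 0.5)))
```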


16.2.2 The Weierstrass M-Test for Uniform Convergence of a Series


Definition 4 The series $\sum_{n=1}^{\infty} a_n(x)$ converges absolutely on the set $E$ if
the corresponding numerical series converges absolutely at each point $x \in E$.

Proposition 1 If the series $\sum_{n=1}^{\infty} a_n(x)$ and $\sum_{n=1}^{\infty} b_n(x)$ are such
that $|a_n(x)| \le b_n(x)$ for every $x \in E$ and for all sufficiently large indices
$n \in \mathbb{N}$, then the uniform convergence of the series $\sum_{n=1}^{\infty} b_n(x)$ on $E$
implies the absolute and uniform convergence of the series $\sum_{n=1}^{\infty} a_n(x)$ on the
same set $E$.

Proof Under these assumptions for all sufficiently large indices $n$ and $m$ (let
$n \le m$) at each point $x \in E$ we have

$$ |a_n(x) + \cdots + a_m(x)| \le |a_n(x)| + \cdots + |a_m(x)| \le
b_n(x) + \cdots + b_m(x) = |b_n(x) + \cdots + b_m(x)|. $$

By the Cauchy criterion and the uniform convergence of the series
$\sum_{n=1}^{\infty} b_n(x)$, for each $\varepsilon > 0$ we can exhibit an index $N \in \mathbb{N}$ such
that $|b_n(x) + \cdots + b_m(x)| < \varepsilon$ for all $m \ge n > N$ and all $x \in E$. But then it
follows from the inequalities just written and the Cauchy criterion that the series
$\sum_{n=1}^{\infty} a_n(x)$ and $\sum_{n=1}^{\infty} |a_n(x)|$ both converge uniformly. □

Corollary 2 (Weierstrass' M-test for uniform convergence of a series) If for the
series $\sum_{n=1}^{\infty} a_n(x)$ one can exhibit a convergent numerical series
$\sum_{n=1}^{\infty} M_n$ such that $\sup_{x \in E} |a_n(x)| \le M_n$ for all sufficiently large
indices $n \in \mathbb{N}$, then the series $\sum_{n=1}^{\infty} a_n(x)$ converges absolutely and
uniformly on the set $E$.

Proof The convergent numerical series can be regarded as a series of constant functions
on the set $E$, which by the Cauchy criterion converges uniformly on $E$. Hence
the Weierstrass test follows from Proposition 1 if we set $b_n(x) = M_n$ in it. □

The Weierstrass M-test is the simplest and at the same time the most frequently 
used sufficient condition for uniform convergence of a series. 

As an example of its application, we prove the following useful fact. 

Proposition 2 If a power series $\sum_{n=0}^{\infty} c_n (z - z_0)^n$ converges at a point
$\zeta \ne z_0$, then it converges absolutely and uniformly in any disk
$K_q = \{z \in \mathbb{C} \mid |z - z_0| < q|\zeta - z_0|\}$, where $0 < q < 1$.

Proof By the necessary condition for convergence of a numerical series it follows
from the convergence of the series $\sum_{n=0}^{\infty} c_n (\zeta - z_0)^n$ that
$c_n (\zeta - z_0)^n \to 0$ as $n \to \infty$. Hence for all sufficiently large values of
$n \in \mathbb{N}$ we have the estimates

$$ |c_n (z - z_0)^n| = |c_n (\zeta - z_0)^n| \cdot \left|\frac{z - z_0}{\zeta - z_0}\right|^n
\le |c_n (\zeta - z_0)^n| \cdot q^n < q^n $$

in the disk $K_q$. Since the series $\sum_{n=0}^{\infty} q^n$ converges for $|q| < 1$, the
estimates $|c_n (z - z_0)^n| < q^n$ and the Weierstrass M-test now imply Proposition 2. □

Comparing this proposition with the Cauchy-Hadamard formula for the radius
of convergence of a power series (see Eq. (5.115)), we arrive at the following conclusion.

Theorem 2 (Nature of convergence of a power series) A power series
$\sum_{n=0}^{\infty} c_n (z - z_0)^n$ converges in the disk
$K = \{z \in \mathbb{C} \mid |z - z_0| < R\}$ whose radius of convergence is determined by the
Cauchy-Hadamard formula$^1$ $R = \bigl(\varlimsup_{n \to \infty} \sqrt[n]{|c_n|}\bigr)^{-1}$.
Outside this disk the series diverges. On any closed disk contained in the interior of the
disk $K$ of convergence of the series, a power series converges absolutely and uniformly.

Remark 3 As Examples 1 and 5 show, the power series need not converge uniformly
on the entire disk $K$. At the same time, it may happen that the power series does
converge uniformly even on the closed disk $\overline{K}$.

Example 6 The radius of convergence of the series $\sum_{n=1}^{\infty} \frac{z^n}{n^2}$ is 1.
But if $|z| \le 1$, then $\bigl|\frac{z^n}{n^2}\bigr| \le \frac{1}{n^2}$, and by the Weierstrass
M-test this series converges absolutely and uniformly in the closed disk
$\overline{K} = \{z \in \mathbb{C} \mid |z| \le 1\}$.
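
As a numerical companion to the Weierstrass M-test, the following sketch takes the series $\sum z^n/n^2$ with the majorant $M_n = 1/n^2$ and compares, at sample points of the circle $|z| = 1$, the difference of two partial sums with the corresponding piece of the majorant series. The function names and the number of sample points are illustrative assumptions.

```python
import cmath
import math


def partial_sum(z: complex, n_terms: int) -> complex:
    """Partial sum s_N(z) of the series sum_{n>=1} z^n / n^2."""
    return sum(z**n / n**2 for n in range(1, n_terms + 1))


def majorant_tail(n_from: int, n_to: int) -> float:
    """M_{N+1} + ... + M_M for the majorant series with M_n = 1/n^2."""
    return sum(1.0 / n**2 for n in range(n_from + 1, n_to + 1))


def worst_gap(n_from: int, n_to: int, samples: int = 60) -> float:
    """Largest |s_M(z) - s_N(z)| over sample points z on the circle |z| = 1."""
    points = [cmath.exp(2j * math.pi * j / samples) for j in range(samples)]
    return max(abs(partial_sum(z, n_to) - partial_sum(z, n_from)) for z in points)


if __name__ == "__main__":
    for n_from in (5, 20, 80):
        # the gap is controlled by the majorant tail, which is below 1/N
        print(n_from, worst_gap(n_from, 1000), majorant_tail(n_from, 1000), 1.0 / n_from)
```

The bound by the majorant tail does not depend on $z$, which is exactly what makes the convergence uniform.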


16.2.3 The Abel-Dirichlet Test 

The following pairs of related sufficient conditions for uniform convergence of a 
series are somewhat more specialized and are essentially connected with the real- 
valuedness of certain components of the series under consideration. But these con- 
ditions are more delicate than the Weierstrass M-test, since they make it possible to 
investigate series that converge, but nonabsolutely. 

Definition 5 The family $\mathcal{F}$ of functions $f : X \to \mathbb{C}$ is uniformly bounded on a
set $E \subset X$ if there exists a number $M \in \mathbb{R}$ such that $\sup_{x \in E} |f(x)| \le M$
for every $f \in \mathcal{F}$.


$^1$ In the exceptional case when $\varlimsup_{n \to \infty} \sqrt[n]{|c_n|} = \infty$, we take
$R = 0$ and the disk $K$ degenerates to the single point $z_0$.

Definition 6 The sequence of functions $\{b_n : X \to \mathbb{R};\ n \in \mathbb{N}\}$ is called
nondecreasing (resp. nonincreasing) on the set $E \subset X$ if the numerical sequence
$\{b_n(x);\ n \in \mathbb{N}\}$ is nondecreasing (resp. nonincreasing) for every $x \in E$.
Nondecreasing and nonincreasing sequences of functions on a set are called monotonic
sequences on the set.

We recall (if necessary, see Sect. 5.2.3) the following identity, called Abel's
transformation:

$$ \sum_{k=n}^{m} a_k b_k = A_m b_m - A_{n-1} b_n + \sum_{k=n}^{m-1} A_k (b_k - b_{k+1}),
\tag{16.7} $$

where $a_k = A_k - A_{k-1}$, $k = n, \ldots, m$.

If $b_n, b_{n+1}, \ldots, b_m$ is a monotonic sequence of real numbers, then, even if
$a_n, a_{n+1}, \ldots, a_m$ are complex numbers or vectors of a normed space, one can obtain
the following estimate, which we need, from the identity (16.7):

$$ \left|\sum_{k=n}^{m} a_k b_k\right| \le 4 \max_{n-1 \le k \le m} |A_k| \cdot
\max\{|b_n|, |b_m|\}. \tag{16.8} $$

Proof In fact,

$$
\begin{aligned}
|A_m b_m| + |A_{n-1} b_n| + \Bigl|\sum_{k=n}^{m-1} A_k (b_k - b_{k+1})\Bigr|
&\le \max_{n-1 \le k \le m} |A_k| \cdot \Bigl(|b_m| + |b_n| + \sum_{k=n}^{m-1} |b_k - b_{k+1}|\Bigr) = \\
&= \max_{n-1 \le k \le m} |A_k| \cdot \bigl(|b_m| + |b_n| + |b_n - b_m|\bigr) \le \\
&\le 4 \max_{n-1 \le k \le m} |A_k| \cdot \max\{|b_n|, |b_m|\}.
\end{aligned}
$$

In the equality that occurs in this computation we used the monotonicity of the
numerical sequence $b_k$. □
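
Abel's transformation (16.7) and the estimate (16.8) are easy to test on random data. The following is a minimal sketch under our own conventions: the lists `a` and `b` hold $a_n, \ldots, a_m$ and $b_n, \ldots, b_m$, and `A_start` plays the role of $A_{n-1}$, which may be chosen arbitrarily since only the differences $A_k - A_{k-1}$ are prescribed.

```python
import random


def abel_transform(a, b, A_start=0.0):
    """Return (lhs, rhs, A) for Abel's transformation (16.7).

    a, b hold a_n..a_m and b_n..b_m; A_start plays the role of A_{n-1}.
    A is the list [A_{n-1}, A_n, ..., A_m] with A_k = A_{k-1} + a_k.
    """
    A = [A_start]
    for term in a:
        A.append(A[-1] + term)
    lhs = sum(x * y for x, y in zip(a, b))
    # A[j + 1] is A_k, b[j] is b_k and b[j + 1] is b_{k+1} for k = n + j
    rhs = A[-1] * b[-1] - A[0] * b[0] + sum(
        A[j + 1] * (b[j] - b[j + 1]) for j in range(len(b) - 1)
    )
    return lhs, rhs, A


if __name__ == "__main__":
    random.seed(0)
    a = [complex(random.uniform(-1, 1), random.uniform(-1, 1)) for _ in range(50)]
    b = sorted(random.uniform(-2, 2) for _ in range(50))  # monotonic real sequence
    lhs, rhs, A = abel_transform(a, b, A_start=0.3)
    bound = 4 * max(abs(x) for x in A) * max(abs(b[0]), abs(b[-1]))
    print(abs(lhs - rhs), abs(lhs), bound)
```

The identity holds for any sequence $b_k$; the monotonicity of $b_k$ is needed only for the bound.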



Proposition 3 (The Abel-Dirichlet test for uniform convergence) A sufficient condition
for uniform convergence on $E$ of a series $\sum_{n=1}^{\infty} a_n(x) b_n(x)$ whose terms
are products of complex-valued functions $a_n : X \to \mathbb{C}$ and real-valued functions
$b_n : X \to \mathbb{R}$ is that either of the following pairs of hypotheses be satisfied:

$\alpha_1$) the partial sums $s_k(x) = \sum_{n=1}^{k} a_n(x)$ of the series
$\sum_{n=1}^{\infty} a_n(x)$ are uniformly bounded on $E$;

$\beta_1$) the sequence of functions $b_n(x)$ tends monotonically and uniformly to zero
on $E$;

or

$\alpha_2$) the series $\sum_{n=1}^{\infty} a_n(x)$ converges uniformly on $E$;

$\beta_2$) the sequence of functions $b_n(x)$ is monotonic and uniformly bounded on $E$.

Proof The monotonicity of the sequence $b_n(x)$ allows us to write an estimate analogous
to (16.8) for each $x \in E$:

$$ \left|\sum_{k=n}^{m} a_k(x) b_k(x)\right| \le 4 \max_{n-1 \le k \le m} |A_k(x)| \cdot
\max\{|b_n(x)|, |b_m(x)|\}, \tag{16.8'} $$

where we take $s_k(x) - s_{n-1}(x)$ as $A_k(x)$.

If the hypotheses $\alpha_1$) and $\beta_1$) hold, then, on the one hand, there exists a
constant $M$ such that $|A_k(x)| \le M$ for all $k \in \mathbb{N}$ and all $x \in E$, while on the
other hand, for any number $\varepsilon > 0$ we have
$\max\{|b_n(x)|, |b_m(x)|\} < \frac{\varepsilon}{4M}$ for all sufficiently large $n$ and $m$ and
all $x \in E$. Hence it follows from (16.8) that
$\bigl|\sum_{k=n}^{m} a_k(x) b_k(x)\bigr| < \varepsilon$ for all sufficiently large $n$ and $m$ and
all $x \in E$, that is, the Cauchy criterion holds for this series.

In the case of hypotheses $\alpha_2$) and $\beta_2$) the quantity
$\max\{|b_n(x)|, |b_m(x)|\}$ is bounded. At the same time, by the uniform convergence of the
series $\sum_{n=1}^{\infty} a_n(x)$ and the Cauchy criterion, for every $\varepsilon > 0$ we have
$|A_k(x)| = |s_k(x) - s_{n-1}(x)| < \varepsilon$ for all sufficiently large $n$ and $k > n$ and
all $x \in E$. Taking this into account, we again conclude from (16.8) that the Cauchy
criterion for uniform convergence holds for this series. □

Remark 4 In the case when the functions $a_n$ and $b_n$ are constants Proposition 3
becomes the Abel-Dirichlet criterion for convergence of numerical series.


Example 7 Let us consider the convergence of the series

$$ \sum_{n=1}^{\infty} \frac{1}{n^{\alpha}} e^{inx}. \tag{16.9} $$

Since

$$ \left|\frac{1}{n^{\alpha}} e^{inx}\right| = \frac{1}{n^{\alpha}}, \tag{16.10} $$

the necessary condition for uniform convergence does not hold for the series (16.9)
when $\alpha \le 0$, and it diverges for every $x \in \mathbb{R}$. Thus we shall assume
$\alpha > 0$ from now on.

If $\alpha > 1$, we conclude from the Weierstrass M-test and (16.10) that the series
(16.9) converges absolutely and uniformly on the entire real line $\mathbb{R}$.

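To study the convergence for $0 < \alpha \le 1$ we use the Abel-Dirichlet test, setting
$a_n(x) = e^{inx}$ and $b_n(x) = \frac{1}{n^{\alpha}}$. Since the constant functions $b_n(x)$ are
monotonic when $\alpha > 0$ and obviously tend to zero uniformly for $x \in \mathbb{R}$, it remains
only to investigate the partial sums of the series $\sum_{n=1}^{\infty} e^{inx}$.

For convenience in citing results below, we shall consider the sums
$\sum_{k=0}^{n} e^{ikx}$, which differ from the sums of our series only in the first term,
which is 1.

Using the formula for the sum of a finite geometric series and Euler's formula,
we obtain successively for $x \ne 2\pi m$, $m \in \mathbb{Z}$,

$$ \sum_{k=0}^{n} e^{ikx} = \frac{e^{i(n+1)x} - 1}{e^{ix} - 1}
= \frac{\sin\frac{n+1}{2}x}{\sin\frac{x}{2}} \cdot \frac{e^{i\frac{n+1}{2}x}}{e^{i\frac{x}{2}}}
= \frac{\sin\frac{n+1}{2}x}{\sin\frac{x}{2}}\, e^{i\frac{n}{2}x}
= \frac{\sin\frac{n+1}{2}x}{\sin\frac{x}{2}} \Bigl(\cos\frac{n}{2}x + i\sin\frac{n}{2}x\Bigr).
\tag{16.11} $$

Hence, for every $n \in \mathbb{N}$

$$ \left|\sum_{k=0}^{n} e^{ikx}\right| \le \frac{1}{|\sin\frac{x}{2}|}, \tag{16.12} $$

from which it follows by the Abel-Dirichlet criterion that for $0 < \alpha \le 1$ the series
(16.9) converges uniformly on every set $E \subset \mathbb{R}$ on which
$\inf_{x \in E} |\sin\frac{x}{2}| > 0$. In particular the series (16.9) simply converges for
every $x \ne 2\pi m$, $m \in \mathbb{Z}$. If $x = 2\pi m$, then $e^{in2\pi m} = 1$, and the series
(16.9) becomes the numerical series $\sum_{n=1}^{\infty} \frac{1}{n^{\alpha}}$, which diverges
for $0 < \alpha \le 1$.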

We shall show that from what has been said, one can conclude that for $0 < \alpha \le 1$
the series (16.9) cannot converge uniformly on any set $E$ whose closure contains
points of the form $2\pi m$, $m \in \mathbb{Z}$. For definiteness, suppose $0 \in \overline{E}$. The
series $\sum_{n=1}^{\infty} \frac{1}{n^{\alpha}}$ diverges for $0 < \alpha \le 1$. By the Cauchy
criterion, there exists $\varepsilon_0 > 0$ such that for every $N \in \mathbb{N}$, no matter how
large, one can find numbers $m \ge n > N$ such that
$\bigl|\frac{1}{n^{\alpha}} + \cdots + \frac{1}{m^{\alpha}}\bigr| > \varepsilon_0 > 0$. By the
continuity of the functions $e^{ikx}$ on $\mathbb{R}$, it follows that one can choose a point
$x \in E$ close enough to 0 so that

$$ \left|\frac{e^{inx}}{n^{\alpha}} + \cdots + \frac{e^{imx}}{m^{\alpha}}\right| > \varepsilon_0. $$

But by the Cauchy criterion for uniform convergence this means that the series
(16.9) cannot converge uniformly on $E$.

To supplement what has just been said, we note that, as one can see from (16.10),
the series (16.9) converges nonabsolutely for $0 < \alpha \le 1$.

Remark 5 It is useful for what follows to remark that, separating the real and imaginary
parts in (16.11), we obtain the following relations:

$$ \sum_{k=0}^{n} \cos kx = \frac{\cos\frac{n}{2}x \cdot \sin\frac{n+1}{2}x}{\sin\frac{x}{2}},
\tag{16.13} $$

$$ \sum_{k=0}^{n} \sin kx = \frac{\sin\frac{n}{2}x \cdot \sin\frac{n+1}{2}x}{\sin\frac{x}{2}},
\tag{16.14} $$

which hold for $x \ne 2\pi m$, $m \in \mathbb{Z}$.
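
The closed form $\sum_{k=0}^{n} e^{ikx} = \frac{\sin\frac{n+1}{2}x}{\sin\frac{x}{2}}\, e^{i\frac{n}{2}x}$, its real and imaginary parts, and the bound by $1/|\sin\frac{x}{2}|$ can be compared with a direct summation. This is only a numerical sanity check; the function names are ours.

```python
import cmath
import math


def exp_sum(n: int, x: float) -> complex:
    """Direct evaluation of sum_{k=0}^{n} e^{ikx}."""
    return sum(cmath.exp(1j * k * x) for k in range(n + 1))


def closed_form(n: int, x: float) -> complex:
    """sin((n+1)x/2) / sin(x/2) * e^{inx/2}; requires x != 2*pi*m."""
    return math.sin((n + 1) * x / 2) / math.sin(x / 2) * cmath.exp(1j * n * x / 2)


if __name__ == "__main__":
    for n in (1, 7, 40):
        for x in (0.3, 2.0, -4.5):
            s = exp_sum(n, x)
            # last two columns: modulus of the sum and the bound 1/|sin(x/2)|
            print(n, x, abs(s - closed_form(n, x)), abs(s), 1 / abs(math.sin(x / 2)))
```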


As another example of the use of the Abel-Dirichlet test we prove the following 
proposition. 

Proposition 4 (The so-called second Abel theorem on power series) If a power
series $\sum_{n=0}^{\infty} c_n (z - z_0)^n$ converges at a point $\zeta \in \mathbb{C}$, then it
converges uniformly on the closed interval with endpoints $z_0$ and $\zeta$.

Proof We represent the points of this interval in the form $z_0 + (\zeta - z_0)t$, where
$0 \le t \le 1$. Substituting this expression in the power series, we obtain the series
$\sum_{n=0}^{\infty} c_n (\zeta - z_0)^n t^n$. By hypothesis, the numerical series
$\sum_{n=0}^{\infty} c_n (\zeta - z_0)^n$ converges, and the sequence of functions $t^n$ is
monotonic and uniformly bounded on the closed interval $[0, 1]$. Hence conditions
$\alpha_2$) and $\beta_2$) in the Abel-Dirichlet test are satisfied, and the proposition is
proved. □


16.2.4 Problems and Exercises 

1. Investigate the nature of the convergence on the sets $E \subset \mathbb{R}$ for different
values of the real parameter $\alpha$ in the following series:

a) $\sum_{n=1}^{\infty} \frac{\cos nx}{n^{\alpha}}$.

b) $\sum_{n=1}^{\infty} \frac{\sin nx}{n^{\alpha}}$.

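2. Prove that the following series converge uniformly on the indicated sets:

a) $\sum_{n=1}^{\infty} \frac{(-1)^n}{n} x^n$ for $0 \le x \le 1$.

b) $\sum_{n=1}^{\infty} \frac{(-1)^n}{n} e^{-nx}$ for $0 \le x < +\infty$.

c) $\sum_{n=1}^{\infty} \frac{(-1)^n}{n + x}$ for $0 \le x < +\infty$.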

3. Show that if a Dirichlet series $\sum_{n=1}^{\infty} \frac{c_n}{n^x}$ converges at a point
$x_0 \in \mathbb{R}$, then it converges uniformly on the set $x \ge x_0$ and absolutely if
$x > x_0 + 1$.

4. Verify that the series $\sum_{n=1}^{\infty} \frac{(-1)^{n-1} x^2}{(1 + x^2)^n}$ converges
uniformly on $\mathbb{R}$, and the series $\sum_{n=1}^{\infty} \frac{x^2}{(1 + x^2)^n}$ converges
on $\mathbb{R}$, but nonuniformly.

5. a) Using the example of the series from Problem 2, show that the Weierstrass
M-test is a sufficient condition but not a necessary one for the uniform convergence
of a series.

b) Construct a series $\sum_{n=1}^{\infty} a_n(x)$ with nonnegative terms that are continuous
on the closed interval $0 \le x \le 1$ and which converges uniformly on that closed interval,
while the series $\sum_{n=1}^{\infty} M_n$ formed from the quantities
$M_n = \max_{0 \le x \le 1} |a_n(x)|$ diverges.

6. a) State the Abel-Dirichlet test for convergence of a series mentioned in Remark 4.

b) Show that the condition that $\{b_n\}$ be monotonic in the Abel-Dirichlet test
can be weakened slightly, requiring only that the sequence $\{b_n\}$ be monotonic up to
corrections $\{\beta_n\}$ forming an absolutely convergent series.

7. As a supplement to Proposition 4 show, following Abel, that if a power series
converges at a boundary point of the disk of convergence, its sum has a limit in that
disk when the point is approached along any direction not tangential to the boundary
circle.


16.3 Functional Properties of a Limit Function

16.3.1 Specifics of the Problem

In this section we shall give answers to the questions posed in Sect. 16.1 as to when 
the limit of a family of continuous, differentiable, or integrable functions is a func- 
tion having the same property, and when the limit of the derivatives or integrals of 
the functions equals the derivative or integral of the limiting function of the family. 

To explain the mathematical content of these questions, let us consider, for ex- 
ample, the connection between continuity and passage to the limit. 

Let $f_n(x) \to f(x)$ on $\mathbb{R}$ as $n \to \infty$, and suppose that all the functions in the
sequence $\{f_n;\ n \in \mathbb{N}\}$ are continuous at the point $x_0 \in \mathbb{R}$. We are interested
in the continuity of the limit function $f$ at the same point $x_0$. To answer that question,
we need to verify the equality $\lim_{x \to x_0} f(x) = f(x_0)$, which in terms of the original
sequence can be rewritten as the relation
$\lim_{x \to x_0} \bigl(\lim_{n \to \infty} f_n(x)\bigr) = \lim_{n \to \infty} f_n(x_0)$,
or, taking account of the given continuity of $f_n$ at $x_0$, as the following relation,
subject to verification:

$$ \lim_{x \to x_0} \Bigl(\lim_{n \to \infty} f_n(x)\Bigr) =
\lim_{n \to \infty} \Bigl(\lim_{x \to x_0} f_n(x)\Bigr). \tag{16.15} $$

On the left-hand side here the limit is first taken over the base $n \to \infty$, then over
the base $x \to x_0$, while on the right-hand side the limits over the same bases are
taken in the opposite order.

When studying functions of several variables we saw that Eq. (16.15) is by no 
means always true. We also saw this in the examples studied in the two preceding 
sections, which show that the limit of a sequence of continuous functions is not 
always continuous. 
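
A concrete instance in which the two repeated limits differ is provided by $f_n(x) = x^n$ on $[0, 1]$ with $x_0 = 1$; this particular sequence is our choice for illustration. Taking $n \to \infty$ first gives 0 for every $x < 1$, so the subsequent limit as $x \to 1$ is 0, while taking $x \to 1$ first gives $f_n(1) = 1$ for every $n$. The sketch below tabulates the values.

```python
def f(n: int, x: float) -> float:
    """f_n(x) = x^n on [0, 1] (illustrative sequence)."""
    return x**n


if __name__ == "__main__":
    # Rows: x approaching 1 from the left; columns: growing n.
    # Along each row with x < 1 the values tend to 0,
    # while down each column they tend to f_n(1) = 1.
    for x in (0.9, 0.99, 0.999, 1.0):
        print(x, [f(n, x) for n in (10, 100, 1000, 10000)])
```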

Differentiation and integration are special operations involving passage to the 
limit. Hence the question whether we get the same result if we first differentiate 
(or integrate) the functions of a family, then pass to the limit over the parameter 
of the family or first find the limit function of the family and then differentiate (or 
integrate) again reduces to verifying the possibility of changing the order of two
limiting passages.


16.3.2 Conditions for Two Limiting Passages to Commute

We shall show that if at least one of two limiting passages is uniform, then the 
limiting passages commute. 

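Theorem 1 Let $\{F_t;\ t \in T\}$ be a family of functions $F_t : X \to \mathbb{C}$ depending on a
parameter $t$; let $\mathcal{B}_X$ be a base in $X$ and $\mathcal{B}_T$ a base in $T$. If the family
converges uniformly on $X$ over the base $\mathcal{B}_T$ to a function $F : X \to \mathbb{C}$ and the
limit $\lim_{\mathcal{B}_X} F_t(x) = A_t$ exists for each $t \in T$, then both repeated limits
$\lim_{\mathcal{B}_X} \bigl(\lim_{\mathcal{B}_T} F_t(x)\bigr)$ and
$\lim_{\mathcal{B}_T} \bigl(\lim_{\mathcal{B}_X} F_t(x)\bigr)$ exist and the equality

$$ \lim_{\mathcal{B}_X} \Bigl(\lim_{\mathcal{B}_T} F_t(x)\Bigr) =
\lim_{\mathcal{B}_T} \Bigl(\lim_{\mathcal{B}_X} F_t(x)\Bigr) \tag{16.16} $$

holds.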


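This theorem can be conveniently written as the following diagram

$$
\begin{array}{ccc}
F_t(x) & \underset{\mathcal{B}_T}{\rightrightarrows} & F(x) \\
{\scriptstyle \mathcal{B}_X}\big\downarrow & & \big\downarrow{\scriptstyle \mathcal{B}_X} \\
A_t & \underset{\mathcal{B}_T}{\longrightarrow} & A
\end{array}
\tag{16.17}
$$

in which the hypotheses are written above the diagonal (the upper and left-hand sides)
and the consequences below it (the right-hand and lower sides). Equality (16.16) means
that this diagram is commutative, that is, the final result $A$ is the same whether the
operations corresponding to passage over the upper and right-hand sides are carried out
or one first passes down the left-hand side and then to the right over the lower side.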

Let us prove this theorem. 

Proof Since $F_t \rightrightarrows F$ on $X$ over $\mathcal{B}_T$, by the Cauchy criterion, for every
$\varepsilon > 0$ there exists $B_T$ in $\mathcal{B}_T$ such that

$$ |F_{t_1}(x) - F_{t_2}(x)| < \varepsilon \tag{16.18} $$

for every $t_1, t_2 \in B_T$ and every $x \in X$.

Passing to the limit over $\mathcal{B}_X$ in this inequality, we obtain the relation

$$ |A_{t_1} - A_{t_2}| \le \varepsilon, \tag{16.19} $$

which holds for every $t_1, t_2 \in B_T$. By the Cauchy criterion for existence of the limit
of a function it now follows that $A_t$ has a certain limit $A$ over $\mathcal{B}_T$. We now verify
that $A = \lim_{\mathcal{B}_X} F(x)$.

Fixing $t_2 \in B_T$, we find an element $B_X$ in $\mathcal{B}_X$ such that

$$ |F_{t_2}(x) - A_{t_2}| < \varepsilon \tag{16.20} $$

for all $x \in B_X$.

Keeping $t_2$ fixed, we pass to the limit in (16.18) and (16.19) over $\mathcal{B}_T$ with respect
to $t_1$. We then find

$$ |F(x) - F_{t_2}(x)| \le \varepsilon, \tag{16.21} $$

$$ |A - A_{t_2}| \le \varepsilon, \tag{16.22} $$

and (16.21) holds for all $x \in X$.

Comparing (16.20)-(16.22), and using the triangle inequality, we find

$$ |F(x) - A| < 3\varepsilon $$

for every $x \in B_X$. We have thus verified that $A = \lim_{\mathcal{B}_X} F(x)$. □

Remark 1 As the proof shows, Theorem 1 remains valid for functions $F_t : X \to Y$
with values in any complete metric space.

Remark 2 If we add the requirement that the limit $\lim_{\mathcal{B}_T} A_t = A$ exists to the
hypotheses of Theorem 1, then, as the proof shows, the equality
$\lim_{\mathcal{B}_X} F(x) = A$ can be obtained even without assuming that the space $Y$ of
values of the functions $F_t : X \to Y$ is complete.


16.3.3 Continuity and Passage to the Limit 


We shall show that if functions that are continuous at a point of a set converge 
uniformly on that set, then the limit function is also continuous at that point. 

Theorem 2 Let $\{f_t;\ t \in T\}$ be a family of functions $f_t : X \to \mathbb{C}$ depending on the
parameter $t$; let $\mathcal{B}$ be a base in $T$. If $f_t \rightrightarrows f$ on $X$ over the base
$\mathcal{B}$ and the functions $f_t$ are continuous at $x_0 \in X$, then the function
$f : X \to \mathbb{C}$ is also continuous at that point.

Proof In this case the diagram (16.17) assumes the following specific form:

$$
\begin{array}{ccc}
f_t(x) & \underset{\mathcal{B}}{\rightrightarrows} & f(x) \\
{\scriptstyle x \to x_0}\big\downarrow & & \big\downarrow{\scriptstyle x \to x_0} \\
f_t(x_0) & \underset{\mathcal{B}}{\longrightarrow} & f(x_0)
\end{array}
$$

Here all the limiting passages except the vertical passage on the right are defined
by the hypotheses of Theorem 2 itself. The nontrivial conclusion of Theorem 1 that
we need is precisely that $\lim_{x \to x_0} f(x) = f(x_0)$. □

Remark 3 We have not said anything specific as to the nature of the set $X$. In fact it
may be any topological space provided the base $x \to x_0$ is defined in it. The values
of the functions $f_t$ may lie in any metric space, which, as follows from Remark 2,
need not even be complete.

Corollary 1 If a sequence of functions that are continuous on a set converges uni- 
formly on that set, then the limit function is continuous on the set. 

Corollary 2 If a series of functions that are continuous on a set converges uniformly 
on that set , then the sum of the series is also continuous on the set. 

As an illustration of the possible use of these results, consider the following. 

Example 1 Abel’s method of summing series. 

Comparing Corollary 2 with Abel’s second theorem (Proposition 4 of Sect. 16.2), 
we draw the following conclusion. 

Proposition 1 If a power series $\sum_{n=0}^{\infty} c_n (z - z_0)^n$ converges at a point
$\zeta$, it converges uniformly on the closed interval $[z_0, \zeta]$ from $z_0$ to $\zeta$, and
the sum of the series is continuous on that interval.

In particular, this means that if a numerical series $\sum_{n=0}^{\infty} c_n$ converges, then
the power series $\sum_{n=0}^{\infty} c_n x^n$ converges uniformly on the closed interval
$0 \le x \le 1$ of the real axis and its sum $s(x) = \sum_{n=0}^{\infty} c_n x^n$ is continuous on
that interval. Since $s(1) = \sum_{n=0}^{\infty} c_n$, we can thus assert that if the series
$\sum_{n=0}^{\infty} c_n$ converges, then the following equality holds:

$$ \sum_{n=0}^{\infty} c_n = \lim_{x \to 1-0} \sum_{n=0}^{\infty} c_n x^n. \tag{16.23} $$

It is interesting that the right-hand side of Eq. (16.23) may have a meaning even
when the series on the left diverges in its traditional sense. For example, the series
$1 - 1 + 1 - \cdots$ corresponds to the series $x - x^2 + x^3 - \cdots$, which converges to
$x/(1 + x)$ for $|x| < 1$. As $x \to 1$, this function has the limit $1/2$.

The method of summing a series known as Abel summation consists of ascribing
to the left-hand side of (16.23) the value of the right-hand side if it is defined. We
have seen that if the series $\sum_{n=0}^{\infty} c_n$ converges in the traditional sense, then
its classical sum will be assigned to it by Abel summation. At the same time, for example,
Abel's method assigns to the series $\sum_{n=0}^{\infty} (-1)^n$, which diverges in the
traditional sense, the natural average value $1/2$.
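
Abel summation is simple to try out numerically. The sketch below evaluates a truncated power series $\sum c_n x^n$ at a point $x$ close to 1, with the index starting at $n = 0$ as in (16.23); the truncation length and the two coefficient functions are illustrative assumptions. For the divergent series $1 - 1 + 1 - \cdots$ the values approach $1/2$, and for the convergent series $1 - \frac{1}{2} + \frac{1}{3} - \cdots$ they approach its ordinary sum $\ln 2$.

```python
import math


def abel_value(coeff, x: float, terms: int = 50000) -> float:
    """Truncated power series sum_{n < terms} coeff(n) * x^n, for 0 <= x < 1."""
    total, power = 0.0, 1.0
    for n in range(terms):
        total += coeff(n) * power
        power *= x
    return total


def grandi(n: int) -> float:
    """Coefficients of the divergent series 1 - 1 + 1 - ..."""
    return 1.0 if n % 2 == 0 else -1.0


def alternating_harmonic(n: int) -> float:
    """Coefficients of the convergent series 1 - 1/2 + 1/3 - ... = ln 2."""
    return (-1.0) ** n / (n + 1)


if __name__ == "__main__":
    for x in (0.9, 0.99, 0.999):
        print(x, abel_value(grandi, x), abel_value(alternating_harmonic, x), math.log(2))
```

The truncation is harmless here because $x^n$ is negligible long before $n$ reaches the cutoff for the values of $x$ used.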

Further questions connected with Example 1 can be found in Problems 5-8 below.

Example 2 Earlier, when discussing Taylor's formula, we showed that the following
expansion holds:

$$ (1 + x)^{\alpha} = 1 + \frac{\alpha}{1!}x + \frac{\alpha(\alpha - 1)}{2!}x^2 + \cdots +
\frac{\alpha(\alpha - 1) \cdots (\alpha - n + 1)}{n!}x^n + \cdots \tag{16.24} $$

We can verify that for $\alpha > 0$ the numerical series

$$ 1 + \frac{\alpha}{1!} + \frac{\alpha(\alpha - 1)}{2!} + \cdots +
\frac{\alpha(\alpha - 1) \cdots (\alpha - n + 1)}{n!} + \cdots $$

converges. Hence by Abel's theorem, if $\alpha > 0$, the series (16.24) converges uniformly
on the closed interval $0 \le x \le 1$. But the function $(1 + x)^{\alpha}$ is continuous at
$x = 1$, and so one can assert that if $\alpha > 0$, then Eq. (16.24) holds also for $x = 1$.

In particular, we can assert that for $\alpha > 0$

$$ (1 - t^2)^{\alpha} = 1 - \frac{\alpha}{1!}t^2 + \frac{\alpha(\alpha - 1)}{2!}t^4 - \cdots +
(-1)^n \frac{\alpha(\alpha - 1) \cdots (\alpha - n + 1)}{n!}t^{2n} + \cdots \tag{16.25} $$

and this series converges to $(1 - t^2)^{\alpha}$ uniformly on $[-1, 1]$.

Setting $\alpha = \frac{1}{2}$ and $t^2 = 1 - x^2$ in (16.25), for $|x| \le 1$ we find

$$ |x| = 1 - \frac{\frac{1}{2}}{1!}(1 - x^2) +
\frac{\frac{1}{2}\bigl(\frac{1}{2} - 1\bigr)}{2!}(1 - x^2)^2 - \cdots, \tag{16.26} $$

and the series of polynomials on the right-hand side converges to $|x|$ uniformly on
the closed interval $[-1, 1]$. Setting $P_n(x) := S_n(x) - S_n(0)$, where $S_n(x)$ is the $n$th
partial sum of the series, we find that for any prescribed tolerance $\varepsilon > 0$ there is a
polynomial $P(x)$ such that $P(0) = 0$ and

$$ \max_{-1 \le x \le 1} \bigl| |x| - P(x) \bigr| < \varepsilon. \tag{16.27} $$

Let us now return to the general theory. 

We have shown that continuity of functions is preserved under uniform passage 
to the limit. The condition of uniformity in passage to the limit is, however, only 
a sufficient condition in order that the limit of continuous functions also be a con- 
tinuous function (see Examples 8 and 9 of Sect. 16.1). At the same time there is a 
specific situation in which the convergence of continuous functions to a continuous 
function guarantees that the convergence is uniform. 


Proposition 2 (Dini's$^2$ theorem) If a sequence of continuous functions on a compact
set converges monotonically to a continuous function, then the convergence is uniform.

$^2$ U. Dini (1845-1918), Italian mathematician best known for his work in the theory of
functions.

Proof For definiteness suppose that $f_n$ is a nondecreasing sequence converging
to $f$. We fix an arbitrary $\varepsilon > 0$, and for every point $x$ of the compact set $K$ we
find an index $n_x$ such that $0 \le f(x) - f_{n_x}(x) < \varepsilon$. Since the functions $f$ and
$f_{n_x}$ are continuous on $K$, the inequality $0 \le f(\xi) - f_{n_x}(\xi) < \varepsilon$ holds in
some neighborhood $U(x)$ of $x \in K$. From the covering of the compact set $K$ by these
neighborhoods one can extract a finite covering $U(x_1), \ldots, U(x_k)$ and then fix the
index $n(\varepsilon) = \max\{n_{x_1}, \ldots, n_{x_k}\}$. Then for any $n > n(\varepsilon)$, by the fact
that the sequence $\{f_n;\ n \in \mathbb{N}\}$ is nondecreasing, we have
$0 \le f(\xi) - f_n(\xi) < \varepsilon$ at every point $\xi \in K$. □

Corollary 3 If the terms of the series $\sum_{n=1}^{\infty} a_n(x)$ are nonnegative functions
$a_n : K \to \mathbb{R}$ that are continuous on a compact set $K$ and the series converges to a
continuous function on $K$, then it converges uniformly on $K$.

Proof The partial sums $s_n(x) = \sum_{k=1}^{n} a_k(x)$ of this series satisfy the hypotheses
of Dini's theorem. □

Example 3 We shall show that the sequence of functions $f_n(x) = n(1 - x^{1/n})$ tends
to $f(x) = \ln\frac{1}{x}$ as $n \to +\infty$ uniformly on each closed interval $[a, b]$ contained
in the interval $0 < x < \infty$.

Proof For fixed $x > 0$ the function $x^t = e^{t \ln x}$ is convex with respect to $t$, so that
the ratio $\frac{x^t - x^0}{t - 0}$ (the slope of the chord) is nonincreasing as $t \to +0$ and
tends to $\ln x$. Hence $f_n(x) \nearrow \ln\frac{1}{x}$ for $x > 0$ as $n \to +\infty$. By Dini's
theorem it now follows that the convergence of $f_n(x)$ to $\ln\frac{1}{x}$ is uniform on each
closed interval $[a, b] \subset\ ]0, +\infty[$. □

We note that the convergence is obviously not uniform on the interval $0 < x \le 1$,
for example, since $\ln\frac{1}{x}$ is unbounded in that interval, while each of the functions
$f_n(x)$ is bounded (by a constant depending on $n$).
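
The behaviour described in Example 3 can be seen numerically. In the sketch below the interval $[0.1, 10]$, the grid and the helper names are illustrative assumptions: the grid supremum of $\ln\frac{1}{x} - f_n(x)$ over this closed interval decreases with $n$, whereas near $x = 0$ the difference stays large for any fixed $n$.

```python
import math


def f(n: int, x: float) -> float:
    """f_n(x) = n (1 - x^(1/n)) for x > 0."""
    return n * (1.0 - x ** (1.0 / n))


def sup_error(n: int, a: float, b: float, points: int = 1001) -> float:
    """Largest |ln(1/x) - f_n(x)| over a uniform grid on [a, b], 0 < a < b."""
    grid = [a + (b - a) * i / (points - 1) for i in range(points)]
    return max(abs(math.log(1.0 / x) - f(n, x)) for x in grid)


if __name__ == "__main__":
    for n in (10, 100, 1000):
        # second column shrinks with n; third column stays large near x = 0
        print(n, sup_error(n, 0.1, 10.0), math.log(1.0 / 1e-300) - f(n, 1e-300))
```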


16.3.4 Integration and Passage to the Limit 

We shall show that if functions that are integrable over a closed interval converge 
uniformly on that interval, then the limit function is also integrable and its integral 
over that interval equals the limit of the integrals of the original functions. 

Theorem 3 Let $\{f_t;\ t \in T\}$ be a family of functions $f_t : [a, b] \to \mathbb{C}$ defined on a closed interval $a \le x \le b$ and depending on the parameter $t \in T$, and let $\mathcal{B}$ be a base in $T$. If the functions of the family are integrable on $[a, b]$ and $f_t \rightrightarrows f$ on $[a, b]$ over the base $\mathcal{B}$, then the limit function $f : [a, b] \to \mathbb{C}$ is also integrable on $[a, b]$ and

$$\int_a^b f(x)\,dx = \lim_{\mathcal{B}} \int_a^b f_t(x)\,dx.$$

Proof Let $p = (P, \xi)$ be a partition $P$ of the closed interval $[a, b]$ with distinguished points $\xi = \{\xi_1, \ldots, \xi_n\}$. Consider the Riemann sums $F_t(p) = \sum_{i=1}^{n} f_t(\xi_i)\,\Delta x_i$, $t \in T$, and $F(p) = \sum_{i=1}^{n} f(\xi_i)\,\Delta x_i$. Let us estimate the difference $F(p) - F_t(p)$. Since $f_t \rightrightarrows f$ on $[a, b]$ over the base $\mathcal{B}$, for every $\varepsilon > 0$ there exists an element $B$ of $\mathcal{B}$ such that $|f(x) - f_t(x)| < \frac{\varepsilon}{b - a}$ at any $t \in B$ and any point $x \in [a, b]$. Hence for $t \in B$ we have

$$|F(p) - F_t(p)| = \Bigl|\sum_{i=1}^{n} \bigl(f(\xi_i) - f_t(\xi_i)\bigr)\Delta x_i\Bigr| \le \sum_{i=1}^{n} \bigl|f(\xi_i) - f_t(\xi_i)\bigr|\,\Delta x_i < \varepsilon,$$
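and this estimate holds not only for every $t \in B$, but also for every partition $p$ in the set $\mathcal{P} = \{(P, \xi)\}$ of partitions of the closed interval $[a, b]$ with distinguished points. Thus $F_t \rightrightarrows F$ on $\mathcal{P}$ over the base $\mathcal{B}$. Now, taking the traditional base $\lambda(P) \to 0$ in $\mathcal{P}$, we find by Theorem 1 that the following diagram is commutative:

$$
\begin{array}{ccc}
\sum\limits_{i=1}^{n} f_t(\xi_i)\,\Delta x_i =: F_t(p) & \underset{\mathcal{B}}{\rightrightarrows} & F(p) := \sum\limits_{i=1}^{n} f(\xi_i)\,\Delta x_i \\[4pt]
\lambda(P) \to 0 \,\Big\downarrow & & \Big\downarrow\, \lambda(P) \to 0 \\[4pt]
\int_a^b f_t(x)\,dx =: A_t & \underset{\mathcal{B}}{\longrightarrow} & A := \int_a^b f(x)\,dx
\end{array}
$$

which proves Theorem 3. □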


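Corollary 4 If the series $\sum_{n=1}^{\infty} f_n(x)$ consisting of integrable functions on a closed interval $[a, b] \subset \mathbb{R}$ converges uniformly on that closed interval, then its sum is also integrable on $[a, b]$ and

$$\int_a^b \Bigl(\sum_{n=1}^{\infty} f_n(x)\Bigr) dx = \sum_{n=1}^{\infty} \int_a^b f_n(x)\,dx.$$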


Example 4 When we write $\frac{\sin x}{x}$ in this example, we shall assume that this ratio equals 1 when $x = 0$.

We have noted earlier that the function $\operatorname{Si}(x) = \int_0^x \frac{\sin t}{t}\,dt$ is not an elementary function. Using the theorem just proved, we can nevertheless obtain a very simple representation of this function as a power series.

To do this, we remark that

$$\frac{\sin t}{t} = \sum_{n=0}^{\infty} \frac{(-1)^n}{(2n + 1)!}\, t^{2n}, \qquad (16.28)$$

and the series on the right-hand side converges uniformly on every closed interval $[-a, a] \subset \mathbb{R}$. The uniform convergence of the series follows from the Weierstrass M-test, since $\frac{|t|^{2n}}{(2n+1)!} \le \frac{a^{2n}}{(2n+1)!}$ for $|t| \le a$, while the numerical series $\sum_{n=0}^{\infty} \frac{a^{2n}}{(2n+1)!}$ converges.

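By Corollary 4 we can now write

$$\operatorname{Si}(x) = \int_0^x \Bigl(\sum_{n=0}^{\infty} \frac{(-1)^n}{(2n + 1)!}\, t^{2n}\Bigr) dt = \sum_{n=0}^{\infty} \frac{(-1)^n}{(2n + 1)!} \int_0^x t^{2n}\,dt = \sum_{n=0}^{\infty} \frac{(-1)^n x^{2n+1}}{(2n + 1)!\,(2n + 1)}.$$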


The series just obtained also turns out to converge uniformly on every closed 
interval of the real line, so that, for any closed interval [a,b] of variation of the 
argument x and any preassigned absolute error tolerance, one can choose a polyno- 
mial - a partial sum of this series - that makes it possible to compute Si(x) with less 
than the given error at every point of the closed interval [a, b]. 
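As a hedged illustration of the last remark, the sketch below evaluates partial sums of the series for $\operatorname{Si}(x)$ and compares them with a direct numerical quadrature of $\int_0^x \frac{\sin t}{t}\,dt$. The function names `si_partial_sum` and `si_simpson`, the number of terms, and the use of the composite Simpson rule are assumptions made for the illustration only.

```python
import math

def si_partial_sum(x, terms):
    """Partial sum of sum_{n>=0} (-1)^n x^(2n+1) / ((2n+1)! (2n+1))."""
    total = 0.0
    power_over_factorial = x  # x^(2n+1) / (2n+1)! for n = 0
    for n in range(terms):
        total += (-1) ** n * power_over_factorial / (2 * n + 1)
        # Advance x^(2n+1)/(2n+1)! to x^(2n+3)/(2n+3)!.
        power_over_factorial *= x * x / ((2 * n + 2) * (2 * n + 3))
    return total

def si_simpson(x, panels=2000):
    """Composite Simpson rule for the integral of sin(t)/t over [0, x] (panels even)."""
    def integrand(t):
        return 1.0 if t == 0.0 else math.sin(t) / t
    h = x / panels
    s = integrand(0.0) + integrand(x)
    for i in range(1, panels):
        s += (4 if i % 2 else 2) * integrand(i * h)
    return s * h / 3.0

print(si_partial_sum(1.0, 10))
print(si_partial_sum(5.0, 30), si_simpson(5.0))
```

On a fixed interval $[-a, a]$ the remainder after $N$ terms is bounded by the tail of the convergent numerical series $\sum \frac{a^{2n+1}}{(2n+1)!\,(2n+1)}$, which is what allows the number of terms to be chosen once for the whole interval.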


16.3.5 Differentiation and Passage to the Limit 


Theorem 4 Let $\{f_t;\ t \in T\}$ be a family of functions $f_t : X \to \mathbb{C}$ defined on a convex bounded set $X$ (in $\mathbb{R}$, $\mathbb{C}$, or any other normed space) and depending on the parameter $t \in T$; let $\mathcal{B}$ be a base in $T$. If the functions of the family are differentiable on $X$, the family of derivatives $\{f_t';\ t \in T\}$ converges uniformly on $X$ to a function $\varphi : X \to \mathbb{C}$, and the original family $\{f_t;\ t \in T\}$ converges at even one point $x_0 \in X$, then it converges uniformly on the entire set $X$ to a differentiable function $f : X \to \mathbb{C}$, and $f' = \varphi$.

Proof We begin by showing that the family $\{f_t;\ t \in T\}$ converges uniformly on the set $X$ over the base $\mathcal{B}$. We use the mean-value theorem in the following estimates:

$$
\begin{aligned}
|f_{t_1}(x) - f_{t_2}(x)| &\le \bigl|\bigl(f_{t_1}(x) - f_{t_2}(x)\bigr) - \bigl(f_{t_1}(x_0) - f_{t_2}(x_0)\bigr)\bigr| + |f_{t_1}(x_0) - f_{t_2}(x_0)| \\
&\le \sup_{\xi \in [x_0, x]} |f_{t_1}'(\xi) - f_{t_2}'(\xi)|\,|x - x_0| + |f_{t_1}(x_0) - f_{t_2}(x_0)| = \Delta(x, t_1, t_2).
\end{aligned}
$$

By hypothesis the family $\{f_t';\ t \in T\}$ converges uniformly on $X$ over the base $\mathcal{B}$, and the quantity $f_t(x_0)$ has a limit over the same base as a function of $t$, while $|x - x_0|$ is bounded for $x \in X$. By the necessity part of the Cauchy criterion for uniform convergence of the family of functions $f_t'$ and the existence of the limit of $f_t(x_0)$, for every $\varepsilon > 0$ there exists $B$ in $\mathcal{B}$ such that $\Delta(x, t_1, t_2) < \varepsilon$ for any $t_1, t_2 \in B$ and any $x \in X$. But, by the estimates just written, this means that the family of functions $\{f_t;\ t \in T\}$ satisfies the hypotheses of the Cauchy criterion and consequently converges uniformly on $X$ over the base $\mathcal{B}$ to a function $f : X \to \mathbb{C}$.

Again using the mean-value theorem, we now obtain the following estimates:

$$
\begin{aligned}
&\bigl|\bigl(f_{t_1}(x + h) - f_{t_1}(x) - f_{t_1}'(x)h\bigr) - \bigl(f_{t_2}(x + h) - f_{t_2}(x) - f_{t_2}'(x)h\bigr)\bigr| \\
&\quad = \bigl|(f_{t_1} - f_{t_2})(x + h) - (f_{t_1} - f_{t_2})(x) - (f_{t_1} - f_{t_2})'(x)h\bigr| \\
&\quad \le \sup_{0 < \theta < 1} \bigl|(f_{t_1} - f_{t_2})'(x + \theta h)\bigr|\,|h| + \bigl|(f_{t_1} - f_{t_2})'(x)\bigr|\,|h| \\
&\quad = \Bigl(\sup_{0 < \theta < 1} \bigl|f_{t_1}'(x + \theta h) - f_{t_2}'(x + \theta h)\bigr| + \bigl|f_{t_1}'(x) - f_{t_2}'(x)\bigr|\Bigr)|h|.
\end{aligned}
$$

These estimates, which are valid for $x, x + h \in X$, show, in view of the uniform convergence of the family $\{f_t';\ t \in T\}$ on $X$, that the family $\{F_t;\ t \in T\}$ of functions

$$F_t(h) = \frac{f_t(x + h) - f_t(x) - f_t'(x)h}{|h|},$$

which we shall consider with a fixed value of $x \in X$, converges over the base $\mathcal{B}$ uniformly with respect to all values of $h \ne 0$ such that $x + h \in X$.

We remark that $F_t(h) \to 0$ as $h \to 0$ since the function $f_t$ is differentiable at the point $x \in X$; and since $f_t \to f$ and $f_t' \to \varphi$ over the base $\mathcal{B}$, we have $F_t(h) \to F(h) = \frac{f(x + h) - f(x) - \varphi(x)h}{|h|}$ over the base $\mathcal{B}$.

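Applying Theorem 1, we can now write the commutative diagram

$$
\begin{array}{ccc}
\dfrac{f_t(x + h) - f_t(x) - f_t'(x)h}{|h|} =: F_t(h) & \underset{\mathcal{B}}{\rightrightarrows} & F(h) := \dfrac{f(x + h) - f(x) - \varphi(x)h}{|h|} \\[8pt]
h \to 0 \,\Big\downarrow & & \Big\downarrow\, h \to 0 \\[4pt]
0 & \underset{\mathcal{B}}{\longrightarrow} & 0
\end{array}
$$

The right-hand limiting passage as $h \to 0$ shows that $f$ is differentiable at $x \in X$ and $f'(x) = \varphi(x)$. □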

Corollary 5 If the series $\sum_{n=1}^{\infty} f_n(x)$ of functions $f_n : X \to \mathbb{C}$ that are differentiable on a bounded convex subset $X$ (contained in $\mathbb{R}$, $\mathbb{C}$, or any other normed vector space) converges at even one point $x \in X$ and the series $\sum_{n=1}^{\infty} f_n'(x)$ converges uniformly on $X$, then $\sum_{n=1}^{\infty} f_n(x)$ also converges uniformly on $X$, its sum is differentiable on $X$, and

$$\Bigl(\sum_{n=1}^{\infty} f_n\Bigr)'(x) = \sum_{n=1}^{\infty} f_n'(x).$$

This follows from Theorem 4 and the definitions of the sum and uniform convergence of a series, together with the linearity of the operation of differentiation.

Remark 4 The proofs of Theorems 3 and 4, like the theorems themselves and their corollaries, remain valid for functions $f_t : X \to Y$ with values in any complete normed vector space $Y$. For example, $Y$ may be $\mathbb{R}$, $\mathbb{C}$, $\mathbb{R}^n$, $\mathbb{C}^n$, $C[a, b]$, and so on. The domain of definition $X$ for the functions $f_t$ in Theorem 4 also may be any suitable subset of any normed vector space. In particular, $X$ may be contained in $\mathbb{R}$, $\mathbb{C}$, $\mathbb{R}^n$, or $\mathbb{C}^n$. For real-valued functions of a real argument (under additional convergence requirements) the proofs of these theorems can be made even simpler (see Problem 11).

As an illustration of the use of Theorems 2-4 we shall prove the following propo- 
sition, which is widely used in both theory and in specific computations. 

Proposition 3 Let $K \subset \mathbb{C}$ be the convergence disk for a power series $\sum_{n=0}^{\infty} c_n (z - z_0)^n$. If $K$ contains more than just the point $z_0$, then the sum of the series $f(z)$ is differentiable inside $K$ and

$$f'(z) = \sum_{n=1}^{\infty} n c_n (z - z_0)^{n-1}. \qquad (16.29)$$

Moreover, the function $f(z) : K \to \mathbb{C}$ can be integrated over any smooth path $\gamma : [0, 1] \to K$, and if $[0, 1] \ni t \mapsto z(t) \in K$, $z(0) = z_0$, and $z(1) = z$, then

$$\int_{\gamma} f(z)\,dz = \sum_{n=0}^{\infty} \frac{c_n}{n + 1} (z - z_0)^{n+1}. \qquad (16.30)$$

Remark 5 Here $\int_{\gamma} f(z)\,dz := \int_0^1 f(z(t))\,z'(t)\,dt$. In particular, if the equality $f(x) = \sum_{n=0}^{\infty} a_n (x - x_0)^n$ holds on an interval $-R < x - x_0 < R$ of the real line $\mathbb{R}$, then

$$\int_{x_0}^{x} f(t)\,dt = \sum_{n=0}^{\infty} \frac{a_n}{n + 1} (x - x_0)^{n+1}.$$


Proof Since $\limsup_{n \to \infty} \sqrt[n-1]{n|c_n|} = \limsup_{n \to \infty} \sqrt[n]{|c_n|}$, it follows from the Cauchy-Hadamard formula (Theorem 2 of Sect. 16.2.2) that the power series $\sum_{n=1}^{\infty} n c_n (z - z_0)^{n-1}$, obtained by termwise differentiation of the power series $\sum_{n=0}^{\infty} c_n (z - z_0)^n$, has the same convergence disk $K$ as the original power series. But by Theorem 2 of Sect. 16.2.2 the series $\sum_{n=1}^{\infty} n c_n (z - z_0)^{n-1}$ converges uniformly in any closed disk $K_q$ contained in the interior of $K$. Since the series $\sum_{n=0}^{\infty} c_n (z - z_0)^n$ obviously converges at $z = z_0$, Corollary 5 is applicable to it, which justifies the equality (16.29). Thus it has now been shown that a power series can be differentiated termwise.

Let us now verify that it can also be integrated termwise. 

If $\gamma : [0, 1] \to K$ is a smooth path in $K$, there exists a closed disk $K_q$ such that $\gamma \subset K_q$ and $K_q \subset K$. On $K_q$ the original power series converges uniformly, so that in the equality

$$f(z(t)) = \sum_{n=0}^{\infty} c_n \bigl(z(t) - z_0\bigr)^n$$

the series of continuous functions on the right-hand side converges uniformly on the closed interval $0 \le t \le 1$ to the continuous function $f(z(t))$. Multiplying this equality by the function $z'(t)$, which is continuous on the closed interval $[0, 1]$, violates neither the equality itself nor the uniform convergence of the series. Hence by Theorem 3 we obtain

$$\int_0^1 f(z(t))\,z'(t)\,dt = \sum_{n=0}^{\infty} c_n \int_0^1 \bigl(z(t) - z_0\bigr)^n z'(t)\,dt.$$

But,

$$\int_0^1 \bigl(z(t) - z(0)\bigr)^n z'(t)\,dt = \frac{1}{n + 1} \int_0^1 d\bigl(z(t) - z(0)\bigr)^{n+1} = \frac{1}{n + 1} \bigl(z(1) - z(0)\bigr)^{n+1} = \frac{1}{n + 1} (z - z_0)^{n+1},$$

and we arrive at Eq. (16.30). □

Since it is obvious that $c_0 = f(z_0)$ in the expansion $f(z) = \sum_{n=0}^{\infty} c_n (z - z_0)^n$, applying Eq. (16.29) successively, we again obtain the relation $c_n = \frac{f^{(n)}(z_0)}{n!}$, which shows that a power series is uniquely determined by its sum and is the Taylor series of the sum.

Example 5 The Bessel function $J_n(x)$, $n \in \mathbb{N}$, is a solution of Bessel's³ equation $x^2 y'' + x y' + (x^2 - n^2) y = 0$.

Let us attempt to solve this equation, for example, for $n = 0$, as a power series $y = \sum_{k=0}^{\infty} c_k x^k$. Applying formula (16.29) successively, after elementary transformations, we arrive at the relation

$$c_1 + \sum_{k=2}^{\infty} \bigl(k^2 c_k + c_{k-2}\bigr) x^{k-1} = 0,$$

from which, by the uniqueness of the power series with a given sum, we find

$$c_1 = 0, \qquad k^2 c_k + c_{k-2} = 0, \quad k = 2, 3, \ldots$$

From this it is easy to deduce that $c_{2k-1} = 0$, $k \in \mathbb{N}$, and $c_{2k} = (-1)^k \frac{c_0}{(k!)^2\, 2^{2k}}$. If we assume $J_0(0) = 1$, we arrive at the solution

$$J_0(x) = 1 + \sum_{k=1}^{\infty} (-1)^k \frac{x^{2k}}{(k!)^2\, 2^{2k}}.$$

This series converges on the entire line $\mathbb{R}$ (and in the entire plane $\mathbb{C}$), so that all the operations carried out above in order to find its specific form are now justified.

³ F.W. Bessel (1784-1846) - German astronomer.
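The recurrence of Example 5 is easy to run mechanically. The sketch below is an illustration under stated assumptions: the names `j0_coefficients` and `residual` and the truncation order are hypothetical choices, $c_0 = 1$ is taken as in the text, and the truncated polynomial is substituted into the equation divided by $x$, that is, into $x y'' + y' + x y$.

```python
import math

def j0_coefficients(max_k):
    """Coefficients c_0..c_max_k from c_0 = 1, c_1 = 0, k^2 c_k + c_{k-2} = 0."""
    c = [0.0] * (max_k + 1)
    c[0] = 1.0
    for k in range(2, max_k + 1):
        c[k] = -c[k - 2] / (k * k)
    return c

def evaluate(c, x):
    """Value of the polynomial sum c_k x^k."""
    return sum(ck * x ** k for k, ck in enumerate(c))

def residual(c, x):
    """Value of x*y'' + y' + x*y for the polynomial y with coefficients c."""
    y = evaluate(c, x)
    dy = sum(k * ck * x ** (k - 1) for k, ck in enumerate(c) if k >= 1)
    d2y = sum(k * (k - 1) * ck * x ** (k - 2) for k, ck in enumerate(c) if k >= 2)
    return x * d2y + dy + x * y

coeffs = j0_coefficients(40)
# Compare with the closed form c_{2k} = (-1)^k / ((k!)^2 2^(2k)).
closed_form = [(-1) ** k / (math.factorial(k) ** 2 * 4 ** k) for k in range(21)]
print(evaluate(coeffs, 1.0), residual(coeffs, 2.0))
```

For a truncated series the residual is not exactly zero: only the contribution of the last retained coefficients survives, and it becomes negligible once the truncation order is large compared with $|x|$.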


Example 6 In Example 5 we sought a solution of an equation as a power series. But if a series is given, using formula (16.29), one can immediately check to see whether it is the solution of a given equation. Thus, by direct computation, one can verify that the function introduced by Gauss

$$F(\alpha, \beta, \gamma, x) = 1 + \sum_{n=1}^{\infty} \frac{\alpha(\alpha + 1) \cdots (\alpha + n - 1)\,\beta(\beta + 1) \cdots (\beta + n - 1)}{n!\,\gamma(\gamma + 1) \cdots (\gamma + n - 1)}\, x^n$$

(the hypergeometric series) is well-defined for $|x| < 1$ and satisfies the so-called hypergeometric equation

$$x(x - 1)\,y'' - [\gamma - (\alpha + \beta + 1)x] \cdot y' + \alpha\beta \cdot y = 0.$$

In conclusion we note that, in contrast to Theorems 2 and 3, the hypotheses of Theorem 4 require that the family of derivatives, rather than the original family, converge uniformly. We have already seen (Example 2 of Sect. 16.1) that the sequence of functions $f_n(x) = \frac{1}{n} \sin n^2 x$ converges to the differentiable function $f(x) \equiv 0$ uniformly, while the sequence of derivatives $f_n'(x)$ does not converge to $f'(x)$. The point is that the derivative characterizes the rate of variation of the function, not the size of the values of the function. Even when the function changes by an amount that is small in absolute value, the derivative may formally change very strongly, as happens in the present case of small oscillations with large frequency. This is the circumstance that lies at the basis of Weierstrass' example of a continuous nowhere-differentiable function, which he gave as the series $f(x) = \sum_{n=0}^{\infty} a^n \cos(b^n \pi x)$, which obviously converges uniformly on the entire line $\mathbb{R}$ if $0 < a < 1$. Weierstrass showed that if the parameter $b$ is chosen so as to satisfy the condition $a \cdot b > 1 + \frac{3}{2}\pi$, then on the one hand $f$ will be continuous, being the sum of a uniformly convergent series of continuous functions, while on the other hand, it will not have a derivative at any point $x \in \mathbb{R}$. The rigorous verification of this last assertion is rather taxing, so that those who wish to obtain a simpler example of a continuous function having no derivative may see Problem 5 in Sect. 5.1.
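The contrast between the size of a function and the size of its derivative can be seen on the sequence $f_n(x) = \frac{1}{n}\sin n^2 x$ mentioned above. The following sketch is illustrative only; the helper names, the sampling interval $[0, 2\pi]$, and the grid size are assumptions.

```python
import math

def f(n, x):
    """f_n(x) = sin(n^2 x) / n."""
    return math.sin(n * n * x) / n

def df(n, x):
    """Derivative f_n'(x) = n cos(n^2 x)."""
    return n * math.cos(n * n * x)

def sampled_sup(g, n, a=0.0, b=2 * math.pi, samples=10001):
    """Maximum of |g(n, x)| over a uniform grid on [a, b]."""
    step = (b - a) / (samples - 1)
    return max(abs(g(n, a + i * step)) for i in range(samples))

# For each n: the sup of |f_n| (at most 1/n) and the derivative at 0 (equal to n).
report = [(n, sampled_sup(f, n), df(n, 0.0)) for n in (1, 10, 100)]
for row in report:
    print(row)
```

The first column of values tends to zero, which is the uniform convergence $f_n \rightrightarrows 0$, while the derivatives at $x = 0$ grow without bound, so the hypotheses of Theorem 4 fail for this sequence.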


16.3.6 Problems and Exercises 

1. Using power series, find a solution of the equation $y''(x) - y(x) = 0$ satisfying the conditions

a) $y(0) = 0$, $y(1) = 1$;

b) $y(0) = 1$, $y(1) = 0$.
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2. Find the sum of the series VEi ' . , , . 

3. a) Verify that the function defined by the series

$$J_n(x) = \sum_{k=0}^{\infty} \frac{(-1)^k}{k!\,(k + n)!} \Bigl(\frac{x}{2}\Bigr)^{2k + n}$$

is a solution of Bessel's equation of order $n \ge 0$ from Example 5.

b) Verify that the hypergeometric series in Example 6 provides a solution of the hypergeometric equation.

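4. Obtain and justify the following expansions, which are suitable for computation, for the complete elliptic integrals of first and second kind with $0 < k < 1$:

$$K(k) = \int_0^{\pi/2} \frac{d\varphi}{\sqrt{1 - k^2 \sin^2 \varphi}} = \frac{\pi}{2} \Bigl(1 + \sum_{n=1}^{\infty} \Bigl(\frac{(2n - 1)!!}{(2n)!!}\Bigr)^2 k^{2n}\Bigr),$$

$$E(k) = \int_0^{\pi/2} \sqrt{1 - k^2 \sin^2 \varphi}\; d\varphi = \frac{\pi}{2} \Bigl(1 - \sum_{n=1}^{\infty} \Bigl(\frac{(2n - 1)!!}{(2n)!!}\Bigr)^2 \frac{k^{2n}}{2n - 1}\Bigr).$$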


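5. Find

a) $\sum_{k=0}^{\infty} r^k e^{ik\varphi}$;

b) $\sum_{k=0}^{\infty} r^k \cos k\varphi$;

c) $\sum_{k=0}^{\infty} r^k \sin k\varphi$.

Show that the following relations hold for $|r| < 1$:

d) $\sum_{k=0}^{\infty} r^k e^{ik\varphi} = \dfrac{1}{1 - r\cos\varphi - ir\sin\varphi}$;

e) $\dfrac{1}{2} + \sum_{k=1}^{\infty} r^k \cos k\varphi = \dfrac{1}{2} \cdot \dfrac{1 - r^2}{1 - 2r\cos\varphi + r^2}$;

f) $\sum_{k=1}^{\infty} r^k \sin k\varphi = \dfrac{r\sin\varphi}{1 - 2r\cos\varphi + r^2}$.

Verify that the following equations are true in the sense of Abel summation:

g) $\dfrac{1}{2} + \sum_{k=1}^{\infty} \cos k\varphi = 0$ if $\varphi \ne 2\pi n$, $n \in \mathbb{Z}$;

h) $\sum_{k=1}^{\infty} \sin k\varphi = \dfrac{1}{2} \cot\dfrac{\varphi}{2}$ if $\varphi \ne 2\pi n$, $n \in \mathbb{Z}$.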

6. After considering the product of the series

$$(a_0 + a_1 + \cdots)(b_0 + b_1 + \cdots) = (c_0 + c_1 + \cdots),$$

where $c_n = a_0 b_n + a_1 b_{n-1} + \cdots + a_{n-1} b_1 + a_n b_0$, and using Proposition 1, show that if the series $\sum_{n=0}^{\infty} a_n$, $\sum_{n=0}^{\infty} b_n$, and $\sum_{n=0}^{\infty} c_n$ converge respectively to $A$, $B$, and $C$, then $A \cdot B = C$.

7. Let $s_n = \sum_{k=1}^{n} a_k$ and $\sigma_n = \frac{1}{n} \sum_{k=1}^{n} s_k$. The series $\sum_{k=1}^{\infty} a_k$ is Cesàro⁴ summable, more precisely $(c, 1)$-summable to $A$, if $\lim_{n \to \infty} \sigma_n = A$. In that case we write $\sum_{k=1}^{\infty} a_k = A\,(c, 1)$.

a) Verify that $1 - 1 + 1 - 1 + \cdots = \frac{1}{2}\,(c, 1)$.

b) Show that $\sigma_n = \sum_{k=1}^{n} \bigl(1 - \frac{k - 1}{n}\bigr) a_k$.

c) Verify that if $\sum_{k=1}^{\infty} a_k = A$ in the usual sense, then $\sum_{k=1}^{\infty} a_k = A\,(c, 1)$.

d) The $(c, 2)$-sum of the series $\sum_{k=1}^{\infty} a_k$ is the quantity $\lim_{n \to \infty} \frac{1}{n}(\sigma_1 + \cdots + \sigma_n)$ if this limit exists. In this way one can define the $(c, r)$-sum of any order $r$. Show that if $\sum_{k=1}^{\infty} a_k = A\,(c, r)$, then $\sum_{k=1}^{\infty} a_k = A\,(c, r + 1)$.

e) Prove that if $\sum_{k=1}^{\infty} a_k = A\,(c, 1)$, then the series is also Abel summable to $A$.

8. a) A "theorem of Tauberian type" is the collective description for a class of theorems that make it possible, by introducing various extra hypotheses, to judge the behavior of certain quantities from the behavior of certain of their means. An example of such a theorem involving Cesàro summation of series is the following proposition, which you may attempt to prove following Hardy.⁵

If $\sum_{n=1}^{\infty} a_n = A\,(c, 1)$ and $a_n = O\bigl(\frac{1}{n}\bigr)$, then the series $\sum_{n=1}^{\infty} a_n$ converges in the ordinary sense and to the same sum.

b) Tauber's⁶ original theorem relates to Abel summation of series and consists of the following.

Suppose the series $\sum_{n=1}^{\infty} a_n x^n$ converges for $0 < x < 1$ and $\lim_{x \to 1 - 0} \sum_{n=1}^{\infty} a_n x^n = A$. If $\lim_{n \to \infty} \frac{a_1 + 2a_2 + \cdots + n a_n}{n} = 0$, then the series $\sum_{n=1}^{\infty} a_n$ converges to $A$ in the ordinary sense.

9. It is useful to keep in mind that in relation to the limiting passage under the integral sign there exist theorems that give much freer sufficient conditions for the possibility of such a passage than those made possible by Theorem 3. These theorems constitute one of the major achievements of the so-called Lebesgue integral. In the case when the function is Riemann integrable on a closed interval $[a, b]$, that is, $f \in \mathcal{R}[a, b]$, this function also belongs to the class $\mathcal{L}[a, b]$ of Lebesgue-integrable functions, and the values of the Riemann integral $(R)\int_a^b f(x)\,dx$ of $f$ and the Lebesgue integral $(L)\int_a^b f(x)\,dx$ are the same.

In general the space $\mathcal{L}[a, b]$ is the completion of $\mathcal{R}[a, b]$ (more precisely, $\widetilde{\mathcal{R}}[a, b]$) with respect to the integral metric, and the integral $(L)\int_a^b$ is the continuation of the linear functional $(R)\int_a^b$ from $\mathcal{R}[a, b]$ to $\mathcal{L}[a, b]$.

The definitive Lebesgue "dominated convergence" theorem asserts that if a sequence $\{f_n;\ n \in \mathbb{N}\}$ of functions $f_n \in \mathcal{L}[a, b]$ is such that there exists a nonnegative function $F \in \mathcal{L}[a, b]$ that majorizes the functions of the sequence, that is, $|f_n(x)| \le F(x)$ almost everywhere on $[a, b]$, then the convergence $f_n \to f$ at almost all points of the closed interval $[a, b]$ implies that $f \in \mathcal{L}[a, b]$ and $\lim_{n \to \infty} (L)\int_a^b f_n(x)\,dx = (L)\int_a^b f(x)\,dx$.

⁴ E. Cesàro (1859-1906) - Italian mathematician who studied analysis and geometry.

⁵ G.H. Hardy (1877-1947) - British mathematician who worked mainly in number theory and theory of functions.

⁶ A. Tauber (b. 1866, year of death unknown) - Austrian mathematician who worked mainly in number theory and theory of functions.

a) Show by example that even if all the functions of the sequence $\{f_n;\ n \in \mathbb{N}\}$ are bounded by the same constant $M$ on the interval $[a, b]$, the conditions $f_n \in \mathcal{R}[a, b]$, $n \in \mathbb{N}$, and $f_n \to f$ pointwise on $[a, b]$ still do not imply that $f \in \mathcal{R}[a, b]$. (See Example 5 of Sect. 16.1.)

b) From what has been said about the relation between the integrals $(R)\int_a^b$ and $(L)\int_a^b$ and Lebesgue's theorem show that, under the hypotheses of part a), if it is known that $f \in \mathcal{R}[a, b]$, then $(R)\int_a^b f(x)\,dx = \lim_{n \to \infty} (R)\int_a^b f_n(x)\,dx$. This is a significant strengthening of Theorem 3.

c) In the context of the Riemann integral one can also state the following version of Lebesgue's monotone convergence theorem.

If the sequence $\{f_n;\ n \in \mathbb{N}\}$ of functions $f_n \in \mathcal{R}[a, b]$ converges to zero monotonically, that is, $0 \le f_{n+1} \le f_n$ and $f_n \to 0$ as $n \to \infty$ for every $x \in [a, b]$, then $(R)\int_a^b f_n(x)\,dx \to 0$.

Prove this assertion, using where needed the following useful observation.

d) Let $f \in \mathcal{R}[0, 1]$, $|f| \le M$, and $\int_0^1 f(x)\,dx \ge \alpha > 0$. Then the set $E = \{x \in [0, 1] \mid f(x) \ge \alpha/2\}$ contains a finite number of intervals the sum of whose lengths $l$ is at least $\alpha/(4M)$.

Prove this, using, for example, the intervals of a partition $P$ of the closed interval $[0, 1]$ for which the lower Darboux sum $s(f, P)$ satisfies the relation $0 \le \int_0^1 f(x)\,dx - s(f, P) < \alpha/4$.

10. a) Show by the examples of Sect. 16.1 that it is not always possible to extract a subsequence that converges uniformly on a closed interval from a sequence of functions that converge pointwise on the interval.

b) It is much more difficult to verify directly that it is impossible to extract a subsequence of the sequence of functions $\{f_n;\ n \in \mathbb{N}\}$, where $f_n(x) = \sin nx$, that converges at every point of $[0, 2\pi]$. Prove that this is nevertheless the case. (Use the result of Problem 9b) and the circumstance that $\int_0^{2\pi} (\sin n_k x - \sin n_{k+1} x)^2\,dx = 2\pi \ne 0$ for $n_k < n_{k+1}$.)

c) Let $\{f_n;\ n \in \mathbb{N}\}$ be a uniformly bounded sequence of functions $f_n \in \mathcal{R}[a, b]$. Let

$$F_n(x) = \int_a^x f_n(t)\,dt \quad (a \le x \le b).$$

Show that one can extract a subsequence of the sequence $\{F_n;\ n \in \mathbb{N}\}$ that converges uniformly on the closed interval $[a, b]$.

11. a) Show that if $f, f_n \in \mathcal{R}([a, b], \mathbb{R})$ and $f_n \rightrightarrows f$ on $[a, b]$ as $n \to \infty$, then for every $\varepsilon > 0$ there exists an integer $N \in \mathbb{N}$ such that

$$\Bigl|\int_a^b (f - f_n)(x)\,dx\Bigr| < \varepsilon (b - a)$$

for every $n > N$.

b) Let $f_n \in C^{(1)}([a, b], \mathbb{R})$, $n \in \mathbb{N}$. Using the formula $f_n(x) = f_n(x_0) + \int_{x_0}^{x} f_n'(t)\,dt$, show that if $f_n' \rightrightarrows \varphi$ on $[a, b]$ and there exists a point $x_0 \in [a, b]$ for which the sequence $\{f_n(x_0);\ n \in \mathbb{N}\}$ converges, then the sequence of functions $\{f_n;\ n \in \mathbb{N}\}$ converges uniformly on $[a, b]$ to some function $f \in C^{(1)}([a, b], \mathbb{R})$ and $f_n' \rightrightarrows f' = \varphi$.


16.4 *Compact and Dense Subsets of the Space of Continuous Functions

The present section is devoted to more specialized questions, involving the space of continuous functions, which is ubiquitous in analysis. All these questions, like the metric of the space of continuous functions⁷ itself, are closely connected with the concept of uniform convergence.


16.4.1 The Arzela-Ascoli Theorem 

Definition 1 A family $\mathcal{F}$ of functions $f : X \to Y$ defined on a set $X$ and assuming values in a metric space $Y$ is uniformly bounded on $X$ if the set of values $V = \{y \in Y \mid \exists f \in \mathcal{F}\ \exists x \in X\ (y = f(x))\}$ of the functions in the family is bounded in $Y$.

For numerical functions or for functions $f : X \to \mathbb{R}^n$, this simply means that there exists a constant $M \in \mathbb{R}$ such that $|f(x)| \le M$ for all $x \in X$ and all functions $f \in \mathcal{F}$.

Definition 1' If the set $V \subset Y$ of values of the functions of the family $\mathcal{F}$ is totally bounded (that is, for every $\varepsilon > 0$ there is a finite $\varepsilon$-grid for $V$ in $Y$), the family $\mathcal{F}$ is totally bounded.

For spaces $Y$ in which the concepts of boundedness and total boundedness are the same (for example, for $\mathbb{R}$, $\mathbb{C}$, $\mathbb{R}^n$, and $\mathbb{C}^n$ and in general in the case of a locally compact space $Y$), the concepts of uniform boundedness and total boundedness are the same.

Definition 2 Let $X$ and $Y$ be metric spaces. A family $\mathcal{F}$ of functions $f : X \to Y$ is equicontinuous on $X$ if for every $\varepsilon > 0$ there exists $\delta > 0$ such that $d_Y(f(x_1), f(x_2)) < \varepsilon$ for any function $f$ in the family and any $x_1, x_2 \in X$ such that $d_X(x_1, x_2) < \delta$.

⁷ If you have not completely mastered the general concepts of Chap. 9, you may assume without any loss of content in the following that the functions discussed always map $\mathbb{R}$ into $\mathbb{R}$ or $\mathbb{C}$ into $\mathbb{C}$, or $\mathbb{R}^m$ into $\mathbb{R}^n$.

Example 1 The family of functions $\{x^n;\ n \in \mathbb{N}\}$ is not equicontinuous on $[0, 1]$, but it is equicontinuous on any closed interval of the form $[0, q]$ where $0 < q < 1$.

Example 2 The family of functions $\{\sin nx;\ n \in \mathbb{N}\}$ is not equicontinuous on any nondegenerate closed interval $[a, b] \subset \mathbb{R}$.

Example 3 If the family $\{f_\alpha : [a, b] \to \mathbb{R};\ \alpha \in A\}$ of differentiable functions $f_\alpha$ is such that the family $\{f_\alpha';\ \alpha \in A\}$ of their derivatives is uniformly bounded by a constant $M$, then $|f_\alpha(x_2) - f_\alpha(x_1)| \le M |x_2 - x_1|$, as follows from the mean-value theorem, and hence the original family is equicontinuous on the closed interval $[a, b]$.
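Example 1 can be probed numerically. The sketch below is an illustration under assumptions: the function names are hypothetical, and the supremum over the whole family is replaced by a maximum over $n \le 2000$, which is only a finite truncation. It uses the fact that $x^n$ is increasing and convex on $[0, c]$, so the largest increment over a step of length $\delta$ occurs at the right end of the interval.

```python
def oscillation(n, delta, right_end):
    """sup of |x1**n - x2**n| over x1, x2 in [0, right_end] with |x1 - x2| <= delta.

    Since x**n is increasing and convex there, the sup is attained
    on the pair (right_end - delta, right_end).
    """
    left = max(right_end - delta, 0.0)
    return right_end ** n - left ** n

def family_oscillation(delta, right_end, max_n=2000):
    """Largest oscillation over the members x, x^2, ..., x^max_n of the family."""
    return max(oscillation(n, delta, right_end) for n in range(1, max_n + 1))

on_unit_interval = family_oscillation(1e-3, 1.0)  # stays large however small delta is
on_shorter_interval = family_oscillation(1e-3, 0.9)  # small, uniformly in n
print(on_unit_interval, on_shorter_interval)
```

On $[0, q]$ with $q < 1$ the derivatives $n x^{n-1}$ are bounded by $\max_n n q^{n-1} < \infty$, so this case also falls under Example 3; on $[0, 1]$ no such bound exists, and for fixed $\delta$ the quantity $1 - (1 - \delta)^n$ tends to 1.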

The connection of these concepts with uniform convergence of continuous func- 
tions is shown by the following lemma. 

Lemma 1 Let $K$ and $Y$ be metric spaces, with $K$ compact. A necessary condition for the sequence $\{f_n;\ n \in \mathbb{N}\}$ of continuous functions $f_n : K \to Y$ to converge uniformly on $K$ is that the family $\{f_n;\ n \in \mathbb{N}\}$ be totally bounded and equicontinuous.

Proof Let $f_n \rightrightarrows f$ on $K$. By Theorem 2 of Sect. 16.3, we conclude that $f \in C(K, Y)$. It follows from the uniform continuity of $f$ on the compact set $K$ that for every $\varepsilon > 0$ there exists $\delta > 0$ such that $(d_K(x_1, x_2) < \delta \Longrightarrow d_Y(f(x_1), f(x_2)) < \varepsilon)$ for all $x_1, x_2 \in K$. Given the same $\varepsilon > 0$ we can find an index $N \in \mathbb{N}$ such that $d_Y(f(x), f_n(x)) < \varepsilon$ for all $n > N$ and all $x \in K$. Combining these inequalities and using the triangle inequality, we find that $d_K(x_1, x_2) < \delta$ implies $d_Y(f_n(x_1), f_n(x_2)) < 3\varepsilon$ for every $n > N$ and $x_1, x_2 \in K$. Hence the family $\{f_n;\ n > N\}$ is equicontinuous. Adjoining to this family the equicontinuous family $\{f_1, \ldots, f_N\}$ consisting of a finite number of functions continuous on the compact set $K$, we obtain an equicontinuous family $\{f_n;\ n \in \mathbb{N}\}$.

Total boundedness of the family $\{f_n;\ n \in \mathbb{N}\}$ follows from the inequality $d_Y(f(x), f_n(x)) < \varepsilon$, which holds for $x \in K$ and $n > N$, and the fact that $f(K)$ and $\bigcup_{n=1}^{N} f_n(K)$ are compact sets in $Y$ and hence totally bounded in $Y$. □

Actually the following general result is true. 

Theorem 1 (Arzelà-Ascoli) Let $\mathcal{F}$ be a family of functions $f : K \to Y$ defined on a compact metric space $K$ with values in a complete metric space $Y$.

A necessary and sufficient condition for every sequence $\{f_n \in \mathcal{F};\ n \in \mathbb{N}\}$ to contain a uniformly convergent subsequence is that the family $\mathcal{F}$ be totally bounded and equicontinuous.

Proof Necessity. If $\mathcal{F}$ were not a totally bounded family, one could obviously construct a sequence $\{f_n;\ n \in \mathbb{N}\}$ of functions $f_n \in \mathcal{F}$ that would not be totally bounded and from which (see the lemma) one could not extract a uniformly convergent subsequence.

If $\mathcal{F}$ is not equicontinuous, there exist a number $\varepsilon_0 > 0$, a sequence of functions $\{f_n \in \mathcal{F};\ n \in \mathbb{N}\}$, and a sequence $\{(x_n', x_n'');\ n \in \mathbb{N}\}$ of pairs $(x_n', x_n'')$ of points $x_n'$ and $x_n''$ that converge to a point $x_0 \in K$ as $n \to \infty$, but $d_Y(f_n(x_n'), f_n(x_n'')) \ge \varepsilon_0 > 0$. Then one could not extract a uniformly convergent subsequence from the sequence $\{f_n;\ n \in \mathbb{N}\}$: in fact, by Lemma 1, the functions of such a subsequence must form an equicontinuous family.

Sufficiency. We shall assume that the compact set $K$ is infinite, since the assertion is trivial otherwise. We fix a countable dense subset $E$ in $K$, that is, a sequence $\{x_n \in K;\ n \in \mathbb{N}\}$. Such a set $E$ is easy to obtain by taking, for example, the union of the points of finite $\varepsilon$-grids in $K$ obtained for $\varepsilon = 1, 1/2, \ldots, 1/n, \ldots$

Let $\{f_n;\ n \in \mathbb{N}\}$ be an arbitrary sequence of functions of $\mathcal{F}$.

The sequence $\{f_n(x_1);\ n \in \mathbb{N}\}$ of values of these functions at the point $x_1$ is totally bounded in $Y$ by hypothesis. Since $Y$ is a complete space, it is possible to extract from it a convergent subsequence $\{f_{n_k}(x_1);\ k \in \mathbb{N}\}$. The functions of this sequence, as will be seen, can be conveniently denoted $f_n^1$, $n \in \mathbb{N}$. The superscript 1 shows that this is the sequence constructed for the point $x_1$.

From this subsequence we extract a further subsequence $\{f_{n_k}^1;\ k \in \mathbb{N}\}$, which we denote $\{f_n^2;\ n \in \mathbb{N}\}$, such that the sequence $\{f_n^2(x_2);\ n \in \mathbb{N}\}$ converges.

Continuing this process, we obtain a series $\{f_n^k;\ n \in \mathbb{N}\}$, $k = 1, 2, \ldots$, of sequences. If we now take the "diagonal" sequence $\{g_n = f_n^n;\ n \in \mathbb{N}\}$, it will converge at every point of the dense set $E \subset K$, as one can easily see.

We shall show that the sequence $\{g_n;\ n \in \mathbb{N}\}$ converges at every point of $K$ and that the convergence is uniform on $K$. To do this, we fix $\varepsilon > 0$ and choose $\delta > 0$ in accordance with Definition 2 of equicontinuity of the family $\mathcal{F}$. Let $E_1 = \{\xi_1, \ldots, \xi_k\}$ be a finite subset of $E$ forming a $\delta$-grid on $K$. Since the sequences $\{g_n(\xi_i);\ n \in \mathbb{N}\}$, $i = 1, 2, \ldots, k$, all converge, there exists $N$ such that

$$d_Y(g_m(\xi_i), g_n(\xi_i)) < \varepsilon \quad \text{for } i = 1, 2, \ldots, k \text{ and all } m, n \ge N.$$

For each point $x \in K$ there exists $\xi_j \in E_1$ such that $d_K(x, \xi_j) < \delta$. By the equicontinuity of the family $\mathcal{F}$, it now follows that $d_Y(g_n(x), g_n(\xi_j)) < \varepsilon$ for every $n \in \mathbb{N}$. Using these inequalities, we now find that

$$d_Y(g_m(x), g_n(x)) \le d_Y(g_n(x), g_n(\xi_j)) + d_Y(g_m(\xi_j), g_n(\xi_j)) + d_Y(g_m(x), g_m(\xi_j)) < \varepsilon + \varepsilon + \varepsilon = 3\varepsilon$$

for all $m, n > N$.

But $x$ was an arbitrary point of the compact set $K$, so that, by the Cauchy criterion, the sequence $\{g_n;\ n \in \mathbb{N}\}$ indeed converges uniformly on $K$. □


16.4.2 The Metric Space C(K, Y)

One of the most natural metrics on the set $C(K, Y)$ of functions $f : K \to Y$ that are continuous on a compact set $K$ and assume values in a complete metric space $Y$ is the following metric of uniform convergence:

$$d(f, g) = \max_{x \in K} d_Y(f(x), g(x)),$$

where $f, g \in C(K, Y)$, and the maximum exists, since $K$ is compact. The name of the metric comes from the obvious fact that $d(f_n, f) \to 0 \Longleftrightarrow f_n \rightrightarrows f$ on $K$.

Taking account of this last relation, by Theorem 2 of Sect. 16.3 and the Cauchy criterion for uniform convergence we can conclude that the metric space $C(K, Y)$ with the metric of uniform convergence is complete.

We recall that a precompact subset of a metric space is a subset such that from 
every sequence of its points one can extract a Cauchy (fundamental) subsequence. 
If the original metric space is complete, such a sequence will even be convergent. 

The Arzela-Ascoli theorem gives a description of the precompact subsets of the 
metric space C(K, Y). 

The important theorem we are about to prove gives a description of a large va- 
riety of dense subsets of the space C(K, Y). The natural interest of such subsets 
comes from the fact that one can approximate any continuous function / : K — > Y 
uniformly with absolute error as small as desired by functions from these subsets. 

Example 4 The classical result of Weierstrass, to which we shall often return, and 
which is generalized by Stone’s theorem below, is the following. 

Theorem 2 (Weierstrass) If $f \in C([a, b], \mathbb{C})$, there exists a sequence $\{P_n;\ n \in \mathbb{N}\}$ of polynomials $P_n : [a, b] \to \mathbb{C}$ such that $P_n \rightrightarrows f$ on $[a, b]$. Here, if $f \in C([a, b], \mathbb{R})$, the polynomials can also be chosen from $C([a, b], \mathbb{R})$.

In geometric language this means, for example, that the polynomials with real 
coefficients form an everywhere dense subset of C([a, b], R). 
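The density statement can be made tangible with an explicit approximating sequence. The sketch below uses Bernstein polynomials on $[0, 1]$, which is a different classical construction and not the proof given in this section; it is included only as an illustration. The function names, the target function $|x - 1/2|$, and the sampling grid are assumptions.

```python
import math

def bernstein(f, n, x):
    """n-th Bernstein polynomial of f on [0, 1], evaluated at x."""
    return sum(math.comb(n, k) * f(k / n) * x ** k * (1 - x) ** (n - k)
               for k in range(n + 1))

def sup_error(f, n, samples=401):
    """Grid approximation of max over [0, 1] of |f(x) - B_n f(x)|."""
    worst = 0.0
    for i in range(samples):
        x = i / (samples - 1)
        worst = max(worst, abs(f(x) - bernstein(f, n, x)))
    return worst

def target(x):
    """A continuous, non-differentiable test function on [0, 1]."""
    return abs(x - 0.5)

err_20 = sup_error(target, 20)
err_200 = sup_error(target, 200)
print(err_20, err_200)
```

For this target the error decays only like $1/\sqrt{n}$, which is slow but sufficient for uniform approximation with any preassigned accuracy; a general interval $[a, b]$ reduces to $[0, 1]$ by a linear change of variable.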

Example 5 Although Theorem 2 still requires a nontrivial proof (given below), one can at least conclude from the uniform continuity of any function $f \in C([a, b], \mathbb{R})$ that the piecewise-linear continuous real-valued functions on the interval $[a, b]$ are a dense subset of $C([a, b], \mathbb{R})$.

Remark 1 We note that if $E_1$ is everywhere dense in $E_2$ and $E_2$ is everywhere dense in $E_3$, then $E_1$ is obviously everywhere dense in $E_3$.

This means, for example, that to prove Theorem 2 it suffices to show that a piecewise linear function can be approximated arbitrarily closely by a polynomial on the given interval.


16.4.3 Stone's Theorem

Before proving the general theorem of Stone, we first give the following proof of Theorem 2 (Weierstrass' theorem) for the case of real-valued functions, which is useful in helping to appreciate what is to follow.

Proof We first remark that if $f, g \in C([a, b], \mathbb{R})$, $\alpha \in \mathbb{R}$, and the functions $f$ and $g$ admit a uniform approximation (with arbitrary accuracy) by polynomials, then the continuous functions $f + g$, $f \cdot g$, and $\alpha f$ also admit such an approximation.

On the closed interval $[-1, 1]$, as was shown in Example 2 of Sect. 16.3, the function $|x|$ admits a uniform approximation by polynomials $P_n(x) = \sum_{k=1}^{n} a_k x^k$. Hence, the corresponding sequence of polynomials $M \cdot P_n(x/M)$ gives a uniform approximation to $|x|$ on the closed interval $|x| \le M$.

If $f \in C([a, b], \mathbb{R})$ and $M = \max |f(x)|$, it follows from the inequality $\bigl||y| - \sum_{k=1}^{n} c_k y^k\bigr| < \varepsilon$ for $|y| \le M$ that $\bigl||f(x)| - \sum_{k=1}^{n} c_k f^k(x)\bigr| < \varepsilon$ for $a \le x \le b$. Hence if $f$ admits a uniform approximation by polynomials on $[a, b]$, then $\sum_{k=1}^{n} c_k f^k$ and $|f|$ also admit such an approximation.

Finally, if $f$ and $g$ admit a uniform approximation by polynomials on the closed interval $[a, b]$, then by what has been said, the functions $\max\{f, g\} = \frac{1}{2}\bigl((f + g) + |f - g|\bigr)$ and $\min\{f, g\} = \frac{1}{2}\bigl((f + g) - |f - g|\bigr)$ also admit such an approximation.

Let $a \le \xi_1 < \xi_2 \le b$, $f(x) \equiv 0$, $g_{\xi_1 \xi_2}(x) = \frac{x - \xi_1}{\xi_2 - \xi_1}$, $h(x) \equiv 1$, $\Phi_{\xi_1 \xi_2} = \max\{f, g_{\xi_1 \xi_2}\}$, and $F_{\xi_1 \xi_2} = \min\{h, \Phi_{\xi_1 \xi_2}\}$. Linear combinations of functions of the form $F_{\xi_1 \xi_2}$ obviously generate the entire set of continuous piecewise-linear functions on the closed interval $[a, b]$, from which, by Example 5, Weierstrass' theorem follows. □

Before stating Stone’s theorem, we define some new concepts. 

Definition 3 A set A of real- (or complex-)valued functions on a set X is called a 
real (or complex ) algebra of functions on X if 

(f + g)eA, (f ■ g) e A, (af) e A 

when /, g e A and a e R (or a e C). 

Example 6 Let IcC. The polynomials P (z) — cq + c\ z + C 2 Zr H V c n z n , n e N, 

obviously form a complex algebra of functions on X. 

If we take X — [a. b] C R, and take only polynomials with real coefficients, we 
obtain a real algebra of functions on the closed interval [a , b] . 

Example 7 The linear combinations of functions e" A , n = 0,1,2,... with coeffi- 
cients in R or C also form a (real or complex respectively) algebra on any closed 
interval [a, b] C R. 

The same can be said of linear combinations of the functions {e" !A ; n e Z}. 
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Definition 4 We shall say that a set S of functions on X separates points on X if 
for every pair of distinct points x i , x 2 e X there exists a function f e S such that 

Example 8 The set of functions $\{e^{nx};\ n \in \mathbb{N}\}$, and even each individual function in
the set, separates points on $\mathbb{R}$.

At the same time, the set of $2\pi$-periodic functions $\{e^{inx};\ n \in \mathbb{Z}\}$ separates the points of a
closed interval if its length is less than $2\pi$ and obviously does not separate the points
of an interval of length greater than or equal to $2\pi$.

Example 9 The real polynomials together form a set of functions that separates
the points of every closed interval $[a,b]$, since the polynomial $P(x) = x$ does that
all by itself. What has just been said can be repeated for a set $X \subset \mathbb{C}$ and the set of
complex polynomials on $X$. As a single separating function, one can take $P(z) = z$.

Definition 5 The family $\mathcal{F}$ of functions $f : X \to \mathbb{C}$ does not vanish on $X$ (is
nondegenerate) if for every point $x_0 \in X$ there is a function $f_0 \in \mathcal{F}$ such that $f_0(x_0) \ne 0$.

Example 10 The family $\mathcal{F} = \{1, x, x^2, \ldots\}$ does not vanish on the closed interval
$[0,1]$, but all the functions of the family $\mathcal{F}_0 = \{x, x^2, \ldots\}$ vanish at $x = 0$.

Lemma 2 If an algebra $A$ of real (resp. complex) functions on $X$ separates the
points of $X$ and does not vanish on $X$, then for any two distinct points $x_1, x_2 \in X$
and any real (resp. complex) numbers $c_1, c_2$ there is a function $f$ in $A$ such that
$f(x_1) = c_1$ and $f(x_2) = c_2$.

Proof It obviously suffices to prove the lemma when $c_1 = 0$, $c_2 = 1$ and when
$c_1 = 1$, $c_2 = 0$, since the general case is then obtained as a linear combination.

By the symmetry of the hypotheses on $x_1$ and $x_2$, we consider only the case
$c_1 = 1$, $c_2 = 0$.

We begin by remarking that $A$ contains a special function $s$ separating the points
$x_1$ and $x_2$ that, in addition to the condition $s(x_1) \ne s(x_2)$, also satisfies the condition
$s(x_1) \ne 0$.

Let $g, h \in A$, $g(x_1) \ne g(x_2)$, $g(x_1) = 0$, and $h(x_1) \ne 0$. There is obviously a
number $\lambda \in \mathbb{R} \setminus 0$ such that $\lambda(h(x_1) - h(x_2)) \ne g(x_2)$. The function $s = g + \lambda h$ then
has the required properties.

Now, setting
$$f(x) = \frac{s^2(x) - s(x_2)\,s(x)}{s^2(x_1) - s(x_1)\,s(x_2)},$$
we obtain a function $f$ in the algebra $A$
satisfying $f(x_1) = 1$ and $f(x_2) = 0$. □
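
The last step of the proof can be checked on a concrete case. The following is only a minimal sketch: the helper name `interpolating_function` and the sample function $s(x) = x^2 + 1$ with the points $x_1 = 1$, $x_2 = 2$ are illustrative assumptions, not part of the text.

```python
def interpolating_function(s, x1, x2):
    """Build f = (s^2 - s(x2) s) / (s(x1)^2 - s(x1) s(x2)) from a separating function s.

    Assumes s(x1) != 0 and s(x1) != s(x2), as required in the proof of Lemma 2.
    """
    s1, s2 = s(x1), s(x2)
    denom = s1 * s1 - s1 * s2  # equals s(x1) * (s(x1) - s(x2)), hence nonzero
    if denom == 0:
        raise ValueError("s must satisfy s(x1) != 0 and s(x1) != s(x2)")
    return lambda x: (s(x) ** 2 - s2 * s(x)) / denom


# Hypothetical sample data: s(x) = x^2 + 1 separates x1 = 1 and x2 = 2.
s = lambda x: x * x + 1.0
f = interpolating_function(s, 1.0, 2.0)
print(f(1.0), f(2.0))  # values of f at x1 and x2
```

Since $f$ is built from $s$ using only products and linear combinations, it stays inside the algebra $A$.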

Theorem 3 (Stone$^8$) Let $A$ be an algebra of continuous real-valued functions defined
on a compact set $K$. If $A$ separates the points of $K$ and does not vanish on $K$,
then $A$ is an everywhere-dense subspace of $C(K, \mathbb{R})$.


$^8$M.H. Stone (1903-1989) - American mathematician who worked mainly in topology and
functional analysis.

Proof Let $\bar{A}$ be the closure of the set $A \subset C(K, \mathbb{R})$ in $C(K, \mathbb{R})$, that is, $\bar{A}$ consists
of the continuous functions $f \in C(K, \mathbb{R})$ that can be approximated uniformly with
arbitrary precision by functions of $A$. The theorem asserts that $\bar{A} = C(K, \mathbb{R})$.

Repeating the reasoning in the proof of Weierstrass’ theorem, we note that if
$f, g \in \bar{A}$ and $\alpha \in \mathbb{R}$, then the functions $f + g$, $f \cdot g$, $\alpha f$, $|f|$, $\max\{f,g\}$, $\min\{f,g\}$
also belong to $\bar{A}$. By induction we can verify that in general if $f_1, f_2, \ldots, f_n \in \bar{A}$,
then $\max\{f_1, f_2, \ldots, f_n\}$ and $\min\{f_1, f_2, \ldots, f_n\}$ also lie in $\bar{A}$.

We now show that for every function $f \in C(K, \mathbb{R})$, every point $x \in K$, and every
number $\varepsilon > 0$, there exists a function $g_x \in \bar{A}$ such that $g_x(x) = f(x)$ and
$g_x(t) > f(t) - \varepsilon$ for every $t \in K$.

To verify this, for each point $y \in K$ we use Lemma 2 to choose a function $h_y \in A$
such that $h_y(x) = f(x)$ and $h_y(y) = f(y)$. By the continuity of $f$ and $h_y$ on $K$,
there exists an open neighborhood $U_y$ of $y$ such that $h_y(t) > f(t) - \varepsilon$ for every
$t \in U_y$. From the covering of the compact set $K$ by the open sets $U_y$ we select a finite
covering $\{U_{y_1}, U_{y_2}, \ldots, U_{y_n}\}$. Then the function $g_x = \max\{h_{y_1}, h_{y_2}, \ldots, h_{y_n}\} \in \bar{A}$
will be the desired function.

Now taking such a function $g_x$ for each point $x \in K$, we remark that by the
continuity of $g_x$ and $f$, there exists an open neighborhood $V_x$ of $x \in K$ such that
$g_x(t) < f(t) + \varepsilon$ for every $t \in V_x$. Since $K$ is compact, there exists a finite
covering $\{V_{x_1}, V_{x_2}, \ldots, V_{x_m}\}$ by such neighborhoods. The function $g = \min\{g_{x_1}, \ldots, g_{x_m}\}$
belongs to $\bar{A}$ and, by construction, satisfies both inequalities

$$f(t) - \varepsilon < g(t) < f(t) + \varepsilon$$

at every point $t \in K$.

But the number $\varepsilon > 0$ was arbitrary, so that any function $f \in C(K, \mathbb{R})$ can be
uniformly approximated on $K$ by functions in $A$. □


16.4.4 Problems and Exercises 

1. A family $\mathcal{F}$ of functions $f : X \to Y$ defined on the metric space $X$ and assuming
values in the metric space $Y$ is equicontinuous at $x_0 \in X$ if for every $\varepsilon > 0$ there
exists $\delta > 0$ such that $d_X(x, x_0) < \delta$ implies $d_Y(f(x), f(x_0)) < \varepsilon$ for every $f \in \mathcal{F}$.

a) Show that if a family $\mathcal{F}$ of functions $f : X \to Y$ is equicontinuous at $x_0 \in X$,
then every function $f \in \mathcal{F}$ is continuous at $x_0$, although the converse is not true.

b) Prove that if the family $\mathcal{F}$ of functions $f : K \to Y$ is equicontinuous at each
point of the compact set $K$, then it is equicontinuous on $K$ in the sense of Definition 2.

c) Show that if a metric space $X$ is not compact, then equicontinuity of a family
$\mathcal{F}$ of functions $f : X \to Y$ at each point $x \in X$ does not imply equicontinuity of $\mathcal{F}$
on $X$.

For this reason, if the family $\mathcal{F}$ is equicontinuous on a set $X$ in the sense of
Definition 2, we often call it uniformly equicontinuous on the set. Thus, the relation
between equicontinuity at a point and uniform equicontinuity of a family of functions
on a set $X$ is the same as that between continuity and uniform continuity of an
individual function $f : X \to Y$ on the set $X$.

d) Let $\omega(f; E)$ be the oscillation of the function $f : X \to Y$ on the set $E \subset X$,
and $B(x, \delta)$ the ball of radius $\delta$ with center at $x \in X$. What concepts are defined by
the following formulas?

$$\forall \varepsilon > 0\ \exists \delta > 0\ \forall f \in \mathcal{F}\quad \omega(f; B(x, \delta)) < \varepsilon,$$

$$\forall \varepsilon > 0\ \exists \delta > 0\ \forall f \in \mathcal{F}\ \forall x \in X\quad \omega(f; B(x, \delta)) < \varepsilon.$$

e) Show by example that the Arzelà-Ascoli theorem is in general not true if
$K$ is not compact: construct a uniformly bounded and equicontinuous sequence
$\{f_n;\ n \in \mathbb{N}\}$ of functions $f_n(x) = \varphi(x + n)$ from which it is not possible to extract a
subsequence that converges uniformly on $\mathbb{R}$.

f) Using the Arzela-Ascoli theorem, solve Problem 10c) from Sect. 16.3. 

2. a) Explain in detail why every continuous piecewise-linear function on a closed
interval $[a,b]$ can be represented as a linear combination of functions of the form
$F_{\xi_1\xi_2}$ shown in the proof of Weierstrass’ theorem.

b) Prove Weierstrass’ theorem for complex-valued functions $f : [a,b] \to \mathbb{C}$.

c) The quantity $M_n = \int_a^b f(x)\,x^n\,dx$ is often called the $n$th moment of the function
$f : [a,b] \to \mathbb{C}$ on the closed interval $[a,b]$. Show that if $f \in C([a,b], \mathbb{C})$ and
$M_n = 0$ for all $n \in \mathbb{N}$, then $f(x) \equiv 0$ on $[a,b]$.

3. a) Show that the algebra generated by the pair of functions $\{1, x^2\}$ is dense in
the set of all even functions that are continuous on $[-1, 1]$.

b) Solve the preceding problem for the algebra generated by the single function
$\{x\}$ and the set of odd functions that are continuous on $[-1, 1]$.

c) Is it possible to approximate every function $f \in C([0, \pi], \mathbb{C})$ uniformly with
arbitrary precision by functions in the algebra generated by the pair of functions
$\{1, e^{ix}\}$?

d) Answer the preceding question in the case of $f \in C([-\pi, \pi], \mathbb{C})$.

e) Show that the answer to the preceding question is positive if and only if
$f(-\pi) = f(\pi)$.

f) Can every function $f \in C([a,b], \mathbb{C})$ be uniformly approximated by linear
combinations of the functions $\{1, \cos x, \sin x, \ldots, \cos nx, \sin nx, \ldots\}$ if
$[a,b] \subset\, ]-\pi, \pi[$?

g) Can any even function $f \in C([-\pi, \pi], \mathbb{C})$ be uniformly approximated by
functions of the system $\{1, \cos x, \ldots, \cos nx, \ldots\}$?

h) Let $[a,b]$ be an arbitrary closed interval on the real line $\mathbb{R}$. Show that the
algebra generated on $[a,b]$ by any nonvanishing strictly monotonic function $\varphi(x)$
(for example, $e^x$) is dense in $C([a,b], \mathbb{R})$.

i) For which location of the closed interval $[a,b] \subset \mathbb{R}$ is the algebra generated
by $\varphi(x) = x$ dense in $C([a,b], \mathbb{R})$?

4. a) A complex algebra of functions $A$ is self-adjoint if it follows from $f \in A$
that $\bar{f} \in A$, where $\bar{f}(x)$ is the value conjugate to $f(x)$. Show that if a complex
algebra $A$ is nondegenerate on $X$ and separates the points of $X$, then, given that $A$
is self-adjoint, one can assert that the subalgebra $A_{\mathbb{R}}$ of real-valued functions in $A$
is also nondegenerate on $X$ and also separates points on $X$.

b) Prove the following complex version of Stone’s theorem.

If a complex algebra $A$ of functions $f : X \to \mathbb{C}$ is nondegenerate on $X$ and
separates the points of $X$, then, given that it is self-adjoint, one can assert that it is
dense in $C(X, \mathbb{C})$.

c) Let $X = \{z \in \mathbb{C} \mid |z| = 1\}$ be the unit circle and $A$ the algebra on $X$ generated
by the function $e^{i\varphi}$, where $\varphi$ is the polar angle of the point $z \in \mathbb{C}$. This algebra is
nondegenerate on $X$ and separates the points of $X$, but is not self-adjoint.

Prove that the equalities $\int_0^{2\pi} f(e^{i\varphi})\,e^{in\varphi}\,d\varphi = 0$, $n \in \mathbb{N}$, must hold for any
function $f : X \to \mathbb{C}$ that admits uniform approximation by elements of $A$. Using this
fact, verify that the restriction of the function $f(z) = \bar{z}$ to the circle $X$ is a continuous
function on $X$ that does not belong to the closure of the algebra $A$.



Chapter 17 

Integrals Depending on a Parameter 


In this chapter the general theorems on families of functions depending on a param- 
eter will be applied to the type of family most frequently encountered in analysis - 
integrals depending on a parameter. 

17.1 Proper Integrals Depending on a Parameter 

17.1.1 The Concept of an Integral Depending on a Parameter 

An integral depending on a parameter is a function of the form

$$F(t) = \int_{E_t} f(x, t)\,dx, \tag{17.1}$$

where $t$ plays the role of a parameter ranging over a set $T$, and to each value $t \in T$
there corresponds a set $E_t$ and a function $\varphi_t(x) = f(x, t)$ that is integrable over $E_t$
in the proper or improper sense.

The nature of the set $T$ may be quite varied, but of course the most important
cases occur when $T$ is a subset of $\mathbb{R}$, $\mathbb{C}$, $\mathbb{R}^n$, or $\mathbb{C}^n$.

If the integral (17.1) is a proper integral for each value of the parameter t e T, 
we say that the function F in (17.1) is a proper integral depending on a parameter. 

But if the integral in (17.1) exists only as an improper integral for some or all 
of the values of t e T, we usually call F an improper integral depending on a 
parameter. 

But these are of course merely terminological conventions. 

When $x \in \mathbb{R}^m$, $E_t \subset \mathbb{R}^m$, and $m > 1$, we say that we are dealing with a multiple
(double, triple, and so forth) integral (17.1) depending on a parameter.

We shall concentrate, however, on the one-dimensional case, which forms the
foundation for all generalizations. Moreover, for the sake of simplicity, we shall
first take $E_t$ to be intervals of the real line $\mathbb{R}$ independent of the parameter, and we
shall assume that the integral (17.1) over these intervals exists as a proper integral.

17.1.2 Continuity of an Integral Depending on a Parameter


Proposition 1 Let $P = \{(x, y) \in \mathbb{R}^2 \mid a \le x \le b \wedge c \le y \le d\}$ be a rectangle in the
plane $\mathbb{R}^2$. If the function $f : P \to \mathbb{R}$ is continuous, that is, if $f \in C(P, \mathbb{R})$, then the
function

$$F(y) = \int_a^b f(x, y)\,dx \tag{17.2}$$

is continuous at every point $y \in [c, d]$.


Proof It follows from the uniform continuity of the function $f$ on the compact set $P$
that $\varphi_y(x) := f(x, y) \rightrightarrows f(x, y_0) =: \varphi_{y_0}(x)$ on $[a,b]$ as $y \to y_0$, for $y, y_0 \in [c, d]$.
For each $y \in [c, d]$ the function $\varphi_y(x) = f(x, y)$ is continuous with respect to $x$ on
the closed interval $[a,b]$ and hence integrable over that interval. By the theorem on
passage to the limit under an integral sign we can now assert that

$$F(y_0) = \int_a^b f(x, y_0)\,dx = \lim_{y \to y_0} \int_a^b f(x, y)\,dx = \lim_{y \to y_0} F(y).$$

□

Remark 1 As can be seen from this proof, Proposition 1 on the continuity of the
function (17.2) remains valid if we take any compact set $K$ as the set of values of
the parameter $y$, assuming, of course, that $f \in C(I \times K, \mathbb{R})$, where
$I = \{x \in \mathbb{R} \mid a \le x \le b\}$.

Hence, in particular, one can conclude that if $f \in C(I \times D, \mathbb{R})$, where $D$ is
an open set in $\mathbb{R}^n$, then $F \in C(D, \mathbb{R})$, since every point $y_0 \in D$ has a compact
neighborhood $K \subset D$, and the restriction of $f$ to $I \times K$ is a continuous function on
the compact set $I \times K$.

We have stated Proposition 1 for real-valued functions, but of course it and its
proof remain valid for vector-valued functions, for example, for functions assuming
values in $\mathbb{C}$, $\mathbb{R}^m$, or $\mathbb{C}^m$.



Example 1 In the proof of Morse’s lemma (see Sect. 8.6, Part 1) we mentioned the
following proposition, called Hadamard’s lemma.

If a function $f$ belongs to the class $C^{(1)}(U, \mathbb{R})$ in a neighborhood $U$ of the point
$x_0$, then in some neighborhood of $x_0$ it can be represented in the form

$$f(x) = f(x_0) + \varphi(x)(x - x_0), \tag{17.3}$$

where $\varphi$ is a continuous function and $\varphi(x_0) = f'(x_0)$.

Equality (17.3) follows easily from the Newton-Leibniz formula

$$f(x_0 + h) - f(x_0) = \int_0^1 f'(x_0 + th)\,dt \cdot h \tag{17.4}$$

and Proposition 1 applied to the function $F(h) = \int_0^1 f'(x_0 + th)\,dt$. All that remains
is to make the substitution $h = x - x_0$ and set $\varphi(x) = F(x - x_0)$.
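
Hadamard's representation (17.3) can be checked numerically. The sketch below is illustrative only: the Simpson helper `simpson`, the name `hadamard_phi`, and the choice $f = \exp$ with $x_0 = 0.5$ are assumptions made for the example.

```python
import math


def simpson(g, a, b, m=200):
    """Composite Simpson rule on [a, b] with an even number m of subintervals."""
    h = (b - a) / m
    total = g(a) + g(b)
    for i in range(1, m):
        total += (4 if i % 2 else 2) * g(a + i * h)
    return total * h / 3


def hadamard_phi(df, x0, x):
    """phi(x) = integral over t in [0, 1] of f'(x0 + t (x - x0)), as in (17.4)."""
    return simpson(lambda t: df(x0 + t * (x - x0)), 0.0, 1.0)


# Illustrative choice: f = exp, so that f' = exp as well.
f, df = math.exp, math.exp
x0, x = 0.5, 1.7
lhs = f(x)
rhs = f(x0) + hadamard_phi(df, x0, x) * (x - x0)
print(lhs, rhs)
```

At $x = x_0$ the integrand is constant, so `hadamard_phi(df, x0, x0)` reproduces $f'(x_0)$, in agreement with $\varphi(x_0) = f'(x_0)$.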

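It is useful to remark that Eq. (17.4) holds for $x_0, h \in \mathbb{R}^n$, where $n$ is not restricted
to the value 1. Writing out the symbol $f'$ in more detail, and for simplicity setting
$x_0 = 0$, one can write, instead of (17.4),

$$f(x^1, \ldots, x^n) - f(0, \ldots, 0) = \sum_{i=1}^{n} \int_0^1 \frac{\partial f}{\partial x^i}(tx^1, \ldots, tx^n)\,dt \cdot x^i,$$

and then one should set

$$\varphi(x)x = \sum_{i=1}^{n} \varphi_i(x)\,x^i$$

in Eq. (17.3), where $\varphi_i(x) = \int_0^1 \frac{\partial f}{\partial x^i}(tx)\,dt$.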


17.1.3 Differentiation of an Integral Depending on a Parameter 


Proposition 2 If the function $f : P \to \mathbb{R}$ is continuous and has a continuous partial
derivative with respect to $y$ on the rectangle
$P = \{(x, y) \in \mathbb{R}^2 \mid a \le x \le b \wedge c \le y \le d\}$,
then the integral (17.2) belongs to $C^{(1)}([c, d], \mathbb{R})$, and

$$F'(y) = \int_a^b \frac{\partial f}{\partial y}(x, y)\,dx. \tag{17.5}$$

Formula (17.5) for differentiating the proper integral (17.2) with respect to a 
parameter is frequently called Leibniz’ formula or Leibniz’ rule. 


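Proof We shall verify directly that if $y_0 \in [c, d]$, then $F'(y_0)$ can be computed by
formula (17.5):

$$\begin{aligned}
\left| F(y_0 + h) - F(y_0) - \left( \int_a^b \frac{\partial f}{\partial y}(x, y_0)\,dx \right) h \right|
&= \left| \int_a^b \left( f(x, y_0 + h) - f(x, y_0) - \frac{\partial f}{\partial y}(x, y_0)\,h \right) dx \right| \\
&\le \int_a^b \left| f(x, y_0 + h) - f(x, y_0) - \frac{\partial f}{\partial y}(x, y_0)\,h \right| dx \\
&\le \int_a^b \sup_{0 < \theta < 1} \left| \frac{\partial f}{\partial y}(x, y_0 + \theta h) - \frac{\partial f}{\partial y}(x, y_0) \right| dx \cdot |h|
= \varphi(y_0, h) \cdot |h|.
\end{aligned}$$

By hypothesis $\frac{\partial f}{\partial y} \in C(P, \mathbb{R})$, so that
$\frac{\partial f}{\partial y}(x, y) \rightrightarrows \frac{\partial f}{\partial y}(x, y_0)$ on the closed interval
$a \le x \le b$ as $y \to y_0$, from which it follows that $\varphi(y_0, h) \to 0$ as $h \to 0$.

□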
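
Leibniz' rule (17.5) is easy to test numerically. The sketch below is illustrative only: the sample function $f(x, y) = \sin(xy)$ on $[0, 1]$, the helper names, and the step of the difference quotient are assumptions made for the example.

```python
import math


def simpson(g, a, b, m=200):
    """Composite Simpson rule on [a, b] with an even number m of subintervals."""
    h = (b - a) / m
    total = g(a) + g(b)
    for i in range(1, m):
        total += (4 if i % 2 else 2) * g(a + i * h)
    return total * h / 3


def F(y):
    """F(y) = integral of sin(x y) over x in [0, 1], an instance of (17.2)."""
    return simpson(lambda x: math.sin(x * y), 0.0, 1.0)


def F_prime_leibniz(y):
    """Right-hand side of (17.5): integrate d/dy sin(x y) = x cos(x y)."""
    return simpson(lambda x: x * math.cos(x * y), 0.0, 1.0)


def F_prime_numeric(y, h=1e-5):
    """Central difference quotient of F, used only as an independent check."""
    return (F(y + h) - F(y - h)) / (2 * h)


def F_prime_exact(y):
    """Derivative of the closed form F(y) = (1 - cos y) / y, valid for y != 0."""
    return (y * math.sin(y) - (1 - math.cos(y))) / (y * y)


print(F_prime_leibniz(0.8), F_prime_numeric(0.8), F_prime_exact(0.8))
```

All three values should agree to many digits, since the integrand and its partial derivative are continuous on the rectangle.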

Remark 2 The continuity of the original function $f$ is used in the proof only as a
sufficient condition for the existence of all the integrals that appear in the proof.

Remark 3 The proof just given and the form of the mean-value theorem used in it
show that Proposition 2 remains valid if the closed interval $[c, d]$ is replaced by any
convex compact set in any normed vector space. Here one may obviously assume as
well that $f$ takes values in some complete normed vector space.

In particular - and this is sometimes very useful - formula (17.5) is also applicable
to complex-valued functions $F$ of a complex variable and to functions
$F(y) = F(y^1, \ldots, y^n)$ of a vector parameter $y = (y^1, \ldots, y^n) \in \mathbb{C}^n$.

In this case $\frac{\partial f}{\partial y}$ can of course be written coordinatewise as
$\bigl(\frac{\partial f}{\partial y^1}, \ldots, \frac{\partial f}{\partial y^n}\bigr)$, and
then (17.5) yields the corresponding partial derivatives

$$\frac{\partial F}{\partial y^i}(y) = \int_a^b \frac{\partial f}{\partial y^i}(x, y^1, \ldots, y^n)\,dx$$

of the function $F$.


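Example 2 Let us verify that the function $u(x) = \int_0^{\pi} \cos(n\varphi - x \sin\varphi)\,d\varphi$ satisfies
Bessel’s equation $x^2 u'' + x u' + (x^2 - n^2)u = 0$.

Indeed, after carrying out the differentiation with formula (17.5) and making
simple transformations we find

$$\begin{aligned}
x^2 u'' + x u' + (x^2 - n^2)u
&= -x^2 \int_0^{\pi} \sin^2\varphi \cos(n\varphi - x\sin\varphi)\,d\varphi
 + x \int_0^{\pi} \sin\varphi \sin(n\varphi - x\sin\varphi)\,d\varphi \\
&\quad + (x^2 - n^2) \int_0^{\pi} \cos(n\varphi - x\sin\varphi)\,d\varphi \\
&= -\int_0^{\pi} \bigl( (x^2\sin^2\varphi + n^2 - x^2)\cos(n\varphi - x\sin\varphi)
 - x\sin\varphi \sin(n\varphi - x\sin\varphi) \bigr)\,d\varphi \\
&= -(n + x\cos\varphi)\sin(n\varphi - x\sin\varphi)\Big|_0^{\pi} = 0.
\end{aligned}$$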

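Example 3 The complete elliptic integrals

$$E(k) = \int_0^{\pi/2} \sqrt{1 - k^2\sin^2\varphi}\;d\varphi, \qquad
K(k) = \int_0^{\pi/2} \frac{d\varphi}{\sqrt{1 - k^2\sin^2\varphi}} \tag{17.6}$$

as functions of the parameter $k$, $0 < k < 1$, called the modulus of the corresponding
elliptic integral, are connected by the relations

$$\frac{dE}{dk} = \frac{E - K}{k}, \qquad \frac{dK}{dk} = \frac{E}{k(1 - k^2)} - \frac{K}{k}.$$

Let us verify, for example, the first of these. By formula (17.5)

$$\begin{aligned}
\frac{dE}{dk} &= -\int_0^{\pi/2} k\sin^2\varphi \cdot (1 - k^2\sin^2\varphi)^{-1/2}\,d\varphi \\
&= \frac{1}{k}\int_0^{\pi/2} (1 - k^2\sin^2\varphi)^{1/2}\,d\varphi
 - \frac{1}{k}\int_0^{\pi/2} (1 - k^2\sin^2\varphi)^{-1/2}\,d\varphi = \frac{E - K}{k}.
\end{aligned}$$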
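
Both relations can be tested numerically. The sketch below is illustrative only: the helper names, the modulus $k = 0.5$, and the use of a central difference quotient as an independent approximation to the derivatives are assumptions made for the example.

```python
import math


def simpson(g, a, b, m=200):
    """Composite Simpson rule on [a, b] with an even number m of subintervals."""
    h = (b - a) / m
    total = g(a) + g(b)
    for i in range(1, m):
        total += (4 if i % 2 else 2) * g(a + i * h)
    return total * h / 3


def E(k):
    """Complete elliptic integral E(k) from (17.6)."""
    return simpson(lambda p: math.sqrt(1 - (k * math.sin(p)) ** 2), 0.0, math.pi / 2)


def K(k):
    """Complete elliptic integral K(k) from (17.6)."""
    return simpson(lambda p: 1 / math.sqrt(1 - (k * math.sin(p)) ** 2), 0.0, math.pi / 2)


def derivative(func, k, h=1e-5):
    """Central difference quotient, used only as an independent check."""
    return (func(k + h) - func(k - h)) / (2 * h)


k = 0.5
dE_numeric, dE_formula = derivative(E, k), (E(k) - K(k)) / k
dK_numeric, dK_formula = derivative(K, k), E(k) / (k * (1 - k * k)) - K(k) / k
print(dE_numeric, dE_formula, dK_numeric, dK_formula)
```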


Example 4 Formula (17.5) sometimes makes it possible even to compute the integral. Let

$$F(\alpha) = \int_0^{\pi/2} \ln(\alpha^2 - \sin^2\varphi)\,d\varphi \qquad (\alpha > 1).$$

According to formula (17.5)

$$F'(\alpha) = \int_0^{\pi/2} \frac{2\alpha\,d\varphi}{\alpha^2 - \sin^2\varphi} = \frac{\pi}{\sqrt{\alpha^2 - 1}},$$

from which we find $F(\alpha) = \pi \ln\bigl(\alpha + \sqrt{\alpha^2 - 1}\bigr) + c$.

The constant $c$ is also easy to find, if we note that, on the one hand,
$F(\alpha) = \pi \ln\alpha + \pi \ln 2 + c + o(1)$ as $\alpha \to +\infty$, and on the other hand, from the definition of
$F(\alpha)$, taking account of the equality $\ln(\alpha^2 - \sin^2\varphi) = 2\ln\alpha + o(1)$ as $\alpha \to +\infty$,
we have $F(\alpha) = \pi \ln\alpha + o(1)$. Hence $\pi \ln 2 + c = 0$ and so
$F(\alpha) = \pi \ln \frac{1}{2}\bigl(\alpha + \sqrt{\alpha^2 - 1}\bigr)$.
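
The closed form can be compared with a direct numerical evaluation of the integral. The sketch below is illustrative only: the helper names and the sample values of $\alpha$ are assumptions made for the example.

```python
import math


def simpson(g, a, b, m=200):
    """Composite Simpson rule on [a, b] with an even number m of subintervals."""
    h = (b - a) / m
    total = g(a) + g(b)
    for i in range(1, m):
        total += (4 if i % 2 else 2) * g(a + i * h)
    return total * h / 3


def F_numeric(alpha):
    """Direct quadrature of the integral of ln(alpha^2 - sin^2 p) over [0, pi/2]."""
    return simpson(lambda p: math.log(alpha * alpha - math.sin(p) ** 2), 0.0, math.pi / 2)


def F_closed(alpha):
    """Closed form pi * ln((alpha + sqrt(alpha^2 - 1)) / 2) derived in Example 4."""
    return math.pi * math.log((alpha + math.sqrt(alpha * alpha - 1)) / 2)


for alpha in (1.5, 2.0):
    print(alpha, F_numeric(alpha), F_closed(alpha))
```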


Proposition 2 can be strengthened slightly. 


Proposition 2' Suppose the function $f : P \to \mathbb{R}$ is continuous and has a continuous
partial derivative $\frac{\partial f}{\partial y}$ on the rectangle
$P = \{(x, y) \in \mathbb{R}^2 \mid a \le x \le b \wedge c \le y \le d\}$;
further suppose $\alpha(y)$ and $\beta(y)$ are continuously differentiable functions on $[c, d]$
whose values lie in $[a, b]$ for every $y \in [c, d]$. Then the integral

$$F(y) = \int_{\alpha(y)}^{\beta(y)} f(x, y)\,dx \tag{17.7}$$

is defined for every $y \in [c, d]$ and belongs to $C^{(1)}([c, d], \mathbb{R})$, and the following
formula holds:

$$F'(y) = f(\beta(y), y) \cdot \beta'(y) - f(\alpha(y), y) \cdot \alpha'(y)
 + \int_{\alpha(y)}^{\beta(y)} \frac{\partial f}{\partial y}(x, y)\,dx. \tag{17.8}$$

Proof In accordance with the rule for differentiating an integral with respect to the
limits of integration, taking account of formula (17.5), we can say that if
$\alpha, \beta \in [a, b]$ and $y \in [c, d]$, then the function

$$\Phi(\alpha, \beta, y) = \int_{\alpha}^{\beta} f(x, y)\,dx$$

has the following partial derivatives:

$$\frac{\partial \Phi}{\partial \beta} = f(\beta, y), \qquad
\frac{\partial \Phi}{\partial \alpha} = -f(\alpha, y), \qquad
\frac{\partial \Phi}{\partial y} = \int_{\alpha}^{\beta} \frac{\partial f}{\partial y}(x, y)\,dx.$$

Taking account of Proposition 1, we conclude that all the partial derivatives of $\Phi$
are continuous in its domain of definition. Hence $\Phi$ is continuously differentiable.
Formula (17.8) now follows from the chain rule for differentiation of the composite
function $F(y) = \Phi(\alpha(y), \beta(y), y)$. □
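
Formula (17.8) can be tested on a concrete case with variable limits. The sketch below is illustrative only: the choices $f(x, y) = e^{xy}$, $\alpha(y) = y$, $\beta(y) = y^2$, the helper names, and the point $y = 1.2$ are assumptions made for the example.

```python
import math


def simpson(g, a, b, m=200):
    """Composite Simpson rule on [a, b] with an even number m of subintervals."""
    h = (b - a) / m
    total = g(a) + g(b)
    for i in range(1, m):
        total += (4 if i % 2 else 2) * g(a + i * h)
    return total * h / 3


def F(y):
    """F(y) = integral of exp(x y) over x from alpha(y) = y to beta(y) = y^2."""
    return simpson(lambda x: math.exp(x * y), y, y * y)


def F_prime_formula(y):
    """Right-hand side of (17.8) with alpha'(y) = 1 and beta'(y) = 2 y."""
    boundary = math.exp(y * y * y) * 2 * y - math.exp(y * y) * 1.0
    interior = simpson(lambda x: x * math.exp(x * y), y, y * y)  # integral of df/dy
    return boundary + interior


def F_prime_numeric(y, h=1e-5):
    """Central difference quotient of F, used only as an independent check."""
    return (F(y + h) - F(y - h)) / (2 * h)


print(F_prime_formula(1.2), F_prime_numeric(1.2))
```

The two boundary terms come from the dependence of the limits on $y$, and the remaining integral is the Leibniz term of (17.5).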


Example 5 Let

$$F_n(x) = \frac{1}{(n-1)!} \int_0^x (x - t)^{n-1} f(t)\,dt,$$

where $n \in \mathbb{N}$ and $f$ is a function that is continuous on the interval of integration. Let
us verify that $F_n^{(n)}(x) = f(x)$.

For $n = 1$ we have $F_1(x) = \int_0^x f(t)\,dt$ and $F_1'(x) = f(x)$.

By formula (17.8) we find for $n > 1$ that

$$F_n'(x) = \frac{1}{(n-1)!}(x - x)^{n-1} f(x) + \frac{1}{(n-2)!} \int_0^x (x - t)^{n-2} f(t)\,dt = F_{n-1}(x).$$

We now conclude by induction that indeed $F_n^{(n)}(x) = f(x)$ for every $n \in \mathbb{N}$.
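
In other words, $F_n$ is the $n$-fold antiderivative of $f$ that vanishes at $0$ together with its first $n - 1$ derivatives. The sketch below checks this for one case; it is illustrative only, with $f = \cos$ and the helper names chosen as assumptions. For $f = \cos$, integrating three times from $0$ gives $\sin x$, then $1 - \cos x$, then $x - \sin x$.

```python
import math


def simpson(g, a, b, m=400):
    """Composite Simpson rule on [a, b] with an even number m of subintervals."""
    h = (b - a) / m
    total = g(a) + g(b)
    for i in range(1, m):
        total += (4 if i % 2 else 2) * g(a + i * h)
    return total * h / 3


def repeated_integral(f, n, x):
    """F_n(x) = 1/(n-1)! * integral over t in [0, x] of (x - t)^(n-1) f(t)."""
    return simpson(lambda t: (x - t) ** (n - 1) * f(t), 0.0, x) / math.factorial(n - 1)


x = 1.3
print(repeated_integral(math.cos, 1, x), math.sin(x))
print(repeated_integral(math.cos, 3, x), x - math.sin(x))
```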


17.1.4 Integration of an Integral Depending on a Parameter 


Proposition 3 If the function $f : P \to \mathbb{R}$ is continuous in the rectangle
$P = \{(x, y) \in \mathbb{R}^2 \mid a \le x \le b \wedge c \le y \le d\}$, then the integral (17.2) is integrable over
the closed interval $[c, d]$ and the following equality holds:

$$\int_c^d \left( \int_a^b f(x, y)\,dx \right) dy = \int_a^b \left( \int_c^d f(x, y)\,dy \right) dx. \tag{17.9}$$


Proof From the point of view of multiple integrals, Eq. (17.9) is an elementary 
version of Fubini’s theorem. However, we shall give a proof of (17.9) that justifies 
it independently of Fubini’s theorem. 

Consider the functions

$$\varphi(u) = \int_c^u \left( \int_a^b f(x, y)\,dx \right) dy, \qquad
\psi(u) = \int_a^b \left( \int_c^u f(x, y)\,dy \right) dx.$$

Since $f \in C(P, \mathbb{R})$, by Proposition 1 and the continuous dependence of the
integral on the upper limit of integration, we conclude that $\varphi$ and $\psi$ belong to
$C([c, d], \mathbb{R})$. Then, by the continuity of the function (17.2), we find that
$\varphi'(u) = \int_a^b f(x, u)\,dx$, and finally by formula (17.5) that
$\psi'(u) = \int_a^b f(x, u)\,dx$ for $u \in [c, d]$. Thus $\varphi'(u) = \psi'(u)$, and hence
$\varphi(u) = \psi(u) + C$ on $[c, d]$. But since $\varphi(c) = \psi(c) = 0$, we have
$\varphi(u) = \psi(u)$ on $[c, d]$, from which relation (17.9) follows for $u = d$. □
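
Equality (17.9) can be illustrated by computing both iterated integrals numerically. The sketch below is illustrative only: the sample function $f(x, y) = x\cos(xy)$ on $[0, 1] \times [0, 2]$ and the helper names are assumptions. For this $f$ the inner integral over $y$ equals $\sin 2x$, so the common value is $(1 - \cos 2)/2$.

```python
import math


def simpson(g, a, b, m=100):
    """Composite Simpson rule on [a, b] with an even number m of subintervals."""
    h = (b - a) / m
    total = g(a) + g(b)
    for i in range(1, m):
        total += (4 if i % 2 else 2) * g(a + i * h)
    return total * h / 3


def integrate_dx_then_dy(f, a, b, c, d):
    """Left side of (17.9): integrate in x first, then in y."""
    return simpson(lambda y: simpson(lambda x: f(x, y), a, b), c, d)


def integrate_dy_then_dx(f, a, b, c, d):
    """Right side of (17.9): integrate in y first, then in x."""
    return simpson(lambda x: simpson(lambda y: f(x, y), c, d), a, b)


f = lambda x, y: x * math.cos(x * y)
left = integrate_dx_then_dy(f, 0.0, 1.0, 0.0, 2.0)
right = integrate_dy_then_dx(f, 0.0, 1.0, 0.0, 2.0)
exact = (1 - math.cos(2.0)) / 2
print(left, right, exact)
```

Because both routines use the same tensor-product quadrature nodes, agreement between `left` and `right` mostly reflects the commutativity of finite sums; the comparison with `exact` is the more informative check.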


17.1.5 Problems and Exercises 


1. a) Explain why the function $F(y)$ in (17.2) has the limit $\int_a^b \varphi(x)\,dx$ if the family
of functions $\varphi_y(x) = f(x, y)$ depending on the parameter $y \in Y$ and integrable
over the closed interval $a \le x \le b$ converges uniformly on that closed interval to a
function $\varphi(x)$ over some base $\mathcal{B}$ in $Y$ (for example, the base $y \to y_0$).

b) Prove that if $E$ is a measurable set in $\mathbb{R}^m$ and the function $f : E \times I^n \to \mathbb{R}$
defined on the direct product $E \times I^n = \{(x, t) \in \mathbb{R}^{m+n} \mid x \in E \wedge t \in I^n\}$ of the set
$E$ and the $n$-dimensional interval $I^n$ is continuous, then the function $F$ defined by
(17.1) for $E_t = E$ is continuous on $I^n$.

c) Let $P = \{(x, y) \in \mathbb{R}^2 \mid a \le x \le b \wedge c \le y \le d\}$, and let $f \in C(P, \mathbb{R})$,
$\alpha, \beta \in C([c, d], [a, b])$. Prove that in that case the function (17.7) is continuous on the
closed interval $[c, d]$.

2. a) Prove that if $f \in C(\mathbb{R}, \mathbb{R})$, then the function
$F(x) = \frac{1}{2a}\int_{-a}^{a} f(x + t)\,dt$ is not
only continuous, but also differentiable on $\mathbb{R}$.

b) Find the derivative of this function $F(x)$ and verify that $F \in C^{(1)}(\mathbb{R}, \mathbb{R})$.

3. Using differentiation with respect to the parameter, show that for $|r| < 1$

$$F(r) = \int_0^{\pi} \ln(1 - 2r\cos x + r^2)\,dx = 0.$$


4. Verify that the following functions satisfy Bessel’s equation of Example 2:

a) $u = x^n \int_0^{\pi} \cos(x\cos\varphi)\sin^{2n}\varphi\,d\varphi$;

b) $J_n(x) = \frac{x^n}{(2n-1)!!\,\pi} \int_{-1}^{1} (1 - t^2)^{n - 1/2} \cos xt\,dt$.

c) Show that the functions $J_n$ corresponding to different values of $n \in \mathbb{N}$ are
connected by the relation $J_{n+1} = J_{n-1} - 2J_n'$.

5. Developing Example 3 and setting $\tilde{k} := \sqrt{1 - k^2}$, $\tilde{E}(k) := E(\tilde{k})$, $\tilde{K}(k) := K(\tilde{k})$,
show, following Legendre, that

a) $\frac{d}{dk}\bigl(E\tilde{K} + \tilde{E}K - K\tilde{K}\bigr) = 0$.

b) $E\tilde{K} + \tilde{E}K - K\tilde{K} = \pi/2$.

6. Instead of the integral (17.2), consider the integral

$$\mathcal{F}(y) = \int_a^b f(x, y)\,g(x)\,dx,$$

where $g$ is a function that is integrable over the closed interval $[a, b]$ ($g \in \mathcal{R}[a, b]$).

By repeating the proofs of Propositions 1-3 above verify successively that

a) if the function $f$ satisfies the hypotheses of Proposition 1, then $\mathcal{F}$ is continuous
on $[c, d]$ ($\mathcal{F} \in C[c, d]$);

b) if $f$ satisfies the hypotheses of Proposition 2, then $\mathcal{F}$ is continuously
differentiable on $[c, d]$ ($\mathcal{F} \in C^{(1)}[c, d]$), and

$$\mathcal{F}'(y) = \int_a^b \frac{\partial f}{\partial y}(x, y)\,g(x)\,dx;$$

c) if $f$ satisfies the hypotheses of Proposition 3, then $\mathcal{F}$ is integrable over $[c, d]$
($\mathcal{F} \in \mathcal{R}[c, d]$) and

$$\int_c^d \mathcal{F}(y)\,dy = \int_a^b \left( \int_c^d f(x, y)\,dy \right) g(x)\,dx.$$

7. Taylor’s formula and Hadamard’s lemma.

a) Show that if $f$ is a smooth function and $f(0) = 0$, then $f(x) = x\varphi(x)$, where
$\varphi$ is a continuous function and $\varphi(0) = f'(0)$.

b) Show that if $f \in C^{(n)}$ and $f^{(k)}(0) = 0$ for $k = 0, 1, \ldots, n - 1$, then
$f(x) = x^n\varphi(x)$, where $\varphi$ is a continuous function and $\varphi(0) = \frac{1}{n!}f^{(n)}(0)$.

c) Let $f$ be a $C^{(n)}$ function defined in a neighborhood of 0. Verify that the
following version of Taylor’s formula with the Hadamard form of the remainder
holds:

$$f(x) = f(0) + \frac{1}{1!}f'(0)x + \cdots + \frac{1}{(n-1)!}f^{(n-1)}(0)x^{n-1} + x^n\varphi(x),$$

where $\varphi$ is a function that is continuous on a neighborhood of zero, and
$\varphi(0) = \frac{1}{n!}f^{(n)}(0)$.

d) Generalize the results of a), b), and c) to the case when $f$ is a function of
several variables. Write the basic Taylor formula in multi-index notation:

$$f(x) = \sum_{|\alpha| = 0}^{n-1} \frac{1}{\alpha!} D^{\alpha}f(0)\,x^{\alpha} + \sum_{|\alpha| = n} x^{\alpha}\varphi_{\alpha}(x),$$

and note, in addition to what was stated in a), b), and c), that if $f \in C^{(n+p)}$, then
$\varphi_{\alpha} \in C^{(p)}$.

17.2 Improper Integrals Depending on a Parameter

17.2.1 Uniform Convergence of an Improper Integral with Respect 
to a Parameter 


a. Basic Definition and Examples 

Suppose that the improper integral

$$F(y) = \int_a^{\omega} f(x, y)\,dx \tag{17.10}$$

over the interval $[a, \omega[\, \subset \mathbb{R}$ converges for each value $y \in Y$. For definiteness we
shall assume that the integral (17.10) has only one singularity and that it involves
the upper limit of integration (that is, either $\omega = +\infty$ or the function $f$ is unbounded
as a function of $x$ in a neighborhood of $\omega$).


Definition We say that the improper integral (17.10) depending on the parameter
$y \in Y$ converges uniformly on the set $E \subset Y$ if for every $\varepsilon > 0$ there exists a
neighborhood $U_{[a,\omega[}(\omega)$ of $\omega$ in the set $[a, \omega[$ such that the estimate

$$\left| \int_b^{\omega} f(x, y)\,dx \right| < \varepsilon \tag{17.11}$$

for the remainder of the integral (17.10) holds for every $b \in U_{[a,\omega[}(\omega)$ and every
$y \in E$.

If we introduce the notation

$$F_b(y) := \int_a^b f(x, y)\,dx \tag{17.12}$$

for a proper integral approximating the improper integral (17.10), the basic definition
of this section can be restated (and, as will be seen in what follows, very
usefully) in a different form equivalent to the previous one:

uniform convergence of the integral (17.10) on the set $E \subset Y$ by definition means
that

$$F_b(y) \rightrightarrows F(y) \text{ on } E \text{ as } b \to \omega,\ b \in [a, \omega[. \tag{17.13}$$

Indeed,

$$F(y) = \int_a^{\omega} f(x, y)\,dx
 = \lim_{\substack{b \to \omega \\ b \in [a,\omega[}} \int_a^b f(x, y)\,dx
 = \lim_{\substack{b \to \omega \\ b \in [a,\omega[}} F_b(y),$$

and therefore relation (17.11) can be rewritten as

$$|F(y) - F_b(y)| < \varepsilon. \tag{17.14}$$

This last inequality holds for every $b \in U_{[a,\omega[}(\omega)$ and every $y \in E$, which is precisely
what relation (17.13) expresses.

Thus, relations (17.11), (17.13), and (17.14) mean that if the integral (17.10)
converges uniformly on a set $E$ of parameter values, then this improper integral
(17.10) can be replaced by a certain proper integral (17.12) depending on the same
parameter $y$ with any preassigned precision, simultaneously for all $y \in E$.

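Example 1 The integral

$$\int_1^{+\infty} \frac{dx}{x^2 + y^2}$$

converges uniformly on the entire set $\mathbb{R}$ of values of the parameter $y \in \mathbb{R}$, since for
every $y \in \mathbb{R}$

$$\int_b^{+\infty} \frac{dx}{x^2 + y^2} \le \int_b^{+\infty} \frac{dx}{x^2} = \frac{1}{b} < \varepsilon,$$

provided $b > 1/\varepsilon$.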

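Example 2 The integral

$$\int_0^{+\infty} e^{-xy}\,dx$$

obviously converges only when $y > 0$. Moreover it converges uniformly on every
set $\{y \in \mathbb{R} \mid y \ge y_0 > 0\}$.

Indeed, if $y \ge y_0 > 0$, then

$$0 \le \int_b^{+\infty} e^{-xy}\,dx = \frac{1}{y}e^{-by} \le \frac{1}{y_0}e^{-by_0} \to 0 \quad \text{as } b \to +\infty.$$

At the same time, the convergence is not uniform on the entire set
$\mathbb{R}_+ = \{y \in \mathbb{R} \mid y > 0\}$. Indeed, negating uniform convergence of the integral (17.10) on a set $E$
means that

$$\exists \varepsilon_0 > 0\ \forall B \in [a, \omega[\ \exists b \in [B, \omega[\ \exists y \in E
 \quad \left( \left| \int_b^{\omega} f(x, y)\,dx \right| > \varepsilon_0 \right).$$

In the present case $\varepsilon_0$ can be taken as any real number, since

$$\int_b^{+\infty} e^{-xy}\,dx = \frac{1}{y}e^{-by} \to +\infty \quad \text{as } y \to +0,$$

for every fixed value of $b \in [0, +\infty[$.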
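
Since the remainder of this integral has the closed form $e^{-by}/y$, both claims can be seen directly in numbers. The sketch below is illustrative only: the function names and the sample values of $b$, $y_0$, and $y$ are assumptions made for the example.

```python
import math


def remainder(b, y):
    """Remainder of the integral of exp(-x y) over [b, +infinity) for y > 0."""
    return math.exp(-b * y) / y


def sup_remainder(b, y0):
    """Supremum of the remainder over y >= y0 > 0.

    The remainder decreases in y, so the supremum is attained at y = y0.
    """
    return remainder(b, y0)


# Uniform smallness on {y >= 0.5}: one value of b works for all such y.
print(sup_remainder(50.0, 0.5))
# No uniformity on {y > 0}: for the same b the remainder is huge for small y.
print(remainder(50.0, 1e-6))
```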

Let us consider a less trivial example, which we shall be using below. 


Example 3 Let us show that each of the integrals

$$\Phi(x) = \int_0^{+\infty} x^{\alpha} y^{\alpha+\beta+1} e^{-(1+x)y}\,dy, \qquad
F(y) = \int_0^{+\infty} x^{\alpha} y^{\alpha+\beta+1} e^{-(1+x)y}\,dx,$$

in which $\alpha$ and $\beta$ are fixed positive numbers, converges uniformly on the set of
nonnegative values of the parameter.

For the remainder of the integral $\Phi(x)$ we find immediately that

$$0 \le \int_b^{+\infty} x^{\alpha} y^{\alpha+\beta+1} e^{-(1+x)y}\,dy
 = \int_b^{+\infty} (xy)^{\alpha} e^{-xy}\, y^{\beta+1} e^{-y}\,dy
 < M_{\alpha} \int_b^{+\infty} y^{\beta+1} e^{-y}\,dy,$$

where $M_{\alpha} = \max_{0 \le u < +\infty} u^{\alpha} e^{-u}$. Since this last integral converges, it can be made
smaller than any preassigned $\varepsilon > 0$ for sufficiently large values of $b \in \mathbb{R}$. But this
means that the integral $\Phi(x)$ converges uniformly.

Let us now consider the remainder of the second integral $F(y)$:

$$0 \le \int_b^{+\infty} x^{\alpha} y^{\alpha+\beta+1} e^{-(1+x)y}\,dx
 = y^{\beta} e^{-y} \int_b^{+\infty} (xy)^{\alpha} e^{-xy}\, y\,dx
 = y^{\beta} e^{-y} \int_{by}^{+\infty} u^{\alpha} e^{-u}\,du.$$

Since

$$\int_{by}^{+\infty} u^{\alpha} e^{-u}\,du \le \int_0^{+\infty} u^{\alpha} e^{-u}\,du < +\infty$$

for $y \ge 0$ and $y^{\beta} e^{-y} \to 0$ as $y \to 0$, for each $\varepsilon > 0$ there obviously exists a number
$y_0 > 0$ such that for every $y \in [0, y_0]$ the remainder of the integral will be less than
$\varepsilon$ even independently of the value of $b \in [0, +\infty[$.

And if $y \ge y_0 > 0$, taking account of the relations
$M_{\beta} = \max_{0 \le y < +\infty} y^{\beta} e^{-y} < +\infty$ and
$0 \le \int_{by}^{+\infty} u^{\alpha} e^{-u}\,du \le \int_{by_0}^{+\infty} u^{\alpha} e^{-u}\,du \to 0$ as $b \to +\infty$, we conclude that
for all sufficiently large values of $b \in [0, +\infty[$ and simultaneously for all $y \ge y_0 > 0$
the remainder of the integral $F(y)$ can be made less than $\varepsilon$.

Combining the intervals $[0, y_0]$ and $[y_0, +\infty[$, we conclude that indeed for every
$\varepsilon > 0$ one can choose a number $B$ such that for every $b > B$ and every $y \ge 0$ the
corresponding remainder of the integral $F(y)$ will be less than $\varepsilon$.


b. The Cauchy Criterion for Uniform Convergence of an Integral 

Proposition 1 (Cauchy criterion) A necessary and sufficient condition for the improper
integral (17.10) depending on the parameter $y \in Y$ to converge uniformly on
a set $E \subset Y$ is that for every $\varepsilon > 0$ there exist a neighborhood $U_{[a,\omega[}(\omega)$ of the point
$\omega$ such that

$$\left| \int_{b_1}^{b_2} f(x, y)\,dx \right| < \varepsilon \tag{17.15}$$

for every $b_1, b_2 \in U_{[a,\omega[}(\omega)$ and every $y \in E$.


Proof Inequality (17.15) is equivalent to the relation $|F_{b_2}(y) - F_{b_1}(y)| < \varepsilon$, so that
Proposition 1 is an immediate corollary of the form (17.13) for the definition of
uniform convergence of the integral (17.10) and the Cauchy criterion for uniform
convergence on $E$ of a family of functions $F_b(y)$ depending on the parameter
$b \in [a, \omega[$. □


As an illustration of the use of this Cauchy criterion, we consider the following 
corollary of it, which is sometimes useful. 

Corollary 1 If the function $f$ in the integral (17.10) is continuous on the set
$[a, \omega[\, \times [c, d]$ and the integral (17.10) converges for every $y \in\, ]c, d[$ but diverges for $y = c$
or $y = d$, then it converges nonuniformly on the interval $]c, d[$ and also on any set
$E \subset\, ]c, d[$ whose closure contains the point of divergence.

Proof If the integral (17.10) diverges at $y = c$, then by the Cauchy criterion for
convergence of an improper integral there exists $\varepsilon_0 > 0$ such that in every neighborhood
$U_{[a,\omega[}(\omega)$ there exist numbers $b_1, b_2$ for which

$$\left| \int_{b_1}^{b_2} f(x, c)\,dx \right| > \varepsilon_0. \tag{17.16}$$

The proper integral

$$\int_{b_1}^{b_2} f(x, y)\,dx$$

is in this case a continuous function of the parameter $y$ on the entire closed interval
$[c, d]$ (see Proposition 1 of Sect. 17.1), so that for all values of $y$ sufficiently close
to $c$, the inequality

$$\left| \int_{b_1}^{b_2} f(x, y)\,dx \right| > \varepsilon_0$$

will hold along with the inequality (17.16).

On the basis of the Cauchy criterion for uniform convergence of an improper integral
depending on a parameter, we now conclude that this integral cannot converge
uniformly on any subset $E \subset\, ]c, d[$ whose closure contains the point $c$.

The case when the integral diverges for $y = d$ is handled similarly. □

Example 4 The integral

$$\int_0^{+\infty} e^{-tx^2}\,dx$$

converges for $t > 0$ and diverges at $t = 0$, hence by Corollary 1 it converges nonuniformly
on every set of positive numbers having 0 as a limit point. In particular, it
converges nonuniformly on the whole set $\{t \in \mathbb{R} \mid t > 0\}$ of positive numbers.

In this case, one can easily verify these statements directly:

$$\int_b^{+\infty} e^{-tx^2}\,dx = \frac{1}{\sqrt{t}} \int_{b\sqrt{t}}^{+\infty} e^{-u^2}\,du \to +\infty \quad \text{as } t \to +0.$$

We emphasize that this integral nevertheless converges uniformly on any set
$\{t \in \mathbb{R} \mid t \ge t_0 > 0\}$ that is bounded away from 0, since

$$0 < \frac{1}{\sqrt{t}} \int_{b\sqrt{t}}^{+\infty} e^{-u^2}\,du
 \le \frac{1}{\sqrt{t_0}} \int_{b\sqrt{t_0}}^{+\infty} e^{-u^2}\,du \to 0 \quad \text{as } b \to +\infty.$$
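
The remainder in this example can be written through the complementary error function, because $\int_c^{+\infty} e^{-u^2}\,du = \frac{\sqrt{\pi}}{2}\operatorname{erfc}(c)$. The sketch below uses this identity; it is illustrative only, and the function name and the sample values of $b$ and $t$ are assumptions made for the example.

```python
import math


def remainder(b, t):
    """Remainder of the integral of exp(-t x^2) over [b, +infinity) for t > 0.

    Uses (1 / sqrt(t)) * (sqrt(pi) / 2) * erfc(b * sqrt(t)).
    """
    return 0.5 * math.sqrt(math.pi / t) * math.erfc(b * math.sqrt(t))


b = 10.0
# On {t >= 1} the remainder is largest at t = 1 and is already negligible.
print(remainder(b, 1.0), remainder(b, 4.0))
# For the same b the remainder grows without bound as t -> +0.
print(remainder(b, 1e-4), remainder(b, 1e-8))
```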


c. Sufficient Conditions for Uniform Convergence of an Improper Integral 
Depending on a Parameter 

Proposition 2 (The Weierstrass test) Suppose the functions $f(x, y)$ and $g(x, y)$ are
integrable with respect to $x$ on every closed interval $[a, b] \subset [a, \omega[$ for each value
of $y \in Y$.

If the inequality $|f(x, y)| \le g(x, y)$ holds for each value of $y \in Y$ and every
$x \in [a, \omega[$ and the integral

$$\int_a^{\omega} g(x, y)\,dx$$

converges uniformly on $Y$, then the integral

$$\int_a^{\omega} f(x, y)\,dx$$

converges absolutely for each $y \in Y$ and uniformly on $Y$.


Proof This follows from the estimates

$$\left| \int_{b_1}^{b_2} f(x, y)\,dx \right| \le \int_{b_1}^{b_2} |f(x, y)|\,dx \le \int_{b_1}^{b_2} g(x, y)\,dx$$

and Cauchy’s criterion for uniform convergence of an integral (Proposition 1). □


The most frequently encountered case of Proposition 2 occurs when the function
$g$ is independent of the parameter $y$. It is this case in which Proposition 2 is usually
called the Weierstrass M-test for uniform convergence of an integral.

Example 5 The integral

$$\int_0^{\infty} \frac{\cos \alpha x}{1 + x^2}\,dx$$

converges uniformly on the whole set $\mathbb{R}$ of values of the parameter $\alpha$, since
$\left| \frac{\cos \alpha x}{1 + x^2} \right| \le \frac{1}{1 + x^2}$ and the integral $\int_0^{\infty} \frac{dx}{1 + x^2}$ converges.


Example 6 In view of the inequality $|\sin x\, e^{-tx^2}| \le e^{-tx^2}$, the integral

$$\int_0^{+\infty} \sin x\, e^{-tx^2}\,dx,$$

as follows from Proposition 2 and the results of Example 4, converges uniformly on
every set of the form $\{t \in \mathbb{R} \mid t \ge t_0 > 0\}$. Since the integral diverges for $t = 0$, on
the basis of the Cauchy criterion we conclude that it cannot converge uniformly on
any set having zero as a limit point.


Proposition 3 (Abel-Dirichlet test) Assume that the functions $f(x,y)$ and $g(x,y)$
are integrable with respect to $x$ at each $y \in Y$ on every closed interval $[a,b] \subset [a,\omega[$.

A sufficient condition for uniform convergence of the integral

$$\int_a^{\omega} (f \cdot g)(x,y)\,dx$$

on the set $Y$ is that one of the following two pairs of conditions holds:

$\alpha_1)$ either there exists a constant $M \in \mathbb{R}$ such that

$$\left|\int_a^{b} f(x,y)\,dx\right| < M$$

for any $b \in [a,\omega[$ and any $y \in Y$ and

$\beta_1)$ for each $y \in Y$ the function $g(x,y)$ is monotonic with respect to $x$ on the
interval $[a,\omega[$ and $g(x,y) \rightrightarrows 0$ on $Y$ as $x \to \omega$, $x \in [a,\omega[$,

or

$\alpha_2)$ the integral

$$\int_a^{\omega} f(x,y)\,dx$$

converges uniformly on the set $Y$ and

$\beta_2)$ for each $y \in Y$ the function $g(x,y)$ is monotonic with respect to $x$ on the
interval $[a,\omega[$ and there exists a constant $M \in \mathbb{R}$ such that

$$|g(x,y)| < M$$

for every $x \in [a,\omega[$ and every $y \in Y$.

Proof Applying the second mean-value theorem for the integral, we write

$$\int_{b_1}^{b_2} (f \cdot g)(x,y)\,dx = g(b_1,y) \int_{b_1}^{\xi} f(x,y)\,dx + g(b_2,y) \int_{\xi}^{b_2} f(x,y)\,dx,$$

where $\xi \in [b_1, b_2]$. If $b_1$ and $b_2$ are taken in a sufficiently small neighborhood
$U_{[a,\omega[}(\omega)$ of the point $\omega$, then the right-hand side of this equality can be made
smaller in absolute value than any prescribed $\varepsilon > 0$, and indeed simultaneously for
all values of $y \in Y$. In the case of the first pair of conditions $\alpha_1)$, $\beta_1)$ this is obvious.
In the case of the second pair $\alpha_2)$, $\beta_2)$, it becomes obvious if we use the Cauchy
criterion for uniform convergence of the integral (Proposition 1).

Thus, again invoking the Cauchy criterion, we conclude that the original integral
of the product $f \cdot g$ over the interval $[a,\omega[$ does indeed converge uniformly on the
set $Y$ of parameter values. □

Example 7 The integral

$$\int_1^{+\infty} \frac{\sin x}{x^{\alpha}}\,dx,$$

as follows from the Cauchy criterion and the Abel-Dirichlet test for convergence
of improper integrals, converges only for $\alpha > 0$. Setting $f(x,\alpha) = \sin x$, $g(x,\alpha) = x^{-\alpha}$,
we see that the pair $\alpha_1)$, $\beta_1)$ of hypotheses of Proposition 3 holds for
$\alpha \ge \alpha_0 > 0$. Consequently, on every set of the form $\{\alpha \in \mathbb{R} \mid \alpha \ge \alpha_0 > 0\}$ this integral
converges uniformly. On the set $\{\alpha \in \mathbb{R} \mid \alpha > 0\}$ of positive values of the parameter
the integral converges nonuniformly, since it diverges at $\alpha = 0$.

Example 8 The integral

$$\int_0^{+\infty} \frac{\sin x}{x}\, e^{-xy}\,dx$$

converges uniformly on the set $\{y \in \mathbb{R} \mid y \ge 0\}$.

Proof First of all, on the basis of the Cauchy criterion for convergence of the
improper integral one can easily conclude that for $y < 0$ this integral diverges. Now
assuming $y \ge 0$ and setting $f(x,y) = \frac{\sin x}{x}$, $g(x,y) = e^{-xy}$, we see that the second
pair $\alpha_2)$, $\beta_2)$ of hypotheses of Proposition 3 holds, from which it follows that this
integral converges uniformly on the set $\{y \in \mathbb{R} \mid y \ge 0\}$. □

Thus we have introduced the concept of uniform convergence of an improper 
integral depending on a parameter and indicated several of the most important tests 
for such convergence completely analogous to the corresponding tests for uniform 
convergence of series of functions. Before passing on, we make two remarks. 


Remark 1 So as not to distract the reader's attention from the basic concept of
uniform convergence of an integral introduced here, we have assumed throughout that
the discussion involves integrating real-valued functions. At the same time, as one
can now easily check, these results extend to integrals of vector-valued functions, in
particular to integrals of complex-valued functions. Here one need only note that, as
always, in the Cauchy criterion one must assume in addition that the corresponding
vector space of values of the integrand is complete (this is the case for $\mathbb{R}$, $\mathbb{C}$, $\mathbb{R}^n$,
and $\mathbb{C}^n$); and in the Abel-Dirichlet test, as in the corresponding test for uniform
convergence of series of functions, the factor in the product $f \cdot g$ that is assumed to
be a monotonic function must of course be real-valued.

Everything that has just been said applies equally to the main results of the fol- 
lowing subsections in this section. 

Remark 2 We have considered an improper integral (17.10) whose only singularity
was at the upper limit of integration. The uniform convergence of an integral
whose only singularity is at the lower limit of integration can be defined and studied
similarly. If the integral has a singularity at both limits of integration, it can be
represented as

$$\int_{\omega_1}^{\omega_2} f(x,y)\,dx = \int_{\omega_1}^{c} f(x,y)\,dx + \int_{c}^{\omega_2} f(x,y)\,dx,$$

where $c \in\, ]\omega_1, \omega_2[$, and regarded as uniformly convergent on a set $E \subset Y$ if both of
the integrals on the right-hand side of the equality converge uniformly. It is easy to
verify that this definition is unambiguous, that is, independent of the choice of the
point $c \in\, ]\omega_1, \omega_2[$.

17.2.2 Limiting Passage Under the Sign of an Improper Integral 
and Continuity of an Improper Integral Depending 
on a Parameter 

Proposition 4 Let $f(x,y)$ be a family of functions depending on a parameter $y \in Y$
that are integrable, possibly in the improper sense, on the interval $a \le x < \omega$, and
let $\mathcal{B}_Y$ be a base in $Y$.

If

a) for every $b \in [a,\omega[$

$$f(x,y) \rightrightarrows \varphi(x) \quad \text{on } [a,b] \text{ over the base } \mathcal{B}_Y$$

and

b) the integral $\int_a^{\omega} f(x,y)\,dx$ converges uniformly on $Y$,

then the limit function $\varphi$ is improperly integrable on $[a,\omega[$ and the following
equality holds:

$$\lim_{\mathcal{B}_Y} \int_a^{\omega} f(x,y)\,dx = \int_a^{\omega} \varphi(x)\,dx. \tag{17.17}$$
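
Proof The proof reduces to checking the following diagram:

$$\begin{array}{ccc}
F_b(y) := \displaystyle\int_a^b f(x,y)\,dx & \underset{b \in [a,\omega[}{\overset{b \to \omega}{\rightrightarrows}} & \displaystyle\int_a^{\omega} f(x,y)\,dx =: F(y) \\[2ex]
{\scriptstyle \mathcal{B}_Y}\Big\downarrow & & \Big\downarrow{\scriptstyle \mathcal{B}_Y} \\[2ex]
\displaystyle\int_a^b \varphi(x)\,dx & \underset{b \in [a,\omega[}{\overset{b \to \omega}{\longrightarrow}} & \displaystyle\int_a^{\omega} \varphi(x)\,dx
\end{array}$$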

The left vertical limiting passage follows from hypothesis a) and the theorem on 
passage to the limit under a proper integral sign (see Theorem 3 of Sect. 16.3). 

The upper horizontal limiting passage is an expression of hypothesis b). 

By the theorem on the commutativity of two limiting passages it follows from 
this that both limits below the diagonal exist and are equal. 

The right-hand vertical limit passage is what stands on the left-hand side of 
Eq. (17.17), and the lower horizontal limit gives by definition the improper inte- 
gral on the right-hand side of (17.17). □ 


The following example shows that condition a) alone is generally insufficient to 
guarantee Eq. (17.17) in this case. 


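Example 9 Let $Y = \{y \in \mathbb{R} \mid y > 0\}$ and

$$f(x,y) = \begin{cases} 1/y, & \text{if } 0 \le x \le y, \\ 0, & \text{if } y < x. \end{cases}$$

Obviously, $f(x,y) \rightrightarrows 0$ on the interval $0 \le x < +\infty$ as $y \to +\infty$. At the same
time, for every $y \in Y$,

$$\int_0^{+\infty} f(x,y)\,dx = \int_0^{y} f(x,y)\,dx = \int_0^{y} \frac{1}{y}\,dx = 1,$$

and therefore Eq. (17.17) does not hold in this case.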
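
The counterexample can be made concrete with a few lines of arithmetic. This is only an illustrative sketch (the function names are hypothetical): the supremum of $f(\cdot, y)$ shrinks like $1/y$, while the area under the graph stays equal to 1.

```python
def f(x: float, y: float) -> float:
    """The family of Example 9: height 1/y on [0, y], zero for x > y."""
    return 1.0 / y if 0.0 <= x <= y else 0.0

def sup_norm(y: float) -> float:
    # sup over x >= 0 of |f(x, y)|; it is attained on the whole segment [0, y].
    return 1.0 / y

def integral(y: float) -> float:
    # The graph is a rectangle of width y and height 1/y.
    return y * f(0.0, y)

for y in (1.0, 10.0, 1e3, 1e6):
    print(f"y = {y:g}: sup|f| = {sup_norm(y):.3g}, integral = {integral(y):.3g}")
```

The limit of the integrals is 1, whereas the integral of the limit function $\varphi \equiv 0$ is 0.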


Using Dini’s theorem (Proposition 2 of Sect. 16.3), we can obtain the following 
sometimes useful corollary of Proposition 4. 

Corollary 2 Suppose that the real-valued function $f(x,y)$ is nonnegative at each
value of the real parameter $y \in Y \subset \mathbb{R}$ and continuous on the interval $a \le x < \omega$.

If

a) the function $f(x,y)$ is monotonically increasing as $y$ increases and tends to
a function $\varphi(x)$ on $[a,\omega[$,

b) $\varphi \in C([a,\omega[, \mathbb{R})$, and

c) the integral $\int_a^{\omega} \varphi(x)\,dx$ converges,

then Eq. (17.17) holds.

Proof It follows from Dini's theorem that $f(x,y) \rightrightarrows \varphi(x)$ on each closed interval
$[a,b] \subset [a,\omega[$.

It follows from the inequalities $0 \le f(x,y) \le \varphi(x)$ and the Weierstrass M-test
for uniform convergence that the integral of $f(x,y)$ over the interval $a \le x < \omega$
converges uniformly with respect to the parameter $y$.

Thus, both hypotheses of Proposition 4 hold, and so Eq. (17.17) holds. □

Example 10 In Example 3 of Sect. 16.3 we verified that the sequence of functions
$f_n(x) = n(1 - x^{1/n})$ is monotonically increasing on the interval $0 < x \le 1$, and

$$f_n(x) \nearrow \ln\frac{1}{x} \quad \text{as } n \to +\infty.$$

Hence, by Corollary 2

$$\lim_{n \to \infty} \int_0^1 n(1 - x^{1/n})\,dx = \int_0^1 \ln\frac{1}{x}\,dx.$$
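
Both sides of this equality can be evaluated in closed form, which gives an independent check. The sketch below (the helper name is hypothetical) uses $\int_0^1 x^{1/n}\,dx = \frac{n}{n+1}$, so that the left-hand integrals equal $\frac{n}{n+1}$, while $\int_0^1 \ln\frac{1}{x}\,dx = 1$.

```python
from fractions import Fraction

def integral_fn(n: int) -> Fraction:
    """Exact value of the integral of n*(1 - x**(1/n)) over [0, 1]."""
    # int_0^1 x**(1/n) dx = 1 / (1 + 1/n) = n / (n + 1)
    return Fraction(n) * (1 - Fraction(n, n + 1))

for n in (1, 2, 10, 100, 1000):
    print(n, integral_fn(n), float(integral_fn(n)))
```

The values increase monotonically to 1, in agreement with Corollary 2.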


Proposition 5 If

a) the function $f(x,y)$ is continuous on the set $\{(x,y) \in \mathbb{R}^2 \mid a \le x < \omega \wedge c \le y \le d\}$, and

b) the integral $F(y) = \int_a^{\omega} f(x,y)\,dx$ converges uniformly on $[c,d]$,

then the function $F(y)$ is continuous on $[c,d]$.

Proof It follows from hypothesis a) that for any $b \in [a,\omega[$ the proper integral

$$F_b(y) = \int_a^b f(x,y)\,dx$$

is a continuous function on $[c,d]$ (see Proposition 1 of Sect. 17.1).

By hypothesis b) we have $F_b(y) \rightrightarrows F(y)$ on $[c,d]$ as $b \to \omega$, $b \in [a,\omega[$, from
which it now follows that the function $F(y)$ is continuous on $[c,d]$. □


Example 11 It was shown in Example 8 that the integral

$$F(y) = \int_0^{+\infty} \frac{\sin x}{x}\, e^{-xy}\,dx \tag{17.18}$$

converges uniformly on the interval $0 \le y < +\infty$. Hence by Proposition 5 one can
conclude that $F(y)$ is continuous on each closed interval $[0,d] \subset [0,+\infty[$, that is,
it is continuous on the entire interval $0 \le y < +\infty$. In particular, it follows from this
that

$$\lim_{y \to +0} \int_0^{+\infty} \frac{\sin x}{x}\, e^{-xy}\,dx = \int_0^{+\infty} \frac{\sin x}{x}\,dx. \tag{17.19}$$

17.2.3 Differentiation of an Improper Integral with Respect
to a Parameter

Proposition 6 If

a) the functions $f(x,y)$ and $f'_y(x,y)$ are continuous on the set $\{(x,y) \in \mathbb{R}^2 \mid a \le x < \omega \wedge c \le y \le d\}$,

b) the integral $\Phi(y) = \int_a^{\omega} f'_y(x,y)\,dx$ converges uniformly on the set $Y = [c,d]$,
and

c) the integral $F(y) = \int_a^{\omega} f(x,y)\,dx$ converges for at least one value of $y_0 \in Y$,

then it converges uniformly on the whole set $Y$. Moreover the function $F(y)$ is
differentiable and the following equality holds:

$$F'(y) = \int_a^{\omega} f'_y(x,y)\,dx.$$

Proof By hypothesis a), for every $b \in [a,\omega[$ the function

$$F_b(y) = \int_a^b f(x,y)\,dx$$

is defined and differentiable on the interval $c < y < d$ and by Leibniz' rule

$$(F_b)'(y) = \int_a^b f'_y(x,y)\,dx.$$

By hypothesis b) the family of functions $(F_b)'(y)$ depending on the parameter
$b \in [a,\omega[$ converges uniformly on $[c,d]$ to the function $\Phi(y)$ as $b \to \omega$, $b \in [a,\omega[$.
By hypothesis c) the quantity $F_b(y_0)$ has a limit as $b \to \omega$, $b \in [a,\omega[$.

It follows from this (see Theorem 4 of Sect. 16.3) that the family of functions
$F_b(y)$ itself converges uniformly on $[c,d]$ to the limiting function $F(y)$ as $b \to \omega$,
$b \in [a,\omega[$, the function $F$ is differentiable on the interval $c < y < d$, and the equality
$F'(y) = \Phi(y)$ holds. But this is precisely what was to be proved. □

Example 12 For a fixed value $\alpha > 0$ the integral

$$\int_0^{+\infty} x^{\alpha} e^{-xy}\,dx$$

converges uniformly with respect to the parameter $y$ on every interval of the form
$\{y \in \mathbb{R} \mid y \ge y_0 > 0\}$. This follows from the estimate
$0 \le x^{\alpha} e^{-xy} \le x^{\alpha} e^{-xy_0} < e^{-x y_0/2}$, which holds for all sufficiently large $x \in \mathbb{R}$.

Hence, by Proposition 6, the function

$$F(y) = \int_0^{+\infty} e^{-xy}\,dx$$

is infinitely differentiable for $y > 0$ and

$$F^{(n)}(y) = (-1)^n \int_0^{+\infty} x^n e^{-xy}\,dx.$$

But $F(y) = \frac{1}{y}$, and therefore $F^{(n)}(y) = (-1)^n \frac{n!}{y^{n+1}}$, and consequently we can
conclude that

$$\int_0^{+\infty} x^n e^{-xy}\,dx = \frac{n!}{y^{n+1}}.$$

In particular, for $y = 1$ we obtain

$$\int_0^{+\infty} x^n e^{-x}\,dx = n!.$$
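
The formula $\int_0^{+\infty} x^n e^{-xy}\,dx = n!/y^{n+1}$ is easy to test numerically. The following is a minimal sketch rather than a rigorous computation: the improper integral is truncated at a finite cutoff (an assumption justified by the exponential decay), and `simpson` and `moment` are hypothetical helper names.

```python
import math

def simpson(g, a: float, b: float, m: int) -> float:
    """Composite Simpson rule on [a, b] with an even number m of subintervals."""
    h = (b - a) / m
    s = g(a) + g(b)
    for i in range(1, m):
        s += (4 if i % 2 else 2) * g(a + i * h)
    return s * h / 3.0

def moment(n: int, y: float, cutoff: float = 80.0, m: int = 20000) -> float:
    """Approximate the integral of x**n * exp(-x*y) over [0, cutoff / y]."""
    return simpson(lambda x: x**n * math.exp(-x * y), 0.0, cutoff / y, m)

for n, y in ((0, 1.0), (3, 2.0), (5, 1.0)):
    exact = math.factorial(n) / y ** (n + 1)
    print(f"n = {n}, y = {y}: numeric = {moment(n, y):.10f}, n!/y^(n+1) = {exact:.10f}")
```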


Example 13 Let us compute the Dirichlet integral

$$\int_0^{+\infty} \frac{\sin x}{x}\,dx.$$

To do this we return to the integral (17.18), and we remark that for $y > 0$

$$F'(y) = -\int_0^{+\infty} \sin x\, e^{-xy}\,dx, \tag{17.20}$$

since the integral (17.20) converges uniformly on every set of the form
$\{y \in \mathbb{R} \mid y \ge y_0 > 0\}$.

The integral (17.20) is easily computed from the primitive of the integrand, and
the result is that

$$F'(y) = -\frac{1}{1 + y^2} \quad \text{for } y > 0,$$

from which it follows that

$$F(y) = -\arctan y + c \quad \text{for } y > 0. \tag{17.21}$$

We have $F(y) \to 0$ as $y \to +\infty$, as can be seen from relation (17.18), so that
it follows from (17.21) that $c = \pi/2$. It now results from (17.19) and (17.21) that
$F(0) = \pi/2$. Thus,

$$\int_0^{+\infty} \frac{\sin x}{x}\,dx = \frac{\pi}{2}. \tag{17.22}$$

We remark that the relation "$F(y) \to 0$ as $y \to +\infty$" used in deriving (17.22) is
not an immediate corollary of Proposition 4, since $\frac{\sin x}{x} e^{-xy} \rightrightarrows 0$ as $y \to +\infty$ only
on intervals of the form $\{x \in \mathbb{R} \mid x \ge x_0 > 0\}$, while the convergence is not uniform
on intervals of the form $0 < x < x_0$: for $\frac{\sin x}{x} e^{-xy} \to 1$ as $x \to 0$. But for $x_0 > 0$ we
have

$$\int_0^{+\infty} \frac{\sin x}{x}\, e^{-xy}\,dx = \int_0^{x_0} \frac{\sin x}{x}\, e^{-xy}\,dx + \int_{x_0}^{+\infty} \frac{\sin x}{x}\, e^{-xy}\,dx$$

and, given $\varepsilon > 0$ we first choose $x_0$ so close to 0 that $\sin x \ge 0$ for $x \in [0, x_0]$ and

$$0 < \int_0^{x_0} \frac{\sin x}{x}\, e^{-xy}\,dx < \int_0^{x_0} \frac{\sin x}{x}\,dx < \frac{\varepsilon}{2}$$

for every $y > 0$. Then, after fixing $x_0$, on the basis of Proposition 4, by letting $y$
tend to $+\infty$, we can make the integral over $[x_0, +\infty[$ also less than $\varepsilon/2$ in absolute
value.
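
Relation (17.21) with $c = \pi/2$, that is $F(y) = \frac{\pi}{2} - \arctan y$, can be tested numerically for $y > 0$. The sketch below is an illustration under stated assumptions: the integral (17.18) is truncated at $x = 40/y$, where $e^{-xy} = e^{-40}$ is negligible, and the helper names are hypothetical. The value $y = 0$ itself is out of reach of this naive truncation, which is exactly where the uniform-convergence argument of the text is needed.

```python
import math

def simpson(g, a: float, b: float, m: int) -> float:
    """Composite Simpson rule on [a, b] with an even number m of subintervals."""
    h = (b - a) / m
    s = g(a) + g(b)
    for i in range(1, m):
        s += (4 if i % 2 else 2) * g(a + i * h)
    return s * h / 3.0

def F(y: float, cutoff: float = 40.0, m: int = 40000) -> float:
    """Approximate (17.18) for y > 0 by integrating over [0, cutoff / y]."""
    def integrand(x: float) -> float:
        sinc = math.sin(x) / x if x != 0.0 else 1.0  # continuous extension at x = 0
        return sinc * math.exp(-x * y)
    return simpson(integrand, 0.0, cutoff / y, m)

for y in (0.25, 1.0, 4.0):
    print(f"y = {y}: F(y) = {F(y):.10f}, pi/2 - arctan(y) = {math.pi / 2 - math.atan(y):.10f}")
```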


17.2.4 Integration of an Improper Integral with Respect 
to a Parameter 

Proposition 7 If

a) the function $f(x,y)$ is continuous on the set $\{(x,y) \in \mathbb{R}^2 \mid a \le x < \omega \wedge c \le y \le d\}$ and

b) the integral $F(y) = \int_a^{\omega} f(x,y)\,dx$ converges uniformly on the closed interval $[c,d]$,

then the function $F$ is integrable on $[c,d]$ and the following equality holds:

$$\int_c^d dy \int_a^{\omega} f(x,y)\,dx = \int_a^{\omega} dx \int_c^d f(x,y)\,dy. \tag{17.23}$$

Proof For $b \in [a,\omega[$, by hypothesis a) and Proposition 3 of Sect. 17.1 for proper
integrals one can write

$$\int_c^d dy \int_a^{b} f(x,y)\,dx = \int_a^{b} dx \int_c^d f(x,y)\,dy. \tag{17.24}$$

Using hypothesis b) and Theorem 3 of Sect. 16.3 on passage to the limit under
an integral sign, we carry out a limiting passage on the left-hand side of (17.24) as
$b \to \omega$, $b \in [a,\omega[$ and obtain the left-hand side of (17.23). By the very definition
of an improper integral, the right-hand side of (17.23) is the limit of the right-hand
side of (17.24) as $b \to \omega$, $b \in [a,\omega[$. Thus, by hypothesis b) we obtain (17.23) from
(17.24) as $b \to \omega$, $b \in [a,\omega[$. □

The following example shows that, in contrast to the reversibility of the order of 
integration with two proper integrals, condition a) alone is in general not sufficient 
to guarantee (17.23). 

Example 14 Consider the function $f(x,y) = (2 - xy)xy\, e^{-xy}$ on the set
$\{(x,y) \in \mathbb{R}^2 \mid 0 \le x < +\infty \wedge 0 \le y \le 1\}$. Using the primitive $u^2 e^{-u}$ of the function
$(2 - u)u e^{-u}$, it is easy to compute directly that

$$0 = \int_0^1 dy \int_0^{+\infty} (2 - xy)xy\, e^{-xy}\,dx \ne \int_0^{+\infty} dx \int_0^1 (2 - xy)xy\, e^{-xy}\,dy = 1.$$
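
The computation can be traced step by step. In the sketch below (helper names are hypothetical) the inner integrals are written in closed form using the primitive $u^2 e^{-u}$: for $y > 0$ the partial integral over $[0, X]$ equals $\frac{1}{y}(Xy)^2 e^{-Xy}$, and the integral over $y \in [0,1]$ equals $x e^{-x}$. The outer integral of $x e^{-x}$ is then approximated on a truncated interval.

```python
import math

def simpson(g, a: float, b: float, m: int) -> float:
    """Composite Simpson rule on [a, b] with an even number m of subintervals."""
    h = (b - a) / m
    s = g(a) + g(b)
    for i in range(1, m):
        s += (4 if i % 2 else 2) * g(a + i * h)
    return s * h / 3.0

def inner_x_partial(X: float, y: float) -> float:
    """Integral of (2 - x*y)*x*y*exp(-x*y) over x in [0, X], for y > 0."""
    u = X * y
    return u * u * math.exp(-u) / y

def inner_y(x: float) -> float:
    """Integral of (2 - x*y)*x*y*exp(-x*y) over y in [0, 1]; equals x*exp(-x)."""
    return x * math.exp(-x)

# For each fixed y > 0 the inner integral over x tends to 0 as X grows ...
print(inner_x_partial(1e3, 0.5))
# ... but not uniformly in y: at y = 1/X the partial integral equals X/e.
print(inner_x_partial(1e3, 1e-3), 1e3 / math.e)
# The other order of integration gives 1.
print(simpson(inner_y, 0.0, 60.0, 6000))
```

The second printed pair shows that hypothesis b) of Proposition 7 fails here: the supremum over $y \in\, ]0,1]$ of the partial integrals grows like $X/e$.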

Corollary 3 If

a) the function $f(x,y)$ is continuous on the set $P = \{(x,y) \in \mathbb{R}^2 \mid a \le x < \omega \wedge c \le y \le d\}$ and

b) nonnegative on $P$, and

c) the integral $F(y) = \int_a^{\omega} f(x,y)\,dx$ is continuous on the closed interval $[c,d]$
as a function of $y$,

then Eq. (17.23) holds.

Proof It follows from hypothesis a) that for every $b \in [a,\omega[$ the integral

$$F_b(y) = \int_a^b f(x,y)\,dx$$

is continuous with respect to $y$ on the closed interval $[c,d]$.

It follows from b) that $F_{b_1}(y) \le F_{b_2}(y)$ for $b_1 \le b_2$.

By Dini's theorem and hypothesis c) we now conclude that $F_b \rightrightarrows F$ on $[c,d]$ as
$b \to \omega$, $b \in [a,\omega[$.

Thus the hypotheses of Proposition 7 are satisfied and consequently Eq. (17.23)
indeed holds in the present case. □

Corollary 3 shows that Example 14 results from the fact that the function $f(x,y)$
is not of constant sign.

In conclusion we now prove a sufficient condition for two improper integrals to 
commute. 


Proposition 8 If

a) the function $f(x,y)$ is continuous on the set $\{(x,y) \in \mathbb{R}^2 \mid a \le x < \omega \wedge c \le y < \tilde\omega\}$,

b) both integrals

$$F(y) = \int_a^{\omega} f(x,y)\,dx, \qquad \Phi(x) = \int_c^{\tilde\omega} f(x,y)\,dy$$

converge uniformly, the first with respect to $y$ on any closed interval $[c,d] \subset [c,\tilde\omega[$,
the second with respect to $x$ on any closed interval $[a,b] \subset [a,\omega[$, and

c) at least one of the iterated integrals

$$\int_c^{\tilde\omega} dy \int_a^{\omega} |f|(x,y)\,dx, \qquad \int_a^{\omega} dx \int_c^{\tilde\omega} |f|(x,y)\,dy$$

converges, then the following equality holds:

$$\int_c^{\tilde\omega} dy \int_a^{\omega} f(x,y)\,dx = \int_a^{\omega} dx \int_c^{\tilde\omega} f(x,y)\,dy. \tag{17.25}$$

Proof For definiteness suppose that the second of the two iterated integrals in c)
exists.

By condition a) and the first condition in b) one can say by Proposition 7 that
Eq. (17.23) holds for the function $f$ for every $d \in [c,\tilde\omega[$.

If we show that the right-hand side of (17.23) tends to the right-hand side of
(17.25) as $d \to \tilde\omega$, $d \in [c,\tilde\omega[$, then Eq. (17.25) will have been proved, since the
left-hand side will then also exist and be the limit of the left-hand side of Eq. (17.23) by
the very definition of an improper integral.

Let us set

$$\Phi_d(x) := \int_c^d f(x,y)\,dy.$$

The function $\Phi_d$ is defined for each fixed $d \in [c,\tilde\omega[$ and, since $f$ is continuous,
$\Phi_d$ is continuous on the interval $a \le x < \omega$.

By the second of hypotheses b) we have $\Phi_d(x) \rightrightarrows \Phi(x)$ as $d \to \tilde\omega$, $d \in [c,\tilde\omega[$ on
each closed interval $[a,b] \subset [a,\omega[$.

Since $|\Phi_d(x)| \le \int_c^{\tilde\omega} |f|(x,y)\,dy =: G(x)$ and the integral $\int_a^{\omega} G(x)\,dx$, which
equals the second integral in hypothesis c), converges by hypothesis, we conclude
by the Weierstrass M-test for uniform convergence that the integral $\int_a^{\omega} \Phi_d(x)\,dx$
converges uniformly with respect to the parameter $d$.

Thus the hypotheses of Proposition 4 hold, and we can conclude that

$$\lim_{\substack{d \to \tilde\omega \\ d \in [c,\tilde\omega[}} \int_a^{\omega} \Phi_d(x)\,dx = \int_a^{\omega} \Phi(x)\,dx;$$

and that was precisely what remained to be verified. □


The following example shows that the appearance of the extra hypothesis c) in 
Proposition 8 in comparison with Proposition 7 is not accidental. 


Example 15 Computing the integral

$$\int_A^{+\infty} \frac{x^2 - y^2}{(x^2 + y^2)^2}\,dx = -\frac{x}{x^2 + y^2}\bigg|_{A}^{+\infty} = \frac{A}{A^2 + y^2} \le \frac{1}{A}$$

for $A > 0$ shows at the same time that for every fixed value of $A > 0$ it converges
uniformly with respect to the parameter $y$ on the entire set of real numbers $\mathbb{R}$. The
same thing could have been said about the integral obtained from this one by
replacing $dx$ with $dy$. The values of these integrals happen to differ only in sign. A direct
computation shows that

$$-\frac{\pi}{4} = \int_A^{+\infty} dx \int_A^{+\infty} \frac{x^2 - y^2}{(x^2 + y^2)^2}\,dy \ne \int_A^{+\infty} dy \int_A^{+\infty} \frac{x^2 - y^2}{(x^2 + y^2)^2}\,dx = \frac{\pi}{4}.$$


Example 16 For $\alpha > 0$ and $\beta > 0$ the iterated integral

$$\int_0^{+\infty} dy \int_0^{+\infty} x^{\alpha} y^{\alpha+\beta+1} e^{-(1+x)y}\,dx = \int_0^{+\infty} y^{\beta} e^{-y}\,dy \int_0^{+\infty} (xy)^{\alpha} e^{-(xy)}\, y\,dx$$

of a nonnegative continuous function exists, as this identity shows: it equals zero for
$y = 0$ and $\int_0^{+\infty} y^{\beta} e^{-y}\,dy \cdot \int_0^{+\infty} u^{\alpha} e^{-u}\,du$ for $y > 0$. Thus, in this case hypotheses
a) and c) of Proposition 8 hold. The fact that both conditions of b) hold for this
integral was verified in Example 3. Hence by Proposition 8 we have the equality

$$\int_0^{+\infty} dy \int_0^{+\infty} x^{\alpha} y^{\alpha+\beta+1} e^{-(1+x)y}\,dx = \int_0^{+\infty} dx \int_0^{+\infty} x^{\alpha} y^{\alpha+\beta+1} e^{-(1+x)y}\,dy.$$


Just as Corollary 3 followed from Proposition 7, we can deduce the following 
corollary from Proposition 8. 


Corollary 4 If

a) the function $f(x,y)$ is continuous on the set
$P = \{(x,y) \in \mathbb{R}^2 \mid a \le x < \omega \wedge c \le y < \tilde\omega\}$, and

b) is nonnegative on $P$, and

c) the two integrals

$$F(y) = \int_a^{\omega} f(x,y)\,dx, \qquad \Phi(x) = \int_c^{\tilde\omega} f(x,y)\,dy$$

are continuous functions on $[c,\tilde\omega[$ and $[a,\omega[$ respectively, and

d) at least one of the iterated integrals

$$\int_c^{\tilde\omega} dy \int_a^{\omega} f(x,y)\,dx, \qquad \int_a^{\omega} dx \int_c^{\tilde\omega} f(x,y)\,dy$$

exists, then the other iterated integral also exists and their values are the same.

Proof Reasoning as in the proof of Corollary 3, we conclude from hypotheses a),
b), and c) and Dini's theorem that hypothesis b) of Proposition 8 holds in this case.
Since $f \ge 0$, hypothesis d) here is the same as hypothesis c) of Proposition 8. Thus
all the hypotheses of Proposition 8 are satisfied, and so Eq. (17.25) holds. □

Remark 3 As pointed out in Remark 2, an integral having singularities at both limits
of integration reduces to the sum of two integrals, each of which has a singularity
at only one limit. This makes it possible to apply the propositions and corollaries
just proved to integrals over intervals $]\omega_1, \omega_2[ \subset \mathbb{R}$. Here naturally the hypotheses
that were satisfied previously on closed intervals $[a,b] \subset [a,\omega[$ must now hold on
closed intervals $[a,b] \subset\, ]\omega_1, \omega_2[$.


Example 17 By changing the order of integration in two improper integrals, let us
show that

$$\int_0^{+\infty} e^{-u^2}\,du = \frac{1}{2}\sqrt{\pi}. \tag{17.26}$$

This is the famous Euler-Poisson integral.

Proof We first observe that for $y > 0$

$$J := \int_0^{+\infty} e^{-u^2}\,du = y \int_0^{+\infty} e^{-(xy)^2}\,dx,$$

and that the value of the integral in (17.26) is the same whether it is taken over the
half-open interval $[0,+\infty[$ or the open interval $]0,+\infty[$.

Thus,

$$\int_0^{+\infty} y e^{-y^2}\,dy \int_0^{+\infty} e^{-(xy)^2}\,dx = \int_0^{+\infty} e^{-y^2}\,dy \int_0^{+\infty} e^{-u^2}\,du = J^2,$$

and we assume that the integration on $y$ extends over the interval $]0,+\infty[$.

As we shall verify, it is permissible to reverse the order of integration over $x$ and
$y$ in this iterated integral, and therefore

$$J^2 = \int_0^{+\infty} dx \int_0^{+\infty} y e^{-(1+x^2)y^2}\,dy = \frac{1}{2} \int_0^{+\infty} \frac{dx}{1 + x^2} = \frac{\pi}{4},$$

from which Eq. (17.26) follows.

Let us now justify reversing the order of integration.
The function

$$\int_0^{+\infty} y e^{-(1+x^2)y^2}\,dy = \frac{1}{2} \cdot \frac{1}{1 + x^2}$$

is continuous for $x \ge 0$, and the function

$$\int_0^{+\infty} y e^{-(1+x^2)y^2}\,dx = e^{-y^2} \cdot J$$

is continuous for $y > 0$. Taking account of the general Remark 3, we now conclude
from Corollary 4 that this reversal in the order of integration is indeed legal. □
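
As a sanity check on (17.26), one can compare a direct numerical value of $J$ with $\frac{1}{2}\sqrt{\pi}$, and $J^2$ with $\frac{\pi}{4}$. This is a minimal sketch under the assumption that truncating the integral at $x = 8$ is harmless (the neglected tail is below $e^{-64}$); `simpson` is a hypothetical helper.

```python
import math

def simpson(g, a: float, b: float, m: int) -> float:
    """Composite Simpson rule on [a, b] with an even number m of subintervals."""
    h = (b - a) / m
    s = g(a) + g(b)
    for i in range(1, m):
        s += (4 if i % 2 else 2) * g(a + i * h)
    return s * h / 3.0

# J approximates the Euler-Poisson integral on the truncated interval [0, 8].
J = simpson(lambda u: math.exp(-u * u), 0.0, 8.0, 4000)

print(f"J         = {J:.12f}")
print(f"sqrt(pi)/2 = {math.sqrt(math.pi) / 2:.12f}")
print(f"J^2       = {J * J:.12f}")
print(f"pi/4      = {math.pi / 4:.12f}")
```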

17.2.5 Problems and Exercises

1. Let $a = a_0 < a_1 < \cdots < a_n < \cdots < \omega$. We represent the integral (17.10) as the
sum of the series $\sum_{n=1}^{\infty} \varphi_n(y)$, where $\varphi_n(y) = \int_{a_{n-1}}^{a_n} f(x,y)\,dx$. Prove that the
integral converges uniformly on the set $E \subset Y$ if and only if to each sequence $\{a_n\}$ of
this form there corresponds a series $\sum_{n=1}^{\infty} \varphi_n(y)$ that converges uniformly on $E$.

2. a) In accordance with Remark 1 carry out all the constructions in Sect. 17.2.1
for the case of a complex-valued integrand $f$.

b) Verify the assertions in Remark 2.


3. Verify that the function $J_0(x) = \frac{1}{\pi} \int_0^1 \frac{\cos xt}{\sqrt{1 - t^2}}\,dt$ satisfies Bessel's equation
$y'' + \frac{1}{x} y' + y = 0$.

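4. a) Starting from the equality $\int_0^{+\infty} \frac{dy}{x^2 + y^2} = \frac{\pi}{2} \cdot \frac{1}{x}$, show that

$$\int_0^{+\infty} \frac{dy}{(x^2 + y^2)^n} = \frac{\pi}{2} \cdot \frac{(2n-3)!!}{(2n-2)!!} \cdot \frac{1}{x^{2n-1}}.$$

b) Verify that

$$\int_0^{+\infty} \frac{dy}{(1 + (y^2/n))^n} = \frac{\pi}{2} \cdot \frac{(2n-3)!!}{(2n-2)!!} \sqrt{n}.$$

c) Show that $(1 + (y^2/n))^{-n} \searrow e^{-y^2}$ on $\mathbb{R}$ as $n \to +\infty$ and that

$$\lim_{n \to +\infty} \int_0^{+\infty} \frac{dy}{(1 + (y^2/n))^n} = \int_0^{+\infty} e^{-y^2}\,dy.$$

d) Obtain the following formula of Wallis:

$$\lim_{n \to \infty} \frac{(2n-3)!!}{(2n-2)!!} \sqrt{n} = \frac{1}{\sqrt{\pi}}.$$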


5. Taking account of Eq. (17.26), show that

a) $\int_0^{+\infty} e^{-x^2} \cos 2xy\,dx = \frac{1}{2}\sqrt{\pi}\, e^{-y^2}$.

b) $\int_0^{+\infty} e^{-x^2} \sin 2xy\,dx = e^{-y^2} \int_0^{y} e^{t^2}\,dt$.

6. Assuming $t > 0$, prove the identity

$$\int_0^{+\infty} \frac{e^{-tx}}{1 + x^2}\,dx = \int_t^{+\infty} \frac{\sin(x - t)}{x}\,dx,$$

using the fact that both of these integrals, as functions of the parameter $t$, satisfy the
equation $\ddot{y} + y = 1/t$ and tend to zero as $t \to +\infty$.

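7. Show that

$$\int_0^1 K(k)\,dk = \int_0^{\pi/2} \frac{\varphi}{\sin\varphi}\,d\varphi \quad \left(= 2\int_0^1 \frac{\arctan x}{x}\,dx\right),$$

where $K(k) = \int_0^{\pi/2} \frac{d\varphi}{\sqrt{1 - k^2 \sin^2\varphi}}$ is the complete elliptic integral of first kind.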

8. a) Assuming that $a > 0$ and $b > 0$ and using the equality

$$\int_0^{+\infty} dx \int_a^b e^{-xy}\,dy = \int_0^{+\infty} \frac{e^{-ax} - e^{-bx}}{x}\,dx,$$

compute this last integral.

b) For $a > 0$ and $b > 0$ compute the integral

$$\int_0^{+\infty} \frac{e^{-ax} - e^{-bx}}{x} \cos x\,dx.$$

c) Using the Dirichlet integral (17.22) and the equality

$$\int_0^{+\infty} \frac{dx}{x} \int_a^b \sin xy\,dy = \int_0^{+\infty} \frac{\cos ax - \cos bx}{x^2}\,dx,$$

compute this last integral.
9. a) Prove that for $k > 0$

$$\int_0^{+\infty} e^{-kt} \sin t\,dt \int_0^{+\infty} e^{-tu^2}\,du = \int_0^{+\infty} du \int_0^{+\infty} e^{-(k+u^2)t} \sin t\,dt.$$

b) Show that the preceding equality remains valid for the value $k = 0$.

c) Using the Euler-Poisson integral (17.26), verify that

$$\frac{1}{\sqrt{t}} = \frac{2}{\sqrt{\pi}} \int_0^{+\infty} e^{-tu^2}\,du.$$

d) Using this last equality and the relations

$$\int_0^{+\infty} \sin x^2\,dx = \frac{1}{2} \int_0^{+\infty} \frac{\sin t}{\sqrt{t}}\,dt, \qquad \int_0^{+\infty} \cos x^2\,dx = \frac{1}{2} \int_0^{+\infty} \frac{\cos t}{\sqrt{t}}\,dt,$$

obtain the value $\left(\frac{1}{2}\sqrt{\frac{\pi}{2}}\right)$ for the Fresnel integrals

$$\int_0^{+\infty} \sin x^2\,dx, \qquad \int_0^{+\infty} \cos x^2\,dx.$$


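10. a) Use the equality

$$\int_0^{+\infty} \frac{\sin x}{x}\,dx = \int_0^{+\infty} \sin x\,dx \int_0^{+\infty} e^{-xy}\,dy$$

and, by justifying a reversal in the order of integration in the iterated integral, obtain
once again the value of the Dirichlet integral (17.22) found in Example 13.

b) Show that for $\alpha > 0$ and $\beta > 0$

$$\int_0^{+\infty} \frac{\sin \alpha x}{x} \cos \beta x\,dx = \begin{cases} \frac{\pi}{2}, & \text{if } \beta < \alpha, \\ \frac{\pi}{4}, & \text{if } \beta = \alpha, \\ 0, & \text{if } \beta > \alpha. \end{cases}$$

This integral is often called the Dirichlet discontinuous factor.

c) Assuming $\alpha > 0$ and $\beta > 0$, verify the equality

$$\int_0^{+\infty} \frac{\sin \alpha x}{x} \cdot \frac{\sin \beta x}{x}\,dx = \begin{cases} \frac{\pi}{2}\beta, & \text{if } \beta \le \alpha, \\ \frac{\pi}{2}\alpha, & \text{if } \alpha \le \beta. \end{cases}$$

d) Prove that if the numbers $\alpha, \alpha_1, \ldots, \alpha_n$ are positive and $\alpha > \sum_{i=1}^{n} \alpha_i$, then

$$\int_0^{+\infty} \frac{\sin \alpha x}{x} \cdot \frac{\sin \alpha_1 x}{x} \cdots \frac{\sin \alpha_n x}{x}\,dx = \frac{\pi}{2}\, \alpha_1 \alpha_2 \cdots \alpha_n.$$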


11. Consider the integral

$$\mathcal{F}(y) = \int_a^{\omega} f(x,y) g(x)\,dx,$$

where $g$ is a locally integrable function on $[a,\omega[$ (that is, for each $b \in [a,\omega[$,
$g|_{[a,b]} \in \mathcal{R}[a,b]$). Let the function $f$ satisfy the various hypotheses a) of Propositions 5-8.
If the integrand $f(x,y)$ is replaced by $f(x,y) \cdot g(x)$ in the other hypotheses of
these propositions, the results are hypotheses under which, by using Problem 6 of
Sect. 17.1 and repeating verbatim the proofs of Propositions 5-8, one can conclude
respectively that

a) $\mathcal{F} \in C[c,d]$;

b) $\mathcal{F} \in C^{(1)}[c,d]$, and

$$\mathcal{F}'(y) = \int_a^{\omega} \frac{\partial f}{\partial y}(x,y) g(x)\,dx;$$

c) $\mathcal{F} \in \mathcal{R}[c,d]$ and

$$\int_c^d \mathcal{F}(y)\,dy = \int_a^{\omega} \left( \int_c^d f(x,y) g(x)\,dy \right) dx;$$

d) $\mathcal{F}$ is improperly integrable on $[c,\tilde\omega[$, and

$$\int_c^{\tilde\omega} \mathcal{F}(y)\,dy = \int_a^{\omega} \left( \int_c^{\tilde\omega} f(x,y) g(x)\,dy \right) dx.$$

Verify this.

17.3 The Eulerian Integrals

In this section and the next we shall illustrate the application of the theory
developed above to some specific integrals of importance in analysis that depend on a
parameter.

Following Legendre, we define the Eulerian integrals of first and second kinds
respectively as the two special functions that follow:

$$B(\alpha,\beta) := \int_0^1 x^{\alpha-1} (1 - x)^{\beta-1}\,dx, \tag{17.27}$$

$$\Gamma(\alpha) := \int_0^{+\infty} x^{\alpha-1} e^{-x}\,dx. \tag{17.28}$$

The first of these is called the beta function, and the second, which is the most
frequently used, is the gamma function of Euler.¹

17.3.1 The Beta Function 

a. Domain of Definition 

A necessary and sufficient condition for the convergence of the integral (17.27) at
the lower limit is that $\alpha > 0$. Similarly, convergence at 1 occurs if and only if $\beta > 0$.

Thus the function $B(\alpha,\beta)$ is defined when both of the following conditions hold
simultaneously:

$$\alpha > 0 \quad \text{and} \quad \beta > 0.$$

Remark We are regarding $\alpha$ and $\beta$ as real numbers here. However, it should be kept
in mind that the most complete picture of the properties of the beta and gamma
functions and the most profound applications of them involve their extension into
the complex parameter domain.


b. Symmetry 

Let us verify that 

$$B(\alpha,\beta) = B(\beta,\alpha). \tag{17.29}$$

Proof It suffices to make the change of variable $x = 1 - t$ in the integral (17.27). □


¹L. Euler (1707-1783), a brilliant scientist and above all a mathematician and specialist in
mechanics. If one were to select a name, after the names of Newton and Leibniz, for a professional
mathematician, that name would likely be pronounced "Euler". Euler's works and ideas still
permeate almost all areas of modern mathematics. Swiss by birth, he spent a significant part of his life
living and working in Russia, where he was buried.
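
c. The Reduction Formula

If $\alpha > 1$, the following equality holds:

$$B(\alpha,\beta) = \frac{\alpha - 1}{\alpha + \beta - 1}\, B(\alpha - 1, \beta). \tag{17.30}$$

Proof Integrating by parts and carrying out some identity transformations for $\alpha > 1$
and $\beta > 0$, we obtain

$$\begin{aligned}
B(\alpha,\beta) &= -\frac{1}{\beta}\, x^{\alpha-1} (1 - x)^{\beta} \Big|_0^1 + \frac{\alpha - 1}{\beta} \int_0^1 x^{\alpha-2} (1 - x)^{\beta}\,dx = \\
&= \frac{\alpha - 1}{\beta} \int_0^1 x^{\alpha-2} \big( (1 - x)^{\beta-1} - (1 - x)^{\beta-1} x \big)\,dx = \\
&= \frac{\alpha - 1}{\beta}\, B(\alpha - 1, \beta) - \frac{\alpha - 1}{\beta}\, B(\alpha, \beta),
\end{aligned}$$

from which the reduction formula (17.30) follows. □

Taking account of formula (17.29), we can now write the reduction formula

$$B(\alpha,\beta) = \frac{\beta - 1}{\alpha + \beta - 1}\, B(\alpha, \beta - 1) \tag{17.30'}$$

on the parameter $\beta$, assuming, of course, that $\beta > 1$.

It can be seen immediately from the definition of the beta function that $B(\alpha, 1) = \frac{1}{\alpha}$,
and so for $n \in \mathbb{N}$ we obtain

$$B(\alpha, n) = \frac{n - 1}{\alpha + n - 1} \cdot \frac{n - 2}{\alpha + n - 2} \cdot \ldots \cdot \frac{n - (n - 1)}{\alpha + n - (n - 1)}\, B(\alpha, 1) = \frac{(n - 1)!}{\alpha(\alpha + 1) \cdot \ldots \cdot (\alpha + n - 1)}. \tag{17.31}$$

In particular, for $m, n \in \mathbb{N}$

$$B(m, n) = \frac{(m - 1)!\,(n - 1)!}{(m + n - 1)!}. \tag{17.32}$$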
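
The chain of reductions leading to (17.31) and (17.32) can be replayed in exact rational arithmetic. The following sketch (the function name `beta_int` is hypothetical) starts from $B(\alpha, 1) = 1/\alpha$ and applies (17.30') repeatedly; it covers only rational $\alpha$ and natural $n$.

```python
from fractions import Fraction
from math import factorial

def beta_int(alpha: Fraction, n: int) -> Fraction:
    """B(alpha, n) for natural n, built up from B(alpha, 1) = 1/alpha via (17.30')."""
    value = Fraction(1) / alpha
    for k in range(2, n + 1):
        # B(alpha, k) = (k - 1) / (alpha + k - 1) * B(alpha, k - 1)
        value *= Fraction(k - 1) / (alpha + k - 1)
    return value

m, n = 3, 4
print(beta_int(Fraction(m), n))
print(Fraction(factorial(m - 1) * factorial(n - 1), factorial(m + n - 1)))
```

Both printed values agree, as (17.32) predicts for $m = 3$, $n = 4$.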


d. Another Integral Representation of the Beta Function 


The following representation of the beta function is sometimes useful:

$$B(\alpha,\beta) = \int_0^{+\infty} \frac{y^{\alpha-1}}{(1 + y)^{\alpha+\beta}}\,dy. \tag{17.33}$$

Proof This representation can be obtained from (17.27) by the change of variables
$x = \frac{y}{1 + y}$. □

17.3.2 The Gamma Function 

a. Domain of Definition 

It can be seen by formula (17.28) that the integral defining the gamma function
converges at zero only for $\alpha > 0$, while it converges at infinity for all values of
$\alpha \in \mathbb{R}$, due to the presence of the rapidly decreasing factor $e^{-x}$.

Thus the gamma function is defined for $\alpha > 0$.

b. Smoothness and the Formula for the Derivatives 

The gamma function is infinitely differentiable, and

$$\Gamma^{(n)}(\alpha) = \int_0^{+\infty} x^{\alpha-1} \ln^n x\, e^{-x}\,dx. \tag{17.34}$$

Proof We first verify that the integral (17.34) converges uniformly with respect to
the parameter $\alpha$ on each closed interval $[a,b] \subset\, ]0,+\infty[$ for each fixed value of
$n \in \mathbb{N}$.

If $0 < a \le \alpha$, then (since $x^{a/2} \ln^n x \to 0$ as $x \to +0$) there exists $c_n > 0$ such
that

$$\left| x^{\alpha-1} \ln^n x\, e^{-x} \right| < x^{\frac{a}{2} - 1}$$

for $0 < x \le c_n$. Hence by the Weierstrass M-test for uniform convergence we
conclude that the integral

$$\int_0^{c_n} x^{\alpha-1} \ln^n x\, e^{-x}\,dx$$

converges uniformly with respect to $\alpha$ on the interval $[a,+\infty[$.
If $\alpha \le b < +\infty$, then for $x \ge 1$,

$$\left| x^{\alpha-1} \ln^n x\, e^{-x} \right| \le x^{b-1} \left| \ln^n x \right| e^{-x},$$

and we conclude similarly that the integral

$$\int_{c_n}^{+\infty} x^{\alpha-1} \ln^n x\, e^{-x}\,dx$$

converges uniformly with respect to $\alpha$ on the interval $]0,b]$.

Combining these conclusions, we find that the integral (17.34) converges
uniformly on every closed interval $[a,b] \subset\, ]0,+\infty[$.

But under these conditions differentiation under the integral sign in (17.28) is
justified. Hence, on any such closed interval, and hence on the entire open interval
$0 < \alpha$, the gamma function is infinitely differentiable and formula (17.34) holds. □


c. The Reduction Formula 

The relation

$$\Gamma(\alpha + 1) = \alpha \Gamma(\alpha) \tag{17.35}$$

holds. It is known as the reduction formula for the gamma function.

Proof Integrating by parts, we find that for $\alpha > 0$

$$\Gamma(\alpha + 1) := \int_0^{+\infty} x^{\alpha} e^{-x}\,dx = -x^{\alpha} e^{-x} \Big|_0^{+\infty} + \alpha \int_0^{+\infty} x^{\alpha-1} e^{-x}\,dx = \alpha \int_0^{+\infty} x^{\alpha-1} e^{-x}\,dx = \alpha \Gamma(\alpha). \quad □$$

Since $\Gamma(1) = \int_0^{+\infty} e^{-x}\,dx = 1$, we conclude that for $n \in \mathbb{N}$

$$\Gamma(n + 1) = n!. \tag{17.36}$$

Thus the gamma function turns out to be closely connected with the
number-theoretic function $n!$.


d. The Euler-Gauss Formula 


This is the name usually given to the following equality:

$$\Gamma(\alpha) = \lim_{n \to \infty} n^{\alpha} \cdot \frac{(n - 1)!}{\alpha(\alpha + 1) \cdot \ldots \cdot (\alpha + n - 1)}. \tag{17.37}$$

Proof To prove this formula, we make the change of variable $x = \ln\frac{1}{u}$ in the integral
(17.28), resulting in the following integral representation of the gamma function:

$$\Gamma(\alpha) = \int_0^1 \ln^{\alpha-1}\left(\frac{1}{u}\right) du. \tag{17.38}$$

It was shown in Example 3 of Sect. 16.3 that the sequence of functions
$f_n(u) = n(1 - u^{1/n})$ increases monotonically and converges to $\ln\left(\frac{1}{u}\right)$ on the
interval $0 < u < 1$ as $n \to \infty$. Using Corollary 2 of Sect. 17.2 (see also Example 10
of Sect. 17.2), we conclude that for $\alpha \ge 1$

$$\int_0^1 \ln^{\alpha-1}\left(\frac{1}{u}\right) du = \lim_{n \to \infty} n^{\alpha-1} \int_0^1 (1 - u^{1/n})^{\alpha-1}\,du. \tag{17.39}$$

Making the change of variable $u = v^n$ in the last integral, we find by (17.38),
(17.39), (17.27), (17.29), and (17.31) that

$$\begin{aligned}
\Gamma(\alpha) &= \lim_{n \to \infty} n^{\alpha} \int_0^1 v^{n-1} (1 - v)^{\alpha-1}\,dv = \\
&= \lim_{n \to \infty} n^{\alpha} B(n, \alpha) = \lim_{n \to \infty} n^{\alpha} B(\alpha, n) = \\
&= \lim_{n \to \infty} n^{\alpha} \cdot \frac{(n - 1)!}{\alpha(\alpha + 1) \cdot \ldots \cdot (\alpha + n - 1)}.
\end{aligned}$$

Applying the reduction formulas (17.30) and (17.35) to the relation
$\Gamma(\alpha) = \lim_{n \to \infty} n^{\alpha} B(\alpha, n)$ just proved for $\alpha \ge 1$, we verify that formula (17.37) holds for
all $\alpha > 0$. □
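
The convergence in (17.37) is slow but easy to observe. The sketch below is illustrative only: the helper name `euler_gauss` is hypothetical, the product is accumulated in logarithms to avoid overflow, and Python's `math.gamma` is used as the reference value.

```python
import math

def euler_gauss(alpha: float, n: int) -> float:
    """n**alpha * (n-1)! / (alpha*(alpha+1)*...*(alpha+n-1)), computed via logarithms."""
    # (n-1)! / (alpha*(alpha+1)*...*(alpha+n-1)) = (1/alpha) * prod_{k=1}^{n-1} k / (alpha + k)
    log_value = alpha * math.log(n) - math.log(alpha)
    for k in range(1, n):
        log_value += math.log(k) - math.log(alpha + k)
    return math.exp(log_value)

for alpha in (0.5, 1.0, 3.5):
    for n in (10, 1000, 100000):
        print(f"alpha = {alpha}, n = {n}: {euler_gauss(alpha, n):.8f}  (Gamma = {math.gamma(alpha):.8f})")
```

In these runs the relative error decreases roughly in proportion to $1/n$.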


e. The Complement Formula 

For 0 < a < 1 the values a and 1 — a of the argument of the gamma function are 
mutually complementary, so that the equality 

JT 


r(a) ■ r( 1 — a) — (0 < a < 1 ) 

sinjroi 


(17.40) 

is called the complement formula for the gamma function. 

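Proof Using the Euler-Gauss formula (17.37) and simple identities, we find that

$$\Gamma(\alpha)\Gamma(1-\alpha) = \lim_{n\to\infty}\Bigl(n^{\alpha}\frac{(n-1)!}{\alpha(\alpha+1)\cdot\ldots\cdot(\alpha+n-1)} \times n^{1-\alpha}\frac{(n-1)!}{(1-\alpha)(2-\alpha)\cdot\ldots\cdot(n-\alpha)}\Bigr) =$$

$$= \lim_{n\to\infty}\Bigl(n\,\frac{1}{\alpha\bigl(1+\frac{\alpha}{1}\bigr)\cdot\ldots\cdot\bigl(1+\frac{\alpha}{n-1}\bigr)} \times \frac{1}{\bigl(1-\frac{\alpha}{1}\bigr)\bigl(1-\frac{\alpha}{2}\bigr)\cdot\ldots\cdot\bigl(1-\frac{\alpha}{n-1}\bigr)(n-\alpha)}\Bigr) =$$

$$= \frac{1}{\alpha}\lim_{n\to\infty}\frac{1}{\bigl(1-\frac{\alpha^2}{1^2}\bigr)\bigl(1-\frac{\alpha^2}{2^2}\bigr)\cdot\ldots\cdot\bigl(1-\frac{\alpha^2}{(n-1)^2}\bigr)}.$$

Hence for $0 < \alpha < 1$

$$\Gamma(\alpha)\Gamma(1-\alpha) = \frac{1}{\alpha}\prod_{n=1}^{\infty}\frac{1}{1-\frac{\alpha^2}{n^2}}. \tag{17.41}$$

But the following expansion is classical:

$$\sin\pi\alpha = \pi\alpha\prod_{n=1}^{\infty}\Bigl(1-\frac{\alpha^2}{n^2}\Bigr). \tag{17.42}$$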

(We shall not take the time to prove this formula just now, since it will be obtained 
as a simple example of the use of the general theory when we study Fourier series. 

See Example 6 of Sect. 18.2.) 

Comparing relations (17.41) and (17.42), we obtain (17.40). □ 
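The comparison of (17.41) with (17.40) can also be checked numerically. The sketch below (the helper name `reflection_partial_product` and the sample value $\alpha = 0.3$ are assumptions made for illustration) evaluates a partial product of (17.41) and compares it with $\pi/\sin\pi\alpha$ and with the product of the library gamma values.

```python
import math


def reflection_partial_product(alpha: float, n: int) -> float:
    """Partial product of (17.41): (1/alpha) * prod_{k=1}^{n} 1 / (1 - alpha^2 / k^2)."""
    product = 1.0
    for k in range(1, n + 1):
        product *= 1.0 - alpha * alpha / (k * k)
    return 1.0 / (alpha * product)


if __name__ == "__main__":
    alpha = 0.3
    print(reflection_partial_product(alpha, 100_000))   # partial product of (17.41)
    print(math.pi / math.sin(math.pi * alpha))           # right-hand side of (17.40)
    print(math.gamma(alpha) * math.gamma(1.0 - alpha))   # left-hand side of (17.40)
```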

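It follows in particular from (17.40) that

$$\Gamma\Bigl(\frac{1}{2}\Bigr) = \sqrt{\pi}. \tag{17.43}$$

We observe that

$$\Gamma\Bigl(\frac{1}{2}\Bigr) = \int_0^{+\infty} x^{-1/2}e^{-x}\,dx = 2\int_0^{+\infty} e^{-u^2}\,du,$$

and thus we again arrive at the Euler-Poisson integral

$$\int_0^{+\infty} e^{-u^2}\,du = \frac{1}{2}\sqrt{\pi}.$$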


17.3.3 Connection Between the Beta and Gamma Functions 


Comparing formulas (17.32) and (17.36), one may suspect the following connection:

$$B(\alpha,\beta) = \frac{\Gamma(\alpha)\cdot\Gamma(\beta)}{\Gamma(\alpha+\beta)} \tag{17.44}$$

between the beta and gamma functions. Let us prove this formula.


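Proof We remark that for $y > 0$

$$\Gamma(\alpha) = y^{\alpha}\int_0^{+\infty} x^{\alpha-1}e^{-xy}\,dx,$$

and therefore the following equality also holds:

$$\frac{\Gamma(\alpha+\beta)\cdot y^{\alpha-1}}{(1+y)^{\alpha+\beta}} = y^{\alpha-1}\int_0^{+\infty} x^{\alpha+\beta-1}e^{-(1+y)x}\,dx,$$

using which, taking account of (17.33), we obtain

$$\Gamma(\alpha+\beta)\cdot B(\alpha,\beta) = \int_0^{+\infty}\frac{\Gamma(\alpha+\beta)\,y^{\alpha-1}}{(1+y)^{\alpha+\beta}}\,dy =$$

$$= \int_0^{+\infty}\Bigl(y^{\alpha-1}\int_0^{+\infty} x^{\alpha+\beta-1}e^{-(1+y)x}\,dx\Bigr)dy \overset{!}{=}$$

$$\overset{!}{=} \int_0^{+\infty}\Bigl(\int_0^{+\infty} y^{\alpha-1}x^{\alpha+\beta-1}e^{-(1+y)x}\,dy\Bigr)dx =$$

$$= \int_0^{+\infty}\Bigl(x^{\beta-1}e^{-x}\int_0^{+\infty}(xy)^{\alpha-1}e^{-xy}\,x\,dy\Bigr)dx =$$

$$= \int_0^{+\infty}\Bigl(x^{\beta-1}e^{-x}\int_0^{+\infty} u^{\alpha-1}e^{-u}\,du\Bigr)dx = \Gamma(\alpha)\cdot\Gamma(\beta).$$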


All that remains is to explain the equality distinguished by the exclamation point. 
But that is exactly what was done in Example 16 of Sect. 17.2. □ 
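Formula (17.44) lends itself to a quick numerical sanity check. The following sketch approximates $B(\alpha,\beta) = \int_0^1 x^{\alpha-1}(1-x)^{\beta-1}\,dx$ by the midpoint rule and compares it with the quotient of gamma values. The function names and the restriction to $\alpha, \beta \ge 1$ (so that the integrand is bounded) are assumptions of this illustration, not part of the proof.

```python
import math


def beta_midpoint(a: float, b: float, steps: int = 100_000) -> float:
    """Midpoint-rule approximation of B(a, b) = integral_0^1 x^(a-1) (1-x)^(b-1) dx, for a, b >= 1."""
    h = 1.0 / steps
    total = 0.0
    for i in range(steps):
        x = (i + 0.5) * h  # midpoint of the i-th subinterval, strictly inside ]0, 1[
        total += x ** (a - 1.0) * (1.0 - x) ** (b - 1.0)
    return h * total


def beta_via_gamma(a: float, b: float) -> float:
    """Right-hand side of (17.44)."""
    return math.gamma(a) * math.gamma(b) / math.gamma(a + b)


if __name__ == "__main__":
    for a, b in ((2.0, 3.0), (2.5, 1.5)):
        print(a, b, beta_midpoint(a, b), beta_via_gamma(a, b))
```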


17.3.4 Examples 


In conclusion let us consider a small group of interconnected examples in which the
special functions $B$ and $\Gamma$ introduced here occur.


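Example 1

$$\int_0^{\pi/2}\sin^{\alpha-1}\varphi\,\cos^{\beta-1}\varphi\,d\varphi = \frac{1}{2}B\Bigl(\frac{\alpha}{2},\frac{\beta}{2}\Bigr). \tag{17.45}$$

Proof To prove this, it suffices to make the change of variable $\sin^2\varphi = x$ in the integral. □

Using formula (17.44), we can express the integral (17.45) in terms of the gamma function. In particular, taking account of (17.43), we obtain

$$\int_0^{\pi/2}\sin^{\alpha-1}\varphi\,d\varphi = \int_0^{\pi/2}\cos^{\alpha-1}\varphi\,d\varphi = \frac{\sqrt{\pi}}{2}\,\frac{\Gamma\bigl(\frac{\alpha}{2}\bigr)}{\Gamma\bigl(\frac{\alpha+1}{2}\bigr)}. \tag{17.46}$$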


Example 2 A one-dimensional ball of radius $r$ is simply an open interval and its
(one-dimensional) volume $V_1(r)$ is the length $2r$ of that interval. Thus $V_1(r) = 2r$.




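If we assume that the $((n-1)$-dimensional) volume of the $(n-1)$-dimensional ball of radius $r$ is expressed by the formula $V_{n-1}(r) = c_{n-1}r^{n-1}$, then, integrating over sections (see Example 3 of Sect. 11.4), we obtain

$$V_n(r) = \int_{-r}^{r} c_{n-1}\bigl(r^2 - x^2\bigr)^{\frac{n-1}{2}}\,dx = \Bigl(c_{n-1}\int_{-\pi/2}^{\pi/2}\cos^n\varphi\,d\varphi\Bigr)\cdot r^n,$$

that is, $V_n(r) = c_n r^n$, where

$$c_n = 2c_{n-1}\int_0^{\pi/2}\cos^n\varphi\,d\varphi.$$

By relations (17.46) we can rewrite this last equality as

$$c_n = \sqrt{\pi}\,\frac{\Gamma\bigl(\frac{n+1}{2}\bigr)}{\Gamma\bigl(\frac{n+2}{2}\bigr)}\,c_{n-1},$$

so that

$$c_n = (\sqrt{\pi})^{n-1}\,\frac{\Gamma\bigl(\frac{n+1}{2}\bigr)}{\Gamma\bigl(\frac{n+2}{2}\bigr)}\cdot\frac{\Gamma\bigl(\frac{n}{2}\bigr)}{\Gamma\bigl(\frac{n+1}{2}\bigr)}\cdot\ldots\cdot\frac{\Gamma\bigl(\frac{3}{2}\bigr)}{\Gamma\bigl(\frac{4}{2}\bigr)}\cdot c_1,$$

or, more briefly,

$$c_n = \pi^{\frac{n-1}{2}}\,\frac{\Gamma\bigl(\frac{3}{2}\bigr)}{\Gamma\bigl(\frac{n+2}{2}\bigr)}\,c_1.$$

But $c_1 = 2$, and $\Gamma\bigl(\frac{3}{2}\bigr) = \frac{1}{2}\Gamma\bigl(\frac{1}{2}\bigr) = \frac{1}{2}\sqrt{\pi}$, so that

$$c_n = \frac{\pi^{\frac{n}{2}}}{\Gamma\bigl(\frac{n+2}{2}\bigr)}.$$

Consequently,

$$V_n(r) = \frac{\pi^{\frac{n}{2}}}{\Gamma\bigl(\frac{n+2}{2}\bigr)}\,r^n,$$

or, what is the same,

$$V_n(r) = \frac{2\pi^{\frac{n}{2}}}{n\,\Gamma\bigl(\frac{n}{2}\bigr)}\,r^n. \tag{17.47}$$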


Example 3 It is clear from geometric considerations that $dV_n(r) = S_{n-1}(r)\,dr$, where $S_{n-1}(r)$ is the $(n-1)$-dimensional surface area of the sphere bounding the $n$-dimensional ball of radius $r$ in $\mathbb{R}^n$.

Thus $S_{n-1}(r) = \frac{dV_n}{dr}(r)$, and, taking account of (17.47), we obtain

$$S_{n-1}(r) = \frac{2\pi^{\frac{n}{2}}}{\Gamma\bigl(\frac{n}{2}\bigr)}\,r^{n-1}.$$
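The closed forms for the volume of the $n$-dimensional ball and the area of its bounding sphere are easy to evaluate. The following sketch (function names are our own) computes both through the gamma function and compares the low-dimensional cases with the familiar values $2r$, $\pi r^2$, $\frac{4}{3}\pi r^3$, $2\pi r$, and $4\pi r^2$.

```python
import math


def ball_volume(n: int, r: float = 1.0) -> float:
    """V_n(r) = pi^(n/2) / Gamma(n/2 + 1) * r^n, cf. (17.47)."""
    return math.pi ** (n / 2) / math.gamma(n / 2 + 1) * r ** n


def sphere_area(n: int, r: float = 1.0) -> float:
    """S_{n-1}(r) = 2 pi^(n/2) / Gamma(n/2) * r^(n-1), the derivative of V_n with respect to r."""
    return 2.0 * math.pi ** (n / 2) / math.gamma(n / 2) * r ** (n - 1)


if __name__ == "__main__":
    for n in range(1, 8):
        print(n, ball_volume(n), sphere_area(n))
```

Printing the table for larger $n$ shows that the volume of the unit ball eventually decreases as the dimension grows, since the gamma function in the denominator outgrows $\pi^{n/2}$.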
17.3.5 Problems and Exercises


1. Show that

a) $B(1/2, 1/2) = \pi$;

b) $B(\alpha, 1-\alpha) = \int_0^{\infty}\frac{x^{\alpha-1}}{1+x}\,dx$;

c) $\frac{\partial B}{\partial\alpha}(\alpha,\beta) = \int_0^1 x^{\alpha-1}(1-x)^{\beta-1}\ln x\,dx$;
f+oo . v Pdi-_ 

” _ 1 


^ fo ( a+bx q ) r 


e) 


r+°° dt 


= £±i); 


0 yi+x" « sin j 

rr. r+oo dx 27T . 

I; ^0 i+. v 3 - 373’ 

+00 jc “ _1 dx 71 


g) /o 

h) /o 


(0 < a < 1); 


-w^^ = ^(isk)(o<«<D; 

i) the length of the curve defined in polar coordinates by the equation r" — 
a n cos ncp, where n e N and a > 0, is ctB( j, 

2. Show that

a) $\Gamma(1) = \Gamma(2)$;

b) the derivative $\Gamma'$ of $\Gamma$ is zero at some point $x_0 \in\ ]1, 2[$;

c) the function $\Gamma'$ is monotonically increasing on the interval $]0, +\infty[$;

d) the function $\Gamma$ is monotonically decreasing on $]0, x_0]$ and monotonically increasing on $[x_0, +\infty[$;

e) the integral $\int_0^1\bigl(\ln\frac{1}{u}\bigr)^{x-1}\ln\ln\frac{1}{u}\,du$ equals zero if $x = x_0$;

f) $\Gamma(\alpha) \sim \frac{1}{\alpha}$ as $\alpha \to +0$;

g) $\lim_{n\to\infty}\int_0^{+\infty} e^{-x^n}\,dx = 1$.


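3. Euler’s formula $E := \prod_{k=1}^{n-1}\Gamma\bigl(\frac{k}{n}\bigr) = \frac{(2\pi)^{\frac{n-1}{2}}}{\sqrt{n}}$.

a) Show that $E^2 = \prod_{k=1}^{n-1}\Gamma\bigl(\frac{k}{n}\bigr)\Gamma\bigl(\frac{n-k}{n}\bigr)$.

b) Verify that $E^2 = \dfrac{\pi^{n-1}}{\sin\frac{\pi}{n}\,\sin 2\frac{\pi}{n}\cdot\ldots\cdot\sin(n-1)\frac{\pi}{n}}$.

c) Starting from the identity $\frac{z^n - 1}{z - 1} = \prod_{k=1}^{n-1}\bigl(z - e^{i\frac{2k\pi}{n}}\bigr)$, let $z$ tend to 1 to obtain

$$n = \prod_{k=1}^{n-1}\bigl(1 - e^{i\frac{2k\pi}{n}}\bigr),$$

and from this relation derive the relation

$$n = 2^{n-1}\prod_{k=1}^{n-1}\sin\frac{k\pi}{n}.$$

d) Using this last equality, obtain Euler’s formula.

4. Legendre’s formula $\Gamma(\alpha)\Gamma\bigl(\alpha + \frac{1}{2}\bigr) = \frac{\sqrt{\pi}}{2^{2\alpha-1}}\Gamma(2\alpha)$.

a) Show that $B(\alpha,\alpha) = 2\int_0^{1/2}\bigl(\frac{1}{4} - \bigl(\frac{1}{2} - x\bigr)^2\bigr)^{\alpha-1}\,dx$.

b) By a change of variable in this last integral, prove that $B(\alpha,\alpha) = \frac{1}{2^{2\alpha-1}}B\bigl(\frac{1}{2},\alpha\bigr)$.

c) Now obtain Legendre’s formula.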


5. Retaining the notation of Problem 5 of Sect. 17.1, show a route by which the second, more delicate part of the problem can be carried out using the Euler integrals.

a) Observe that $\tilde{k} = k$ for $k = \frac{1}{\sqrt{2}}$ and

$$\tilde{E} = E = \int_0^{\pi/2}\sqrt{1 - \frac{1}{2}\sin^2\varphi}\,d\varphi, \qquad \tilde{K} = K = \int_0^{\pi/2}\frac{d\varphi}{\sqrt{1 - \frac{1}{2}\sin^2\varphi}}.$$

b) After a suitable change of variable these integrals can be brought into a form from which it follows that for $k = 1/\sqrt{2}$

$$K = \frac{1}{2\sqrt{2}}B(1/4, 1/2) \quad\text{and}\quad 2E - K = \frac{1}{2\sqrt{2}}B(3/4, 1/2).$$

c) It now results that for $k = 1/\sqrt{2}$

$$E\tilde{K} + \tilde{E}K - K\tilde{K} = \pi/2.$$


6. Raabe’s$^2$ integral $\int_0^1\ln\Gamma(x)\,dx$.

Show that

a) $\int_0^1\ln\Gamma(x)\,dx = \int_0^1\ln\Gamma(1-x)\,dx$.

b) $\int_0^1\ln\Gamma(x)\,dx = \frac{1}{2}\ln\pi - \frac{1}{\pi}\int_0^{\pi/2}\ln\sin x\,dx$.

c) In sinx dx = In sin 2x dx — § In 2. 

d) $\int_0^{\pi/2}\ln\sin x\,dx = -\frac{\pi}{2}\ln 2$.

e) $\int_0^1\ln\Gamma(x)\,dx = \ln\sqrt{2\pi}$.

7. Using the equality 


1 

X s 


1 

ru) 



dy 


and justifying the reversal in the order of the corresponding integrations, verify that 

a) /o «« d> = _^ ( 0 <„<,). 


2 J.L. Raabe (1801-1859) - Swiss mathematician and physicist.

c) Now obtain once again the value of the Dirichlet integral $\int_0^{+\infty}\frac{\sin x}{x}\,dx$ and
the value of the Fresnel integrals $\int_0^{+\infty}\cos x^2\,dx$ and $\int_0^{+\infty}\sin x^2\,dx$.

8. Show that for $\alpha > 1$

$$\int_0^{+\infty}\frac{x^{\alpha-1}}{e^x - 1}\,dx = \Gamma(\alpha)\cdot\zeta(\alpha),$$

where $\zeta(\alpha) = \sum_{n=1}^{\infty}\frac{1}{n^{\alpha}}$ is the Riemann zeta function.

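9. Gauss’ formula. In Example 6 of Sect. 16.3 we exhibited the function

$$F(\alpha,\beta,\gamma,x) := 1 + \sum_{n=1}^{\infty}\frac{\alpha(\alpha+1)\cdots(\alpha+n-1)\,\beta(\beta+1)\cdots(\beta+n-1)}{n!\,\gamma(\gamma+1)\cdots(\gamma+n-1)}\,x^n,$$

which was introduced by Gauss and is the sum of this hypergeometric series. It turns out that the following formula of Gauss holds:

$$F(\alpha,\beta,\gamma,1) = \frac{\Gamma(\gamma)\cdot\Gamma(\gamma-\alpha-\beta)}{\Gamma(\gamma-\alpha)\cdot\Gamma(\gamma-\beta)}.$$

a) By developing the function $(1 - tx)^{-\beta}$ in a series, show that for $\alpha > 0$, $\gamma - \alpha > 0$, and $0 < x < 1$ the integral

$$P(x) = \int_0^1 t^{\alpha-1}(1-t)^{\gamma-\alpha-1}(1-tx)^{-\beta}\,dt$$

can be represented as

$$P(x) = \sum_{n=0}^{\infty}P_n\cdot x^n,$$

where $P_n = \frac{\beta(\beta+1)\cdots(\beta+n-1)}{n!}\cdot\frac{\Gamma(\alpha+n)\cdot\Gamma(\gamma-\alpha)}{\Gamma(\gamma+n)}$.

b) Show that

$$P_n = \frac{\Gamma(\alpha)\cdot\Gamma(\gamma-\alpha)}{\Gamma(\gamma)}\cdot\frac{\alpha(\alpha+1)\cdots(\alpha+n-1)\,\beta(\beta+1)\cdots(\beta+n-1)}{n!\,\gamma(\gamma+1)\cdots(\gamma+n-1)}.$$

c) Now prove that for $\alpha > 0$, $\gamma - \alpha > 0$, and $0 < x < 1$

$$P(x) = \frac{\Gamma(\alpha)\cdot\Gamma(\gamma-\alpha)}{\Gamma(\gamma)}\cdot F(\alpha,\beta,\gamma,x).$$

d) Under the additional condition $\gamma - \alpha - \beta > 0$ justify the possibility of passing to the limit as $x \to 1 - 0$ on both sides of the last equality, and show that

$$\frac{\Gamma(\alpha)\cdot\Gamma(\gamma-\alpha-\beta)}{\Gamma(\gamma-\beta)} = \frac{\Gamma(\alpha)\cdot\Gamma(\gamma-\alpha)}{\Gamma(\gamma)}\,F(\alpha,\beta,\gamma,1),$$

from which Gauss’ formula follows.

10. Stirling’s$^3$ formula. Show that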



e 12(n+l) 



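f) $b_n = a_n e^{-\frac{1}{12n}}$ is a monotonically increasing sequence;

g) $n! = c\,n^{n+1/2}e^{-n+\frac{\theta_n}{12n}}$, where $0 < \theta_n < 1$, and $c = \lim_{n\to\infty}a_n = \lim_{n\to\infty}b_n$;

h) the relation $\sin\pi x = \pi x\prod_{n=1}^{\infty}\bigl(1 - \frac{x^2}{n^2}\bigr)$ with $x = 1/2$ implies Wallis’ formula

$$\sqrt{\pi} = \lim_{n\to\infty}\frac{(n!)^2\,2^{2n}}{(2n)!}\cdot\frac{1}{\sqrt{n}};$$

i) Stirling’s formula holds:

$$n! = \sqrt{2\pi n}\,\Bigl(\frac{n}{e}\Bigr)^n e^{\frac{\theta_n}{12n}}, \quad 0 < \theta_n < 1;$$

j) $\Gamma(x+1) \sim \sqrt{2\pi x}\,\bigl(\frac{x}{e}\bigr)^x$ as $x \to +\infty$.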

11. Show that $\Gamma(x) = \sum_{n=0}^{\infty}\frac{(-1)^n}{n!}\,\frac{1}{x+n} + \int_1^{\infty}t^{x-1}e^{-t}\,dt$. This relation makes it
possible to define $\Gamma(z)$ for complex $z \in \mathbb{C}$ except at the points $0, -1, -2, \ldots$.


17.4 Convolution of Functions and Elementary Facts 
About Generalized Functions 

17.4.1 Convolution in Physical Problems (Introductory Considerations)

A variety of devices and systems in the living and nonliving natural world carry out
their functions responding to a stimulus $f$ with an appropriate signal $\tilde{f}$. In other
words, each such device or system is an operator $A$ that transforms the incoming
signal $f$ into the outgoing signal $\tilde{f} = Af$. Naturally, each such operator has its own
domain of perceivable signals (domain of definition) and its own form of response
to them (range of values). A convenient mathematical model for a large class of
actual processes and machines is a linear operator $A$ that preserves translations.


3 J. Stirling (1692-1770) - Scottish mathematician.

Definition 1 Let $A$ be a linear operator acting on a vector space of real- or complex-valued functions defined on $\mathbb{R}$. We denote by $T_{t_0}$ the shift operator or translation operator acting on the same space according to the rule

$$(T_{t_0}f)(t) := f(t - t_0).$$

The operator $A$ is translation-invariant (or preserves translations) if

$$A(T_{t_0}f) = T_{t_0}(Af)$$

for every function $f$ in the domain of definition of the operator $A$.

If $t$ is time, the relation $A\circ T_{t_0} = T_{t_0}\circ A$ can be interpreted as the assumption
that the properties of the device $A$ are time-invariant: the reactions of the device to
the signals $f(t)$ and $f(t - t_0)$ differ only in a shift by the amount $t_0$ in time, nothing
more.

For every device $A$ the following two fundamental problems arise: first, to predict
the reaction $\tilde{f}$ of the device to an arbitrary input $f$; second, knowing the output $\tilde{f}$,
to determine, if possible, the input signal $f$.

At this point we shall solve the first of these two problems heuristically in application to a translation-invariant linear operator $A$. It is a simple, but very important
fact that in order to describe the response $\tilde{f}$ of such a device $A$ to any input signal
$f$, it suffices to know the response $E$ of $A$ to a pulse $\delta$.

Definition 2 The response $E(t)$ of the device $A$ to a unit pulse $\delta$ is called the system
function of the device (in optics) or the transient pulse function of the device (in
electrical engineering).

As a rule, we shall use the briefer term “system function”.

Without going into detail just yet, we shall say that a pulse can be imitated, for
example, by the function $\delta_\alpha(t)$ shown in Fig. 17.1, and this imitation is assumed to
become closer as the duration $\alpha$ of the “pulse” gets shorter, preserving the relation
$\alpha\cdot\frac{1}{\alpha} = 1$. Instead of step functions, one may imitate a pulse using smooth functions
(Fig. 17.2) while preserving the natural conditions:

$$f_\alpha \ge 0, \qquad \int_{\mathbb{R}}f_\alpha(t)\,dt = 1, \qquad \int_{U(0)}f_\alpha(t)\,dt \to 1 \ \text{ as } \alpha \to 0,$$

where $U(0)$ is an arbitrary neighborhood of the point $t = 0$.

The response of the device $A$ to an ideal unit pulse (denoted, following Dirac,
by the letter $\delta$) should be regarded as a function $E(t)$ to which the response of the
device $A$ to an input approximating $\delta$ tends as the imitation improves. Naturally a
certain continuity of the operator $A$ is assumed (not made precise as yet), that is,
continuity of the change in the response $\tilde{f}$ of the device under a continuous change
in the input $f$.

[Fig. 17.1: graph of the step function $\delta_\alpha(t)$, with $0$, $\alpha$, and $t$ marked on the horizontal axis]

[Fig. 17.3]

For example, if we take a sequence $\{\Delta_n(t)\}$ of step functions $\Delta_n(t) := \delta_{1/n}(t)$
(Fig. 17.1), then, setting $A\Delta_n =: E_n$, we obtain $A\delta := E = \lim_{n\to\infty}E_n = \lim_{n\to\infty}A\Delta_n$.

Let us now consider the input signal $f$ in Fig. 17.3 and the piecewise constant
function $l_h(t) = \sum_i f(\tau_i)\delta_h(t - \tau_i)h$. Since $l_h \to f$ as $h \to 0$, one must assume
that

$$\tilde{l}_h = Al_h \to Af = \tilde{f} \ \text{ as } h \to 0.$$

But if the operator $A$ is linear and preserves translates, then

$$\tilde{l}_h(t) = \sum_i f(\tau_i)E_h(t - \tau_i)h,$$

where $E_h = A\delta_h$. Thus, as $h \to 0$ we finally obtain

$$\tilde{f}(t) = \int_{\mathbb{R}}f(\tau)E(t - \tau)\,d\tau. \tag{17.48}$$

Formula (17.48) solves the first of the two problems indicated above. It represents
the response $\tilde{f}(t)$ of the device $A$ in the form of a special integral depending on the
parameter $t$. This integral is completely determined by the input signal $f(t)$ and
the system function $E(t)$ of the device $A$. From the mathematical point of view the
device $A$ and the integral (17.48) are simply identical.

We note incidentally that the problem of determining the input signal from the
output $\tilde{f}$ now reduces to solving the integral equation (17.48) for $f$.

Definition 3 The convolution of the functions $u : \mathbb{R} \to \mathbb{C}$ and $v : \mathbb{R} \to \mathbb{C}$ is the
function $u * v : \mathbb{R} \to \mathbb{C}$ defined by the relation

$$(u * v)(x) := \int_{\mathbb{R}}u(y)v(x - y)\,dy, \tag{17.49}$$

provided this improper integral exists for all $x \in \mathbb{R}$.

Thus formula (17.48) asserts that the response of a linear device $A$ that preserves
translates to an input given by the function $f$ is the convolution $f * E$ of the function
$f$ and the system function $E$ of the device $A$.
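A discrete analogue makes the content of formula (17.48) tangible. In the sketch below, the "device" is a hypothetical linear, translation-invariant recurrence chosen only for illustration (it does not come from the text); its response to a unit pulse plays the role of the system function $E$, and the response to an arbitrary input is recovered as a discrete convolution with $E$.

```python
import math


def device(signal):
    """A hypothetical linear, translation-invariant device: y[n] = 0.5 * y[n-1] + x[n], y[-1] = 0."""
    out, prev = [], 0.0
    for x in signal:
        prev = 0.5 * prev + x
        out.append(prev)
    return out


def convolve(u, v):
    """Discrete analogue of (u * v)(x) = sum over y of u(y) v(x - y), for sequences starting at index 0."""
    n = len(u)
    return [sum(u[j] * v[i - j] for j in range(i + 1)) for i in range(n)]


N = 16
pulse = [1.0] + [0.0] * (N - 1)            # discrete unit pulse
E = device(pulse)                          # system function: response to the pulse
f = [math.sin(0.3 * i) for i in range(N)]  # an arbitrary input signal
response = device(f)                       # direct response A f
via_convolution = convolve(f, E)           # f * E

if __name__ == "__main__":
    print(max(abs(a - b) for a, b in zip(response, via_convolution)))
```

The printed maximal discrepancy is at the level of floating-point rounding, and `convolve(E, f)` gives the same sequence, in line with the symmetry of convolution.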


17.4.2 General Properties of Convolution 

Now let us consider the basic properties of convolution from a mathematical point 
of view. 


a. Sufficient Conditions for Existence 

We first recall certain definitions and notation. 

Let $f : G \to \mathbb{C}$ be a real- or complex-valued function defined on an open set $G \subset \mathbb{R}$.

The function $f$ is locally integrable on $G$ if every point $x \in G$ has a neighborhood $U(x) \subset G$ in which the function $f|_{U(x)}$ is integrable. In particular, if $G = \mathbb{R}$,
the condition of local integrability of the function $f$ is obviously equivalent to the
relation $f|_{[a,b]} \in \mathcal{R}[a,b]$ for every closed interval $[a,b]$.

The support of the function $f$ (denoted $\operatorname{supp} f$) is the closure in $G$ of the set
$\{x \in G \mid f(x) \ne 0\}$.

A function $f$ is of compact support (in $G$) if its support is a compact set.

The set of functions $f : G \to \mathbb{C}$ having continuous derivatives in $G$ up to order $m$
($0 \le m \le \infty$) inclusive, is usually denoted $C^{(m)}(G)$ and the subset of it consisting of
functions of compact support is denoted $C_0^{(m)}(G)$. In the case when $G = \mathbb{R}$, instead
of $C^{(m)}(\mathbb{R})$ and $C_0^{(m)}(\mathbb{R})$ it is customary to use the abbreviation $C^{(m)}$ and $C_0^{(m)}$
respectively.

We now exhibit the most frequently encountered cases of convolution of functions, in which its existence can be established without difficulty.

Proposition 1 Each of the conditions listed below is sufficient for the existence of
the convolution $u * v$ of locally integrable functions $u : \mathbb{R} \to \mathbb{C}$ and $v : \mathbb{R} \to \mathbb{C}$.

1) The functions $|u|^2$ and $|v|^2$ are integrable on $\mathbb{R}$.

2) One of the functions $|u|$, $|v|$ is integrable on $\mathbb{R}$ and the other is bounded on $\mathbb{R}$.

3) One of the functions $u$ and $v$ is of compact support.


Proof 1) By the Cauchy-Bunyakovskii inequality

$$\Bigl(\int_{\mathbb{R}}|u(y)v(x - y)|\,dy\Bigr)^2 \le \int_{\mathbb{R}}|u|^2(y)\,dy\int_{\mathbb{R}}|v|^2(x - y)\,dy,$$

from which it follows that the integral (17.49) exists, since

$$\int_{-\infty}^{+\infty}|v|^2(x - y)\,dy = \int_{-\infty}^{+\infty}|v|^2(y)\,dy.$$

2) If, for example, $|u|$ is integrable on $\mathbb{R}$ and $|v| \le M$ on $\mathbb{R}$, then

$$\int_{\mathbb{R}}|u(y)v(x - y)|\,dy \le M\int_{\mathbb{R}}|u|(y)\,dy < +\infty.$$

3) Suppose $\operatorname{supp} u \subset [a,b] \subset \mathbb{R}$. Then obviously

$$\int_{\mathbb{R}}u(y)v(x - y)\,dy = \int_a^b u(y)v(x - y)\,dy.$$

Since $u$ and $v$ are locally integrable, this last integral exists for every value of
$x \in \mathbb{R}$.

The case when the function of compact support is $v$ reduces to the one just considered by the change of variable $x - y = z$. □


b. Symmetry 

Proposition 2 If the convolution $u * v$ exists, then the convolution $v * u$ also exists,
and the following equality holds:

$$u * v = v * u. \tag{17.50}$$

Proof Making the change of variable $x - y = z$ in (17.49), we obtain

$$u * v(x) := \int_{-\infty}^{+\infty}u(y)v(x - y)\,dy = \int_{-\infty}^{+\infty}v(z)u(x - z)\,dz =: v * u(x). \ \square$$

c. Translation Invariance


Suppose, as above, that $T_{x_0}$ is the shift operator, that is, $(T_{x_0}f)(x) = f(x - x_0)$.

Proposition 3 If the convolution $u * v$ of the functions $u$ and $v$ exists, then the
following equalities hold:

$$T_{x_0}(u * v) = T_{x_0}u * v = u * T_{x_0}v. \tag{17.51}$$


Proof If we recall the physical meaning of formula (17.48), the first of these equalities becomes obvious, and the second can then be obtained from the symmetry of
convolution. Nevertheless, let us give a formal verification of the first equality:

$$\bigl(T_{x_0}(u * v)\bigr)(x) := (u * v)(x - x_0) := \int_{-\infty}^{+\infty}u(y)v(x - x_0 - y)\,dy =$$

$$= \int_{-\infty}^{+\infty}u(y - x_0)v(x - y)\,dy = \int_{-\infty}^{+\infty}(T_{x_0}u)(y)v(x - y)\,dy =: \bigl((T_{x_0}u) * v\bigr)(x).$$

□


d. Differentiation of a Convolution 

The convolution of functions is an integral depending on a parameter, and differen- 
tiation of it can be carried out in accordance with the general rules for differentiating 
such integrals, provided of course suitable hypotheses hold. 

The conditions under which the convolution (17.49) of the functions u and v is 
continuously differentiable are demonstrably satisfied if, for example, u is continu- 
ous and v is a smooth function and one of the two is of compact support. 

Proof Indeed, if we confine the variation of the parameter to any finite interval, then 
under these hypotheses the entire integral (17.49) reduces to the integral over a finite 
closed interval independent of x. Such an integral can be differentiated with respect 
to a parameter in accordance with the classical rule of Leibniz. □ 

In general the following proposition holds: 

Proposition 4 If $u$ is a locally integrable function and $v$ is a $C^{(m)}$ function of compact support ($0 \le m \le +\infty$), then $(u * v) \in C^{(m)}$ and$^4$

$$D^k(u * v) = u * (D^k v). \tag{17.52}$$


4 Here $D$ is differentiation, and, as usual, $D^k v = v^{(k)}$.

Proof When $u$ is a continuous function, the proposition follows immediately from
what was just proved above. In its general form it can be obtained if we also keep in
mind the observation made in Problem 6 of Sect. 17.1. □

Remark 1 In view of the commutativity of convolution (formula (17.50)) Proposi- 
tion 4 of course remains valid if u and v are interchanged, preserving the left-hand 
side of Eq. (17.52). 

Formula (17.52) shows that convolution commutes with the differentiation op- 
erator, just as it commutes with translation (formula (17.51)). But while (17.51) is 
symmetric in u and v, one cannot in general interchange u and v in the right-hand 
side of (17.52), since u may fail to have the corresponding derivative. The fact that 
the convolution u * v, as one can see by (17.52), may still turn out to be a differ- 
entiable function, might suggest that the hypotheses of Proposition 4 are sufficient, 
but not necessary for differentiability of the convolution. 

Example 1 Let $f$ be a locally integrable function and $\delta_\alpha$ the “step” function shown
in Fig. 17.1. Then

$$(f * \delta_\alpha)(x) = \int_{-\infty}^{+\infty}f(y)\delta_\alpha(x - y)\,dy = \frac{1}{\alpha}\int_{x-\alpha}^{x}f(y)\,dy, \tag{17.53}$$

and consequently if $f$ is continuous at the points $x$ and $x - \alpha$, then the convolution
$f * \delta_\alpha$ is differentiable, due to the averaging (smoothing) property of the integral.

The conditions for differentiability of the convolution stated in Proposition 4 are, 
however, completely sufficient for practically all the cases one encounters in which 
formula (17.52) is applied. For that reason we shall not attempt to refine them any 
further, preferring to illustrate some beautiful new possibilities that open up as a 
result of the smoothing action of convolution just discovered. 


17.4.3 Approximate Identities and the Weierstrass Approximation 
Theorem 

We remark that the integral in (17.53) gives the average value of the function
$f$ on the interval $[x - \alpha, x]$, and therefore, if $f$ is continuous at $x$, the relation
$(f * \delta_\alpha)(x) \to f(x)$ obviously holds as $\alpha \to 0$. In accordance with the introductory
considerations of Sect. 17.4.1 that gave a picture of the $\delta$-function, we would like to
write this last relation as the limiting equality

$$(f * \delta)(x) = f(x), \ \text{ if } f \text{ is continuous at } x. \tag{17.54}$$

This equality shows that the $\delta$-function can be interpreted as the identity (neutral) element with respect to convolution. Equality (17.54) can be regarded as making perfect sense if it is shown that every family of functions converging to the
$\delta$-function has the same property as the special family $\delta_\alpha$ of (17.53).

Let us now pass to precise statements and introduce the following useful definition.

Definition 4 The family $\{\Delta_\alpha;\ \alpha \in A\}$ of functions $\Delta_\alpha : \mathbb{R} \to \mathbb{R}$ depending on the
parameter $\alpha \in A$ forms an approximate identity over a base $\mathcal{B}$ in $A$ if the following
three conditions hold:

a) all the functions in the family are nonnegative ($\Delta_\alpha \ge 0$);

b) for every function $\Delta_\alpha$ in the family, $\int_{\mathbb{R}}\Delta_\alpha(x)\,dx = 1$;

c) for every neighborhood $U$ of $0 \in \mathbb{R}$, $\lim_{\mathcal{B}}\int_U\Delta_\alpha(x)\,dx = 1$.

Taking account of the first two conditions, we see that this last condition is equivalent to the relation $\lim_{\mathcal{B}}\int_{\mathbb{R}\setminus U}\Delta_\alpha(x)\,dx = 0$.

The original family of “step” functions $\delta_\alpha$ considered in Example 1 of Sect. 17.4.1
is an approximate identity as $\alpha \to 0$. We shall now give other examples of approximate identities.

Example 2 Let $\varphi : \mathbb{R} \to \mathbb{R}$ be an arbitrary nonnegative function of compact support
that is integrable over $\mathbb{R}$ and satisfies $\int_{\mathbb{R}}\varphi(x)\,dx = 1$. For $\alpha > 0$ we construct the
functions $\Delta_\alpha(x) := \frac{1}{\alpha}\varphi\bigl(\frac{x}{\alpha}\bigr)$. The family of these functions is obviously an approximate identity as $\alpha \to +0$ (see Fig. 17.2).

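Example 3 Consider the sequence of functions

$$\Delta_n(x) = \begin{cases}\dfrac{(1 - x^2)^n}{\int_{|x|<1}(1 - x^2)^n\,dx} & \text{for } |x| \le 1,\\[2mm] 0 & \text{for } |x| > 1.\end{cases}$$

To establish that this family is an approximate identity we need only verify that
condition c) of Definition 4 holds in addition to a) and b). But for every $\varepsilon \in\ ]0, 1]$
we have

$$0 \le \int_\varepsilon^1(1 - x^2)^n\,dx \le \int_\varepsilon^1(1 - \varepsilon^2)^n\,dx = (1 - \varepsilon^2)^n(1 - \varepsilon) \to 0, \ \text{ as } n \to \infty.$$

At the same time,

$$\int_0^1(1 - x^2)^n\,dx > \int_0^1(1 - x)^n\,dx = \frac{1}{n+1}.$$

Therefore condition c) holds.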


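Example 4 Let

$$\Delta_n(x) = \begin{cases}\cos^{2n}(x)\Big/\displaystyle\int_{-\pi/2}^{\pi/2}\cos^{2n}(x)\,dx & \text{for } |x| \le \pi/2,\\[2mm] 0 & \text{for } |x| > \pi/2.\end{cases}$$

As in Example 3, it remains only to verify condition c) here. We remark first of
all that

$$\int_0^{\pi/2}\cos^{2n}x\,dx = \frac{1}{2}B\Bigl(n + \frac{1}{2}, \frac{1}{2}\Bigr) = \frac{1}{2}\,\frac{\Gamma\bigl(n + \frac{1}{2}\bigr)\Gamma\bigl(\frac{1}{2}\bigr)}{\Gamma(n + 1)} > \frac{\Gamma\bigl(\frac{1}{2}\bigr)}{2n} \quad (n \ge 2).$$

On the other hand, for $\varepsilon \in\ ]0, \pi/2[$

$$\int_\varepsilon^{\pi/2}\cos^{2n}x\,dx \le \int_\varepsilon^{\pi/2}\cos^{2n}\varepsilon\,dx < \frac{\pi}{2}(\cos\varepsilon)^{2n}.$$

Combining the two inequalities just obtained, we conclude that for every $\varepsilon \in\ ]0, \pi/2]$,

$$\int_\varepsilon^{\pi/2}\Delta_n(x)\,dx \to 0 \ \text{ as } n \to \infty,$$

from which it follows that condition c) of Definition 4 holds.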


Definition 5 The function $f : G \to \mathbb{C}$ is uniformly continuous on the set $E \subset G$ if
for every $\varepsilon > 0$ there exists $\rho > 0$ such that $|f(x) - f(y)| < \varepsilon$ for every $x \in E$ and
every $y \in G$ belonging to the $\rho$-neighborhood $U_G^{\rho}(x)$ of $x$ in $G$.

In particular, if $E = G$ we simply get back the definition of a function that is
uniformly continuous on its entire domain of definition.

We now prove a fundamental proposition.

Proposition 5 Let $f : \mathbb{R} \to \mathbb{C}$ be a bounded function and $\{\Delta_\alpha;\ \alpha \in A\}$ an approximate identity as $\alpha \to \omega$. If the convolution $f * \Delta_\alpha$ exists for every $\alpha \in A$ and the
function $f$ is uniformly continuous on the set $E \subset \mathbb{R}$, then

$$(f * \Delta_\alpha)(x) \rightrightarrows f(x) \ \text{ on } E \text{ as } \alpha \to \omega.$$

Thus it is asserted that the family of functions $f * \Delta_\alpha$ converges uniformly to
$f$ on a set $E$ on which it is uniformly continuous. In particular, if $E$ consists of
only one point, the condition of uniform continuity of $f$ on $E$ reduces to the condition that $f$ be continuous at $x$, and we find that $(f * \Delta_\alpha)(x) \to f(x)$ as $\alpha \to \omega$.
Previously this fact served as our motivation for writing relation (17.54).

Let us now prove Proposition 5. 

Proof Suppose $|f(x)| \le M$ on $\mathbb{R}$. Given a number $\varepsilon > 0$, we choose $\rho > 0$ in accordance with Definition 5 and denote the $\rho$-neighborhood of 0 in $\mathbb{R}$ by $U(0)$.

Taking account of the symmetry of convolution, we obtain the following
estimates, which hold simultaneously for all $x \in E$:

$$\bigl|(f * \Delta_\alpha)(x) - f(x)\bigr| = \Bigl|\int_{\mathbb{R}}f(x - y)\Delta_\alpha(y)\,dy - f(x)\Bigr| =$$

$$= \Bigl|\int_{\mathbb{R}}\bigl(f(x - y) - f(x)\bigr)\Delta_\alpha(y)\,dy\Bigr| \le$$

$$\le \int_{U(0)}|f(x - y) - f(x)|\Delta_\alpha(y)\,dy + \int_{\mathbb{R}\setminus U(0)}|f(x - y) - f(x)|\Delta_\alpha(y)\,dy <$$

$$< \varepsilon\int_{U(0)}\Delta_\alpha(y)\,dy + 2M\int_{\mathbb{R}\setminus U(0)}\Delta_\alpha(y)\,dy \le \varepsilon + 2M\int_{\mathbb{R}\setminus U(0)}\Delta_\alpha(y)\,dy.$$

As $\alpha \to \omega$, this last integral tends to zero, so that the inequality

$$\bigl|(f * \Delta_\alpha)(x) - f(x)\bigr| < 2\varepsilon$$

holds for all $x \in E$ from some point $\alpha_\varepsilon$ on. This completes the proof of Proposition 5. □
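The uniform convergence asserted in Proposition 5 can be observed numerically for the simplest approximate identity, the step functions $\delta_\alpha$, for which the convolution is the running average $\frac{1}{\alpha}\int_{x-\alpha}^{x}f(y)\,dy$. The sketch below is only an illustration: the choice $f = \sin$, the grid, and the helper names are assumptions, and the integral is approximated by the midpoint rule.

```python
import math


def step_average(f, x: float, alpha: float, steps: int = 400) -> float:
    """(f * delta_alpha)(x) = (1/alpha) * integral of f over [x - alpha, x], by the midpoint rule."""
    h = alpha / steps
    return sum(f(x - alpha + (i + 0.5) * h) for i in range(steps)) * h / alpha


def sup_error(f, alpha: float, grid) -> float:
    """Largest deviation |(f * delta_alpha)(x) - f(x)| over the sample points of the grid."""
    return max(abs(step_average(f, x, alpha) - f(x)) for x in grid)


grid = [i / 10 for i in range(-30, 31)]
alphas = (0.4, 0.2, 0.1, 0.05)
errors = [sup_error(math.sin, a, grid) for a in alphas]

if __name__ == "__main__":
    for a, e in zip(alphas, errors):
        print(f"alpha={a}: sup error on grid = {e:.5f}")
```

For this particular $f$ the error is roughly $\alpha/2$, so halving $\alpha$ halves the deviation on the whole grid at once.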


Corollary 1 Every continuous function of compact support on $\mathbb{R}$ can be uniformly
approximated by infinitely differentiable functions.

Proof Let us verify that $C_0^{(\infty)}$ is everywhere dense in $C_0$ in this sense.

We let, for example,

$$\varphi(x) = \begin{cases}k\cdot\exp\Bigl(-\dfrac{1}{1 - x^2}\Bigr) & \text{for } |x| < 1,\\[2mm] 0 & \text{for } |x| \ge 1,\end{cases}$$

where $k$ is chosen so that $\int_{\mathbb{R}}\varphi(x)\,dx = 1$.

The function $\varphi$ is of compact support and infinitely differentiable. In that case,
the family of infinitely differentiable functions $\Delta_\alpha = \frac{1}{\alpha}\varphi\bigl(\frac{x}{\alpha}\bigr)$, as observed in Example 2, is an approximate identity as $\alpha \to +0$. If $f \in C_0$, it is clear that $f * \Delta_\alpha \in C_0$.
Moreover, by Proposition 4 we have $f * \Delta_\alpha \in C_0^{(\infty)}$. Finally, it follows from Proposition 5 that $f * \Delta_\alpha \rightrightarrows f$ on $\mathbb{R}$ as $\alpha \to +0$. □

Remark 2 If the function $f \in C_0$ belongs to $C_0^{(m)}$, then for every value $n \in \{0, 1, \ldots, m\}$ we can guarantee that $(f * \Delta_\alpha)^{(n)} \rightrightarrows f^{(n)}$ on $\mathbb{R}$ as $\alpha \to +0$.

Proof Indeed, in this case $(f * \Delta_\alpha)^{(n)} = f^{(n)} * \Delta_\alpha$ (see Proposition 4 and Remark 1). All that now remains is to cite Corollary 1. □

Corollary 2 (The Weierstrass approximation theorem) Every continuous function
on a closed interval can be uniformly approximated on that interval by an algebraic
polynomial.

Proof Since polynomials map to polynomials under a linear change of variable
while the continuity and uniformity of the approximation of functions are preserved,
it suffices to verify Corollary 2 on any convenient interval $[a,b] \subset \mathbb{R}$. For that reason we shall assume $0 < a < b < 1$. We continue the given function $f \in C[a,b]$ to
a function $F$ that is continuous on $\mathbb{R}$ by setting $F(x) = 0$ for $x \in \mathbb{R}\setminus]0, 1[$ and, for
example, letting $F$ be a linear function going from 0 to $f(a)$ and from $f(b)$ to 0 on
the intervals $[0, a]$ and $[b, 1]$ respectively.

If we now take the approximate identity consisting of the functions $\Delta_n$ of Example 3, we can conclude from Proposition 5 that $F * \Delta_n \rightrightarrows f = F|_{[a,b]}$ on $[a,b]$ as
$n \to \infty$. But for $x \in [a,b] \subset [0,1]$ and $y \in [0,1]$ we have $|x - y| \le 1$, therefore

$$F * \Delta_n(x) := \int_{-\infty}^{\infty}F(y)\Delta_n(x - y)\,dy = \int_0^1 F(y)\Delta_n(x - y)\,dy =$$

$$= \int_0^1 F(y)\,p_n\cdot\bigl(1 - (x - y)^2\bigr)^n\,dy = \int_0^1 F(y)\Bigl(\sum_{k=0}^{2n}a_k(y)x^k\Bigr)dy = \sum_{k=0}^{2n}\Bigl(\int_0^1 F(y)a_k(y)\,dy\Bigr)x^k,$$

where $p_n := \bigl(\int_{|x|<1}(1 - x^2)^n\,dx\bigr)^{-1}$ is the normalizing factor of Example 3.

This last expression is a polynomial $P_{2n}(x)$ of degree $2n$ and we have shown that
$P_{2n} \rightrightarrows f$ on $[a,b]$ as $n \to \infty$. □
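The construction in this proof can be tried out numerically. The sketch below is an illustration under our own assumptions: the test function $f(x) = |x - \frac12|$ on $[a,b] = [\frac14, \frac34]$, its continuation `F`, the grid, and the helper name `landau_sup_error` are not from the text, and the integrals are approximated by the midpoint rule. It evaluates $P_{2n}(x) = \int_0^1 F(y)\,p_n(1 - (x-y)^2)^n\,dy$ and reports the largest deviation from $f$ on a grid in $[a,b]$.

```python
def F(y: float) -> float:
    """Continuation to R of f(x) = |x - 1/2| on [1/4, 3/4]: linear on [0, 1/4] and [3/4, 1], zero elsewhere."""
    if y <= 0.0 or y >= 1.0:
        return 0.0
    if y < 0.25:
        return y        # goes linearly from 0 up to f(a) = 1/4
    if y > 0.75:
        return 1.0 - y  # goes linearly from f(b) = 1/4 down to 0
    return abs(y - 0.5)


def landau_sup_error(func, n: int, grid, steps: int = 2000) -> float:
    """Largest value of |P_2n(x) - func(x)| on the grid, with P_2n(x) = int_0^1 func(y) p_n (1-(x-y)^2)^n dy."""
    h_full = 2.0 / steps
    mass = h_full * sum((1.0 - (-1.0 + (i + 0.5) * h_full) ** 2) ** n for i in range(steps))
    p_n = 1.0 / mass  # normalizing factor: 1 / integral of (1 - x^2)^n over [-1, 1]
    h = 1.0 / steps
    nodes = [(i + 0.5) * h for i in range(steps)]

    def poly(x: float) -> float:
        return p_n * h * sum(func(y) * (1.0 - (x - y) ** 2) ** n for y in nodes)

    return max(abs(poly(x) - func(x)) for x in grid)


grid = [0.25 + 0.05 * i for i in range(11)]
degrees = (10, 40, 160)
errors = [landau_sup_error(F, n, grid) for n in degrees]

if __name__ == "__main__":
    for n, e in zip(degrees, errors):
        print(f"n={n} (polynomial degree {2 * n}): sup error on grid = {e:.4f}")
```

The error decreases only like $1/\sqrt{n}$ near the corner of $f$, which illustrates that the theorem guarantees convergence but says nothing about its speed.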


Remark 3 By a slight extension of this reasoning one can show that the Weierstrass theorem remains valid if the interval $[a,b]$ is replaced by an arbitrary compact subset
of $\mathbb{R}$.


Remark 4 It is also not difficult to verify that for every open set $G$ in $\mathbb{R}$ and every
function $f \in C^{(m)}(G)$ there exists a sequence $\{P_k\}$ of polynomials such that $P_k^{(n)} \rightrightarrows f^{(n)}$
on every compact set $K \subset G$ for each $n \in \{0, 1, \ldots, m\}$ as $k \to \infty$.

If in addition the set $G$ is bounded and $f \in C^{(m)}(\overline{G})$, then one can even get
$P_k^{(n)} \rightrightarrows f^{(n)}$ on $\overline{G}$ as $k \to \infty$.

Remark 5 Just as the approximate identity of Example 3 was used in the proof of
Corollary 2, one can use the sequence from Example 4 to prove that every continuous $2\pi$-periodic function on $\mathbb{R}$ can be uniformly approximated by trigonometric polynomials of the form

$$T_n(x) = \sum_{k=0}^{n}a_k\cos kx + b_k\sin kx.$$

We have used only approximate identities made up of functions of compact support above. However, it should be kept in mind that approximate identities of functions that are not of compact support play an important role in many cases. We shall
give only two examples.

Example 5 The family of functions $\Delta_y(x) = \frac{1}{\pi}\cdot\frac{y}{x^2 + y^2}$ is an approximate identity on
$\mathbb{R}$ as $y \to +0$, since $\Delta_y > 0$ for $y > 0$,

$$\int_{-\infty}^{\infty}\Delta_y(x)\,dx = \frac{1}{\pi}\arctan\frac{x}{y}\Big|_{x=-\infty}^{+\infty} = 1,$$

and for every $\rho > 0$ we have

$$\int_{-\rho}^{\rho}\Delta_y(x)\,dx = \frac{2}{\pi}\arctan\frac{\rho}{y} \to 1,$$

when $y \to +0$.


If $f : \mathbb{R} \to \mathbb{R}$ is a bounded continuous function, then the function

$$u(x, y) = \frac{1}{\pi}\int_{-\infty}^{\infty}\frac{f(\xi)\,y}{(x - \xi)^2 + y^2}\,d\xi, \tag{17.55}$$

which is the convolution $f * \Delta_y$, is defined for all $x \in \mathbb{R}$ and $y > 0$.

As one can easily verify using the Weierstrass M-test, the integral (17.55), which
is called the Poisson integral for the half-plane, is a bounded infinitely differentiable
function in the half-plane $\mathbb{R}^2_+ = \{(x, y) \in \mathbb{R}^2 \mid y > 0\}$. Differentiating it under the
integral sign, we verify that for $y > 0$

$$\Delta u := \frac{\partial^2 u}{\partial x^2} + \frac{\partial^2 u}{\partial y^2} = f * \Bigl(\frac{\partial^2}{\partial x^2} + \frac{\partial^2}{\partial y^2}\Bigr)\Delta_y = 0,$$

that is, $u$ is a harmonic function.

By Proposition 5 one can also guarantee that $u(x, y)$ converges to $f(x)$ as $y \to 0$.
Thus, the integral (17.55) solves the problem of constructing a bounded function
that is harmonic in the half-plane $\mathbb{R}^2_+$ and assumes prescribed boundary values $f$ on
$\partial\mathbb{R}^2_+$.

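Example 6 The family of functions $\Delta_t = \frac{1}{2\sqrt{\pi t}}e^{-\frac{x^2}{4t}}$ is an approximate identity
on $\mathbb{R}$ as $t \to +0$. Indeed, we certainly have $\Delta_t > 0$ and $\int_{-\infty}^{\infty}\Delta_t(x)\,dx = 1$, since
$\int_{-\infty}^{\infty}e^{-v^2}\,dv = \sqrt{\pi}$ (the Euler-Poisson integral). Finally, for every $\rho > 0$ we have

$$\int_{-\rho}^{\rho}\frac{1}{2\sqrt{\pi t}}e^{-\frac{x^2}{4t}}\,dx = \frac{1}{\sqrt{\pi}}\int_{-\rho/(2\sqrt{t})}^{\rho/(2\sqrt{t})}e^{-v^2}\,dv \to 1 \ \text{ as } t \to +0.$$

If $f$ is a continuous and, for example, bounded function on $\mathbb{R}$, then the function

$$u(x, t) = \frac{1}{2\sqrt{\pi t}}\int_{-\infty}^{\infty}f(\xi)e^{-\frac{(x - \xi)^2}{4t}}\,d\xi, \tag{17.56}$$

which is the convolution $f * \Delta_t$, is obviously infinitely differentiable for $t > 0$.

By differentiating under the integral sign for $t > 0$, we find that

$$\frac{\partial u}{\partial t} - \frac{\partial^2 u}{\partial x^2} = f * \Bigl(\frac{\partial}{\partial t} - \frac{\partial^2}{\partial x^2}\Bigr)\Delta_t = 0,$$

that is, the function $u$ satisfies the one-dimensional heat equation with the initial
condition $u(x, 0) = f(x)$. This last equality should be interpreted as the limiting
relation $u(x, t) \to f(x)$ as $t \to +0$, which follows from Proposition 5.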
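Condition c) of Definition 4 for the family $\Delta_t$ can be checked numerically. In the sketch below (function names and the sample values of $\rho$ and $t$ are our own choices) the mass of the kernel over $[-\rho, \rho]$ is computed by the midpoint rule and compared with the closed form $\operatorname{erf}\bigl(\rho/(2\sqrt{t})\bigr)$, which follows from the substitution $x = 2\sqrt{t}\,v$.

```python
import math


def heat_kernel(x: float, t: float) -> float:
    """Delta_t(x) = exp(-x^2 / (4t)) / (2 sqrt(pi t)) for t > 0."""
    return math.exp(-x * x / (4.0 * t)) / (2.0 * math.sqrt(math.pi * t))


def mass_near_zero(rho: float, t: float, steps: int = 20_000) -> float:
    """Midpoint-rule value of the integral of Delta_t over [-rho, rho]."""
    h = 2.0 * rho / steps
    return h * sum(heat_kernel(-rho + (i + 0.5) * h, t) for i in range(steps))


rho = 0.5
times = (1.0, 0.1, 0.01)
masses = [mass_near_zero(rho, t) for t in times]

if __name__ == "__main__":
    for t, m in zip(times, masses):
        print(f"t={t}: mass on [-{rho}, {rho}] = {m:.6f}, erf value = {math.erf(rho / (2.0 * math.sqrt(t))):.6f}")
```

As $t$ decreases the mass inside any fixed neighborhood of zero tends to 1, which is exactly the concentration property used when Proposition 5 is applied to (17.56).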


17.4.4 * Elementary Concepts Involving Distributions 

a. Definition of Generalized Functions 

In Sect. 17.4.1 we derived the formula (17.48) on the heuristic level.
This equation enabled us to determine the response of a linear transformation $A$
to an input signal $f$ given that we know the system function $E$ of the device $A$.
In determining the system function of a device we made essential use of a certain
intuitive idea of a unit pulse action and the $\delta$-function that describes it. It is clear,
however, that the $\delta$-function is really not a function in the classical sense of the term,
since it must have the following properties, which contradict the classical point of
view: $\delta(x) \ge 0$ on $\mathbb{R}$; $\delta(x) = 0$ for $x \ne 0$, $\int_{\mathbb{R}}\delta(x)\,dx = 1$.

The concepts connected with linear operators, convolution, the $\delta$-function, and
the system function of a device acquire a precise mathematical description in the
so-called theory of generalized functions or the theory of distributions. We are now
going to explain the basic principles and the elementary, but ever more widely used
techniques of this theory.

Example 7 Consider a point mass m that can move along the axis and is attached to 
one end of an elastic spring whose other end is fixed at the origin; let Ic be the elastic 
constant of the spring. Suppose that a time-dependent force fit) begins to act on 
the point resting at the origin, moving it along the axis. By Newton’s law, 


mx. + kx = f, (17.57) 

where x(t) is the coordinate of the point (its displacement from its equilibrium po- 
sition) at time t. 

Under these conditions the function $x(t)$ is uniquely determined by the function $f$,
and the solution $x(t)$ of the differential equation (17.57) is obviously a
linear function of the right-hand side $f$. Thus we are dealing with the linear
operator $f \overset{A}{\longmapsto} x$ inverse to the differential operator
$x \overset{B}{\longmapsto} f$ (where $B = m\frac{d^2}{dt^2} + k$)
that connects $x(t)$ and $f(t)$ by the relation $Bx = f$. Since the operator $A$ obviously
commutes with translations over time, it follows from (17.48) that in order to find
the response $x(t)$ of this mechanical system to the function $f(t)$, it suffices to find
its response to a unit pulse $\delta$, that is, it suffices to know the so-called fundamental
solution $E$ of the equation


$$m\ddot{E} + kE = \delta. \tag{17.58}$$


Relation (17.58) would not raise any problems if $\delta$ actually denoted a function.
However, Eq. (17.58) is not yet clear. But being formally unclear is quite a different
thing from being actually false. In the present case one need only explain the
meaning of (17.58).

One route to such an explanation is already familiar to us: we can interpret $\delta$
as an approximate identity imitating the delta-function and consisting of classical
functions $\Delta_\alpha(t)$; we interpret $E$ as the limit to which the solution $E_\alpha(t)$ of the
equation

$$m\ddot{E}_\alpha + kE_\alpha = \Delta_\alpha \tag{17.57'}$$

tends as the parameter $\alpha$ changes suitably.
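
The limiting procedure just described can be tried numerically. The following is a minimal sketch under stated assumptions, not part of the original argument: the pulse $\Delta_\alpha$ is taken to be rectangular (equal to $1/\alpha$ on $[0, \alpha]$ and zero elsewhere, one possible approximate identity), the values $m = 1$, $k = 4$ are illustrative, and the function names are hypothetical. The response $E_\alpha(t)$ at a fixed time is compared with $\sin(\sqrt{k/m}\,t)/\sqrt{km}$, the motion of an oscillator that starts at the origin with velocity $1/m$.

```python
import math

def rk4_oscillator(x, v, force, m, k, h, steps):
    """Integrate m*x'' + k*x = force (force held constant) with classical RK4."""
    def acc(pos):
        return (force - k * pos) / m
    for _ in range(steps):
        k1x, k1v = v, acc(x)
        k2x, k2v = v + 0.5 * h * k1v, acc(x + 0.5 * h * k1x)
        k3x, k3v = v + 0.5 * h * k2v, acc(x + 0.5 * h * k2x)
        k4x, k4v = v + h * k3v, acc(x + h * k3x)
        x += h * (k1x + 2 * k2x + 2 * k3x + k4x) / 6
        v += h * (k1v + 2 * k2v + 2 * k3v + k4v) / 6
    return x, v

def response_to_pulse(alpha, t_end, m=1.0, k=4.0, steps=400):
    """E_alpha(t_end) for the pulse 1/alpha on [0, alpha]; the system is at rest before t = 0."""
    # Phase 1: constant force 1/alpha while the pulse acts.
    x, v = rk4_oscillator(0.0, 0.0, 1.0 / alpha, m, k, alpha / steps, steps)
    # Phase 2: free motion after the pulse has ended.
    x, v = rk4_oscillator(x, v, 0.0, m, k, (t_end - alpha) / steps, steps)
    return x

def impulse_response(t, m=1.0, k=4.0):
    """Motion started from x = 0 with velocity 1/m, for t >= 0."""
    return math.sin(math.sqrt(k / m) * t) / math.sqrt(k * m)

if __name__ == "__main__":
    for alpha in (0.1, 0.01, 0.001):
        print(alpha, abs(response_to_pulse(alpha, 1.0) - impulse_response(1.0)))
```

The printed differences shrink roughly in proportion to $\alpha$, which is the behaviour the approximate-identity interpretation of (17.58) leads one to expect.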

A second approach to this problem, one that has significant advantages, is to 
make a fundamental enlargement of the idea of a function. It proceeds from the 
remark that in general objects of observation are characterized by their interaction 
with other (“test”) objects. Thus we propose regarding a function not as a set of 
values at different points, but rather as an object that can act on other (test) objects 
in a certain manner. Let us try to make this statement, which as of now is too general, 
more specific. 

Example 8 Let $f \in C(\mathbb{R}, \mathbb{R})$. As our test functions, we choose functions in $C_0$
(continuous functions of compact support on $\mathbb{R}$). A function $f$ generates the following
functional, which acts on $C_0$:

$$\langle f, \varphi\rangle := \int_{\mathbb{R}} f(x)\varphi(x)\,dx. \tag{17.59}$$

Using approximate identities consisting of functions of compact support, one can
easily see that $\langle f, \varphi\rangle \equiv 0$ on $C_0$ if and only if $f(x) \equiv 0$ on $\mathbb{R}$.

Thus, each function $f \in C(\mathbb{R}, \mathbb{R})$ generates via (17.59) a linear functional
$A_f : C_0 \to \mathbb{R}$ and, we emphasize, different functionals $A_{f_1}$ and $A_{f_2}$ correspond
to different functions $f_1$ and $f_2$.

Hence formula (17.59) establishes an embedding (injective mapping) of the set
of functions $C(\mathbb{R}, \mathbb{R})$ into the set $\mathcal{L}(C_0; \mathbb{R})$ of linear functionals on $C_0$, and
consequently every function $f \in C(\mathbb{R}, \mathbb{R})$ can be interpreted as a certain functional
$A_f \in \mathcal{L}(C_0; \mathbb{R})$.

If we consider the class of locally integrable functions on $\mathbb{R}$ instead of the set
$C(\mathbb{R}, \mathbb{R})$ of continuous functions, we obtain by the same formula (17.59) a mapping
of this set into the space $\mathcal{L}(C_0; \mathbb{R})$. Moreover, ($\langle f, \varphi\rangle \equiv 0$ on $C_0$) $\Leftrightarrow$ ($f(x) = 0$ at all
points of continuity of $f$ on $\mathbb{R}$, that is, $f(x) = 0$ almost everywhere on $\mathbb{R}$). Hence in
this case we obtain an embedding of equivalence classes of functions into $\mathcal{L}(C_0; \mathbb{R})$
if each equivalence class contains locally integrable functions that differ only on a
set of measure zero.

Thus, the locally integrable functions $f$ on $\mathbb{R}$ (more precisely, equivalence
classes of such functions) can be interpreted via (17.59) as linear functionals
$A_f \in \mathcal{L}(C_0; \mathbb{R})$. The mapping $f \mapsto A_f = \langle f, \cdot\rangle$ provided by (17.59) of locally
integrable functions into $\mathcal{L}(C_0; \mathbb{R})$ is not a mapping onto all of $\mathcal{L}(C_0; \mathbb{R})$. Therefore,
interpreting functions as elements of $\mathcal{L}(C_0; \mathbb{R})$ (that is, as functionals) we obtain,
besides the classical functions interpreted as functionals of the form (17.59), also
new functions (functionals) that have no pre-image in the classical functions.

Example 9 The functional $\delta \in \mathcal{L}(C_0; \mathbb{R})$ is defined by the relation

$$\langle\delta, \varphi\rangle := \delta(\varphi) := \varphi(0), \tag{17.60}$$

which must hold for every function $\varphi \in C_0$.

We can verify (see Problem 7) that no locally integrable function $f$ on $\mathbb{R}$ can
represent the functional $\delta$ in the form (17.59).

Thus we have embedded the set of classical locally integrable functions into a 
larger set of linear functionals. These linear functionals are called generalized func- 
tions or distributions (a precise definition is given below). The widely used term 
“distribution” has its origin in physics. 

Example 10 Suppose a unit mass (or unit charge) is distributed on $\mathbb{R}$. If this
distribution is sufficiently regular, in the sense that it has, for example, a continuous
or integrable density $\rho(x)$ on $\mathbb{R}$, the interaction of the mass $M$ with other objects
described by functions $\varphi \in C_0^{(\infty)}$ can be defined as a functional

$$M(\varphi) = \int_{\mathbb{R}} \rho(x)\varphi(x)\,dx.$$

If the distribution is singular, for example, the whole mass $M$ is concentrated at a
single point, then by "smearing" the mass and interpreting the limiting point
situation using an approximate identity made up of regular distributions, we find that
the interaction of the mass $M$ with the other objects mentioned above should be
expressed by a formula

$$M(\varphi) = \varphi(0),$$

which shows that such a mass distribution on $\mathbb{R}$ should be identified with the
$\delta$-function (17.60) on $\mathbb{R}$.
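
The "smearing" argument of Example 10 can be illustrated numerically. The sketch below is only an illustration under assumptions of our own: the unit mass is smeared with a rescaled smooth bump density (the text does not prescribe a particular one), integrals are approximated by the midpoint rule, and all function names are hypothetical. It shows the regular functionals $M_\alpha(\varphi)$ approaching $\varphi(0)$ as the smearing width $\alpha$ shrinks.

```python
import math

def bump(x):
    """Smooth function with support [-1, 1]: exp(-1/(1 - x^2)) inside, 0 outside."""
    return math.exp(-1.0 / (1.0 - x * x)) if abs(x) < 1.0 else 0.0

def integrate(func, a, b, n=20000):
    """Composite midpoint rule on [a, b]."""
    h = (b - a) / n
    return h * sum(func(a + (i + 0.5) * h) for i in range(n))

BUMP_MASS = integrate(bump, -1.0, 1.0)  # normalizing constant for a unit total mass

def density(alpha):
    """Density of a unit mass smeared over [-alpha, alpha]."""
    return lambda x: bump(x / alpha) / (alpha * BUMP_MASS)

def regular_functional(rho, a, b):
    """The functional phi -> integral of rho*phi over [a, b] (rho vanishes outside [a, b])."""
    return lambda phi: integrate(lambda x: rho(x) * phi(x), a, b)

def delta(phi):
    """The singular functional phi -> phi(0)."""
    return phi(0.0)

if __name__ == "__main__":
    phi = lambda x: math.cos(x) * bump(x / 2.0)  # a test function supported in [-2, 2]
    for alpha in (1.0, 0.1, 0.01):
        M = regular_functional(density(alpha), -alpha, alpha)
        print(alpha, M(phi), delta(phi))
```

No single density reproduces `delta` exactly; only the limit of the functionals does, which is the reason for enlarging the notion of function.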

These preliminary considerations give some sense to the following general defi- 
nition. 

Definition 6 Let $P$ be a vector space of functions, which will be called the space of
test functions from now on, on which there is defined a notion of convergence.

The space of generalized functions or distributions on $P$ is the vector space $P'$
of continuous (real- or complex-valued) linear functionals on $P$. Here it is assumed
that each element $f \in P$ generates a certain functional $A_f = \langle f, \cdot\rangle \in P'$ and that the
mapping $f \mapsto A_f$ is a continuous embedding of $P$ into $P'$ if the convergence in $P'$
is introduced as weak ("pointwise") convergence of functionals, that is,

$$P' \ni A_n \to A \in P' \;:=\; \forall\varphi \in P \ \bigl(A_n(\varphi) \to A(\varphi)\bigr).$$

Let us make this definition more precise in the particular case when $P$ is the
vector space $C_0^{(\infty)}(G, \mathbb{C})$ of infinitely differentiable functions of compact support
in $G$, where $G$ is an arbitrary open subset of $\mathbb{R}$ (possibly $\mathbb{R}$ itself).

Definition 7 (The spaces $\mathcal{D}$ and $\mathcal{D}'$) We introduce convergence in $C_0^{(\infty)}(G, \mathbb{C})$ as
follows: A sequence $\{\varphi_n\}$ of functions $\varphi_n \in C_0^{(\infty)}(G)$ converges to $\varphi \in C_0^{(\infty)}(G, \mathbb{C})$
if there exists a compact set $K \subset G$ that contains the supports of all the functions of
the sequence $\{\varphi_n\}$ and $\varphi_n^{(m)} \rightrightarrows \varphi^{(m)}$ on $K$ (and hence also on $G$) as $n \to \infty$ for all
$m = 0, 1, 2, \ldots$.

The vector space obtained in this way with this convergence is usually denoted
$\mathcal{D}(G)$, and when $G = \mathbb{R}$, simply $\mathcal{D}$.

We denote the space of generalized functions (distributions) corresponding to
this space of basic (test) functions by $\mathcal{D}'(G)$ or $\mathcal{D}'$ respectively.

In this section and the one following we shall not consider any generalized functions
other than the elements of $\mathcal{D}'(G)$ just introduced. For that reason we shall use
the term distribution or generalized function to refer to elements of $\mathcal{D}'(G)$ without
saying so explicitly.

Definition 8 A distribution $F \in \mathcal{D}'(G)$ is regular if it can be represented as

$$F(\varphi) = \int_G f(x)\varphi(x)\,dx, \quad \varphi \in \mathcal{D}(G),$$

where $f$ is a locally integrable function in $G$.

Nonregular distributions will be called singular distributions or singular
generalized functions.

In accordance with this definition the $\delta$-function of Example 9 is a singular
generalized function.

The action of a generalized function (distribution) $F$ on a test function $\varphi$, that
is, the pairing of $F$ and $\varphi$, will be denoted, as before, by either of the equivalent
expressions $F(\varphi)$ or $\langle F, \varphi\rangle$.

Before passing to the technical machinery connected with generalized functions, 
which was our motive for defining them, we note that the concept of a general- 
ized function, like the majority of mathematical concepts, had a certain period of 
gestation, during which it developed implicitly in the work of a number of mathe- 
maticians. 

Physicists, following Dirac, made active use of the $\delta$-function as early as the late
1920s and early 1930s and operated with singular generalized functions without
worrying about the absence of the necessary mathematical theory.

The idea of a generalized function was stated explicitly by S.L. Sobolev,⁵ who
laid the mathematical foundations of the theory of generalized functions in the
mid-1930s. The current state of the machinery of the theory of distributions was largely
the work of L. Schwartz.⁶ What has just been said explains why, for example, the
space $\mathcal{D}'$ of generalized functions is often referred to as the Sobolev-Schwartz space
of generalized functions.

We shall now explain certain elements of the machinery of the theory of dis- 
tributions. The development and extension of the use of this machinery continues 
even today, mainly in connection with the requirements of the theory of differen- 
tial equations, the equations of mathematical physics, functional analysis, and their 
applications. 

To simplify the notation we shall consider below only generalized functions in
$\mathcal{D}'$, although all of their properties, as will be seen from their definitions and proofs,
remain valid for distributions of any class $\mathcal{D}'(G)$, where $G$ is an arbitrary open
subset of $\mathbb{R}$.

Operations with distributions are defined by starting with the integral relations 
that are valid for classical functions, that is, for regular generalized functions. 


b. Multiplication of a Distribution by a Function 

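If $f$ is a locally integrable function on $\mathbb{R}$ and $g \in C^{(\infty)}$, then for any function
$\varphi \in C_0^{(\infty)}$, on the one hand $g\varphi \in C_0^{(\infty)}$ and, on the other hand, we have the obvious
equality

$$\int_{\mathbb{R}} (f \cdot g)(x)\varphi(x)\,dx = \int_{\mathbb{R}} f(x)(g \cdot \varphi)(x)\,dx$$

or, in other notation

$$\langle f \cdot g, \varphi\rangle = \langle f, g \cdot \varphi\rangle.$$

This relation, which is valid for regular generalized functions, provides the basis
for the following definition of the distribution $F \cdot g$ obtained by multiplying the
distribution $F \in \mathcal{D}'$ by the function $g \in C^{(\infty)}$:

$$\langle F \cdot g, \varphi\rangle := \langle F, g \cdot \varphi\rangle. \tag{17.61}$$

The right-hand side of Eq. (17.61) is defined, and thus defines the value of the
functional $F \cdot g$ on any function $\varphi \in \mathcal{D}$, that is, the functional $F \cdot g$ itself is defined.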


5 S.L. Sobolev (1908-1989) - one of the most prominent Soviet mathematicians. 

6 L. Schwartz (1915-2002)- well-known French mathematician. He was awarded the Fields medal, 
a prize for young mathematicians, at the International Congress of Mathematicians in 1950 for the 
above mentioned work.

Example 11 Let us see how the distribution $\delta \cdot g$ acts, where $g \in C^{(\infty)}$. In
accordance with the definition (17.61) and the definition of $\delta$, we obtain

$$\langle\delta \cdot g, \varphi\rangle := \langle\delta, g \cdot \varphi\rangle := (g \cdot \varphi)(0) = g(0) \cdot \varphi(0).$$


c. Differentiation of Generalized Functions 


If $f \in C^{(1)}$ and $\varphi \in C_0^{(\infty)}$, integration by parts yields the equality

$$\int_{\mathbb{R}} f'(x)\varphi(x)\,dx = -\int_{\mathbb{R}} f(x)\varphi'(x)\,dx. \tag{17.62}$$

This equality is the point of departure for the following fundamental definition
of differentiation of a generalized function $F \in \mathcal{D}'$:

$$\langle F', \varphi\rangle := -\langle F, \varphi'\rangle. \tag{17.63}$$

Example 12 If $f \in C^{(1)}$, the derivative of $f$ in the classical sense equals its derivative
in the distribution sense (provided, naturally, the classical function is identified
with the regular generalized function corresponding to it). This follows from a
comparison of relations (17.62) and (17.63), in which the right-hand sides are equal if
the distribution $F$ is generated by the function $f$.


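Example 13 Take the Heaviside⁷ function

$$H(x) = \begin{cases} 0 & \text{for } x < 0, \\ 1 & \text{for } x \ge 0, \end{cases}$$

sometimes called the unit step. Regarding it as a generalized function, let us find the
derivative $H'$ of this function, which is discontinuous in the classical sense.

From the definition of the regular generalized function $H$ corresponding to the
Heaviside function and relation (17.63) we find

$$\langle H', \varphi\rangle := -\langle H, \varphi'\rangle := -\int_{-\infty}^{+\infty} H(x)\varphi'(x)\,dx = -\int_0^{+\infty} \varphi'(x)\,dx = \varphi(0),$$

since $\varphi \in C_0^{(\infty)}$. Thus $\langle H', \varphi\rangle = \langle\delta, \varphi\rangle$ for every function $\varphi \in C_0^{(\infty)}$. Hence $H' = \delta$.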


Example 14 Let us compute $\langle\delta', \varphi\rangle$:

$$\langle\delta', \varphi\rangle := -\langle\delta, \varphi'\rangle = -\varphi'(0).$$
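
Definitions (17.61) and (17.63) translate almost literally into code if a distribution is modelled as a callable acting on test functions. The following is a minimal sketch under assumptions of our own: $\varphi'$ is replaced by a central difference, integrals by the midpoint rule, test functions are assumed to vanish outside $[-2, 2]$, and all names are hypothetical. It reproduces the results of Examples 11, 13, and 14 for one sample test function.

```python
import math

def bump(x):
    """Test function exp(-1/(1 - x^2)) on ]-1, 1[, zero elsewhere."""
    return math.exp(-1.0 / (1.0 - x * x)) if abs(x) < 1.0 else 0.0

def phi(x):
    """Sample test function with phi(0) = 1/e and phi'(0) = 2/e."""
    return (1.0 + 2.0 * x) * bump(x)

def derivative(psi, h=1e-5):
    """Central-difference stand-in for psi' (a numerical assumption of this sketch)."""
    return lambda x: (psi(x + h) - psi(x - h)) / (2.0 * h)

def integrate(func, a, b, n=20000):
    """Composite midpoint rule on [a, b]."""
    h = (b - a) / n
    return h * sum(func(a + (i + 0.5) * h) for i in range(n))

def delta(psi):
    """<delta, psi> := psi(0), relation (17.60)."""
    return psi(0.0)

def heaviside(psi):
    """Regular distribution generated by H; psi is assumed to vanish outside [-2, 2]."""
    return integrate(psi, 0.0, 2.0)

def multiply(F, g):
    """<F*g, psi> := <F, g*psi>, relation (17.61)."""
    return lambda psi: F(lambda x: g(x) * psi(x))

def differentiate(F):
    """<F', psi> := -<F, psi'>, relation (17.63)."""
    return lambda psi: -F(derivative(psi))

if __name__ == "__main__":
    print(multiply(delta, math.cos)(phi), math.cos(0.0) * phi(0.0))  # Example 11
    print(differentiate(heaviside)(phi), delta(phi))                 # Example 13
    print(differentiate(delta)(phi), -2.0 / math.e)                  # Example 14
```

The point of the design is that `differentiate` never touches the distribution itself: all differentiation is transferred to the test function, exactly as in (17.63).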


7 O. Heaviside (1850-1925) - British physicist and engineer, who developed on the symbolic level 
the important mathematical machinery known as the operational calculus.

It is natural that in the theory of generalized functions, as in the theory of classical
functions, the higher-order derivatives are defined by setting $F^{(n+1)} := (F^{(n)})'$.
Comparing the results of the last two examples, one can consequently write

$$\langle H'', \varphi\rangle = -\varphi'(0).$$


Example 15 Let us show that $\langle\delta^{(n)}, \varphi\rangle = (-1)^n \varphi^{(n)}(0)$.

Proof For $n = 0$ this is the definition of the $\delta$-function.

We have seen in Example 14 that this equality holds for $n = 1$.

We now prove it by induction, assuming that it has been established for a fixed
value $n \in \mathbb{N}$. Using definition (17.63), we find

$$\langle\delta^{(n+1)}, \varphi\rangle := \langle(\delta^{(n)})', \varphi\rangle := -\langle\delta^{(n)}, \varphi'\rangle = -(-1)^n (\varphi')^{(n)}(0) = (-1)^{n+1}\varphi^{(n+1)}(0). \qquad \Box$$

Example 16 Suppose the function $f : \mathbb{R} \to \mathbb{C}$ is continuously differentiable for
$x < 0$ and for $x > 0$, and suppose the one-sided limits $f(-0)$ and $f(+0)$ of the
function exist at 0. We denote the quantity $f(+0) - f(-0)$, the saltus or jump of
the function at 0, by $\rfloor f(0)$, and by $f'$ and $\{f'\}$ respectively the derivative of $f$ in
the distribution sense and the distribution defined by the function equal to the usual
derivative of $f$ for $x < 0$ and $x > 0$. At $x = 0$ this last function is not defined, but
that is not important for the integral through which it defines the regular
distribution $\{f'\}$.

In Example 12 we noted that if $f \in C^{(1)}$, then $f' = \{f'\}$. We shall show that in
general this is not the case, but rather the following important formula holds:

$$f' = \{f'\} + \rfloor f(0) \cdot \delta. \tag{17.64}$$


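Proof Indeed,

$$
\begin{aligned}
\langle f', \varphi\rangle &= -\langle f, \varphi'\rangle = -\int_{-\infty}^{+\infty} f(x)\varphi'(x)\,dx
 = -\Bigl(\int_{-\infty}^{0} + \int_{0}^{+\infty}\Bigr)\bigl(f(x)\varphi'(x)\bigr)\,dx = \\
&= -\Bigl((f \cdot \varphi)(x)\Big|_{-\infty}^{0} - \int_{-\infty}^{0} f'(x)\varphi(x)\,dx
 + (f \cdot \varphi)(x)\Big|_{0}^{+\infty} - \int_{0}^{+\infty} f'(x)\varphi(x)\,dx\Bigr) = \\
&= \bigl(f(+0) - f(-0)\bigr)\varphi(0) + \int_{-\infty}^{+\infty} f'(x)\varphi(x)\,dx = \\
&= \langle\rfloor f(0) \cdot \delta, \varphi\rangle + \langle\{f'\}, \varphi\rangle. \qquad \Box
\end{aligned}
$$

If all derivatives up to order $m$ of the function $f : \mathbb{R} \to \mathbb{C}$ exist on the intervals
$x < 0$ and $x > 0$, and they are continuous and have one-sided limits at $x = 0$, then,
repeating the reasoning used to derive (17.64), we obtain

$$f^{(m)} = \{f^{(m)}\} + \rfloor f(0) \cdot \delta^{(m-1)} + \rfloor f'(0) \cdot \delta^{(m-2)} + \cdots + \rfloor f^{(m-1)}(0) \cdot \delta. \tag{17.65}$$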
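
Formula (17.64) can be checked numerically on a concrete function with a jump. The sketch below is an illustration only, under assumptions of our own choosing: $f(x) = \sin x$ for $x < 0$ and $f(x) = 2 + \cos x$ for $x > 0$ (so the jump at 0 equals 3), the test function is the standard bump supported in $[-1, 1]$, and integrals are approximated by the midpoint rule. It compares $-\langle f, \varphi'\rangle$ with the pairing of the classical derivative plus the jump term.

```python
import math

def bump(x):
    """Test function exp(-1/(1 - x^2)) on ]-1, 1[, zero elsewhere."""
    return math.exp(-1.0 / (1.0 - x * x)) if abs(x) < 1.0 else 0.0

def bump_prime(x):
    """Classical derivative of bump: bump(x) * (-2x) / (1 - x^2)^2."""
    if abs(x) >= 1.0:
        return 0.0
    return bump(x) * (-2.0 * x) / (1.0 - x * x) ** 2

def integrate(func, a, b, n=20000):
    """Composite midpoint rule on [a, b]; the endpoints themselves are never sampled."""
    h = (b - a) / n
    return h * sum(func(a + (i + 0.5) * h) for i in range(n))

def f(x):
    return math.sin(x) if x < 0 else 2.0 + math.cos(x)

def f_classical_derivative(x):
    return math.cos(x) if x < 0 else -math.sin(x)

JUMP = 3.0  # f(+0) - f(-0) = (2 + cos 0) - sin 0

def distributional_side():
    """<f', phi> := -<f, phi'>, integrating separately on each side of 0."""
    g = lambda x: f(x) * bump_prime(x)
    return -(integrate(g, -1.0, 0.0) + integrate(g, 0.0, 1.0))

def classical_plus_jump_side():
    """<{f'}, phi> + jump * phi(0), the right-hand side of (17.64)."""
    g = lambda x: f_classical_derivative(x) * bump(x)
    return integrate(g, -1.0, 0.0) + integrate(g, 0.0, 1.0) + JUMP * bump(0.0)

if __name__ == "__main__":
    print(distributional_side(), classical_plus_jump_side())
```

Dropping the term `JUMP * bump(0.0)` makes the two sides disagree by $3/e$, which is precisely the contribution of the $\delta$-term.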

We now exhibit some properties of the operation of differentiation of generalized 
functions. 

Proposition 6 a) Every generalized function $F \in \mathcal{D}'$ is infinitely differentiable.

b) The differentiation operation $D : \mathcal{D}' \to \mathcal{D}'$ is linear.

c) If $F \in \mathcal{D}'$ and $g \in C^{(\infty)}$, then $(F \cdot g) \in \mathcal{D}'$, and the Leibniz formula holds:

$$(F \cdot g)^{(m)} = \sum_{k=0}^{m} \binom{m}{k} F^{(k)} \cdot g^{(m-k)}.$$

d) The differentiation operation $D : \mathcal{D}' \to \mathcal{D}'$ is continuous.

e) If the series $\sum_{k=1}^{\infty} f_k(x) = S(x)$ formed from locally integrable functions
$f_k : \mathbb{R} \to \mathbb{C}$ converges uniformly on each compact subset of $\mathbb{R}$, then it can be
differentiated termwise any number of times in the sense of generalized functions, and
the series so obtained will converge in $\mathcal{D}'$.

Proof a) $\langle F^{(m)}, \varphi\rangle := -\langle F^{(m-1)}, \varphi'\rangle := (-1)^m \langle F, \varphi^{(m)}\rangle$.

b) Obvious.

c) Let us verify the formula for $m = 1$:

$$
\begin{aligned}
\langle(F \cdot g)', \varphi\rangle &:= -\langle Fg, \varphi'\rangle := -\langle F, g \cdot \varphi'\rangle = -\langle F, (g \cdot \varphi)' - g' \cdot \varphi\rangle = \\
&= \langle F', g\varphi\rangle + \langle F, g' \cdot \varphi\rangle = \langle F' \cdot g, \varphi\rangle + \langle F \cdot g', \varphi\rangle = \langle F' \cdot g + F \cdot g', \varphi\rangle.
\end{aligned}
$$

In the general case we can obtain the formula by induction.

d) Let $F_m \to F$ in $\mathcal{D}'$ as $m \to \infty$, that is, for every function $\varphi \in \mathcal{D}$, $\langle F_m, \varphi\rangle \to
\langle F, \varphi\rangle$ as $m \to \infty$. Then

$$\langle F_m', \varphi\rangle := -\langle F_m, \varphi'\rangle \to -\langle F, \varphi'\rangle =: \langle F', \varphi\rangle.$$

e) Under these conditions the sum $S(x)$ of the series, being the uniform limit
of locally integrable functions $S_m(x) = \sum_{k=1}^{m} f_k(x)$ on compact sets, is locally
integrable. It remains to observe that for every function $\varphi \in \mathcal{D}$ (that is, of compact
support and infinitely differentiable) we have the relation

$$\langle S_m, \varphi\rangle = \int_{\mathbb{R}} S_m(x)\varphi(x)\,dx \to \int_{\mathbb{R}} S(x)\varphi(x)\,dx = \langle S, \varphi\rangle.$$

We now conclude on the basis of what was proved in d) that $S_m' \to S'$ as $m \to \infty$. $\Box$

We see that the operation of differentiation of generalized functions retains the
most important properties of classical differentiation while acquiring a number of
remarkable new properties that open up a great deal of freedom of operation, which
did not exist in the classical case because of the presence of nondifferentiable
functions there and the instability (lack of continuity) of classical differentiation under
limiting processes.

d. Fundamental Solutions and Convolution 

We began this subsection with intuitive ideas of the unit pulse and the system func- 
tion of the device. In Example 7 we exhibited an elementary mechanical system that 
naturally generates a linear operator preserving time shifts. Studying it, we arrived 
at Eq. (17.58), which the system function E of that operator must satisfy. 

We shall conclude this subsection by returning once again to these questions, 
but now with the goal of illustrating an adequate mathematical description in the 
language of generalized functions. 

We begin by making sense of Eq. (17.58). On its right-hand side is the generalized
function $\delta$, so that relation (17.58) should be interpreted as equality of
generalized functions. Since we know the operations of differentiating generalized
functions and linear operations on distributions, it follows that the left-hand side of
Eq. (17.58) is now also comprehensible, even if interpreted in the sense of generalized
functions.

Let us now attempt to solve Eq. (17.58). 

At times $t < 0$ the system was in a state of rest. At $t = 0$ the point received a unit
pulse, thereby acquiring a velocity $v = v(0)$ such that $mv = 1$. For $t > 0$ there are
no external forces acting on the system, and its law of motion $x = x(t)$ is subject to
the usual differential equation

$$m\ddot{x} + kx = 0, \tag{17.66}$$

which is to be solved with the initial conditions $x(0) = 0$, $\dot{x}(0) = v = 1/m$.
Such a solution is unique and can be written out immediately:

$$x(t) = \frac{1}{\sqrt{km}} \sin\sqrt{\frac{k}{m}}\,t, \quad t \ge 0.$$

Since in the present case the system is at rest for $t < 0$, we can conclude that

$$E(t) = \frac{H(t)}{\sqrt{km}} \sin\sqrt{\frac{k}{m}}\,t, \quad t \in \mathbb{R}, \tag{17.67}$$

where $H$ is the Heaviside function (see Example 13).

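Let us now verify, using the rules for differentiating generalized functions and the
results of the examples studied above, that the function $E(t)$ defined by Eq. (17.67)
satisfies Eq. (17.58).

To simplify the writing we shall verify that the function

$$e(x) = H(x)\frac{\sin\omega x}{\omega} \tag{17.68}$$

satisfies (in the sense of distribution theory) the equation

$$\Bigl(\frac{d^2}{dx^2} + \omega^2\Bigr)e = \delta. \tag{17.69}$$

Indeed,

$$
\begin{aligned}
\Bigl(\frac{d^2}{dx^2} + \omega^2\Bigr)e &= \frac{d^2}{dx^2}\Bigl(H\frac{\sin\omega x}{\omega}\Bigr) + \omega^2 H\frac{\sin\omega x}{\omega} = \\
&= H''\frac{\sin\omega x}{\omega} + 2H'\cos\omega x - \omega H(x)\sin\omega x + \omega H(x)\sin\omega x = \\
&= \delta'\frac{\sin\omega x}{\omega} + 2\delta\cos\omega x.
\end{aligned}
$$

Further, for every function $\varphi \in \mathcal{D}$,

$$
\begin{aligned}
\Bigl\langle\delta'\frac{\sin\omega x}{\omega} + 2\delta\cos\omega x, \varphi\Bigr\rangle
&= \Bigl\langle\delta', \frac{\sin\omega x}{\omega}\varphi\Bigr\rangle + \bigl\langle\delta, 2(\cos\omega x)\varphi\bigr\rangle = \\
&= -\Bigl\langle\delta, \frac{d}{dx}\Bigl(\frac{\sin\omega x}{\omega}\varphi\Bigr)\Bigr\rangle + 2\varphi(0) = \\
&= -\Bigl((\cos\omega x)\varphi(x) + \frac{\sin\omega x}{\omega}\varphi'(x)\Bigr)\Big|_{x=0} + 2\varphi(0) = \\
&= \varphi(0) = \langle\delta, \varphi\rangle,
\end{aligned}
$$

and it is thereby verified that the function (17.68) satisfies (17.69).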
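
The same verification can be repeated numerically by moving both derivatives onto the test function: $\langle e'' + \omega^2 e, \varphi\rangle = \langle e, \varphi'' + \omega^2\varphi\rangle$, which should equal $\varphi(0)$. The sketch below is an illustration under assumptions of our own: the test function is the standard bump supported in $[-1, 1]$ with its second derivative written out by hand, $\omega = 3$ is an arbitrary illustrative value, and the integral is approximated by the midpoint rule.

```python
import math

OMEGA = 3.0  # illustrative value, not taken from the text

def bump(x):
    """Test function exp(-1/(1 - x^2)) on ]-1, 1[, zero elsewhere."""
    return math.exp(-1.0 / (1.0 - x * x)) if abs(x) < 1.0 else 0.0

def bump_second(x):
    """Classical second derivative of bump = exp(g), g = -1/(1 - x^2): bump * (g'^2 + g'')."""
    if abs(x) >= 1.0:
        return 0.0
    s = 1.0 - x * x
    g1 = -2.0 * x / s ** 2                      # g'
    g2 = -2.0 / s ** 2 - 8.0 * x * x / s ** 3   # g''
    return bump(x) * (g1 * g1 + g2)

def integrate(func, a, b, n=20000):
    """Composite midpoint rule on [a, b]."""
    h = (b - a) / n
    return h * sum(func(a + (i + 0.5) * h) for i in range(n))

def pairing(omega=OMEGA):
    """<e'' + omega^2 e, phi> = <e, phi'' + omega^2 phi> for e = H(x) sin(omega x)/omega."""
    integrand = lambda x: math.sin(omega * x) / omega * (bump_second(x) + omega ** 2 * bump(x))
    return integrate(integrand, 0.0, 1.0)  # H(x) restricts the integral to x > 0

if __name__ == "__main__":
    print(pairing(), bump(0.0))
```

Both printed numbers should agree with $\varphi(0) = 1/e$ up to quadrature error, in accordance with (17.69).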

Finally, we introduce the following definition. 

Definition 9 A fundamental solution or Green's function (system function or
influence function) of the operator $A : \mathcal{D}' \to \mathcal{D}'$ is a generalized function $E \in \mathcal{D}'$ that is
mapped by $A$ to the function $\delta \in \mathcal{D}'$, that is, $A(E) = \delta$.

Example 17 In accordance with this definition the function (17.68) is a fundamental
solution for the operator $A = \bigl(\frac{d^2}{dx^2} + \omega^2\bigr)$, since it satisfies (17.69).

The function (17.67) satisfies Eq. (17.58), that is, it is a Green's function for
the operator $A = \bigl(m\frac{d^2}{dt^2} + k\bigr)$. The fundamental role of the system function of a
translation-invariant operator has already been discussed in Sect. 17.4.1, where
formula (17.48) was obtained, on the basis of which one can now write the solution of
Eq. (17.57) corresponding to the initial conditions given in Example 7:

$$x(t) = (f * E)(t) = \int_{-\infty}^{+\infty} f(t - \tau)H(\tau)\frac{\sin\sqrt{k/m}\,\tau}{\sqrt{km}}\,d\tau, \tag{17.70}$$

$$x(t) = \frac{1}{\sqrt{km}}\int_0^{+\infty} f(t - \tau)\sin\sqrt{\frac{k}{m}}\,\tau\,d\tau. \tag{17.71}$$
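
Formula (17.71) is easy to evaluate numerically. The following minimal sketch assumes a force that vanishes for negative times (so the integral effectively stops at $\tau = t$), uses the midpoint rule, and takes $m = 2$, $k = 8$ as illustrative values; the function name is hypothetical. For a constant force $F_0$ switched on at $t = 0$, the elementary solution of (17.57) with zero initial data is $x(t) = (F_0/k)(1 - \cos\sqrt{k/m}\,t)$, which gives an independent check.

```python
import math

def response(f, t, m, k, n=20000):
    """x(t) = (1/sqrt(km)) * int_0^t f(t - tau) * sin(sqrt(k/m) * tau) dtau, cf. (17.71).

    Assumes f(s) = 0 for s < 0, so the integrand vanishes for tau > t.
    """
    if t <= 0.0:
        return 0.0
    omega = math.sqrt(k / m)
    h = t / n
    total = sum(f(t - (i + 0.5) * h) * math.sin(omega * (i + 0.5) * h) for i in range(n))
    return h * total / math.sqrt(k * m)

def step_force(s, magnitude=1.0):
    """Constant force switched on at time 0."""
    return magnitude if s >= 0.0 else 0.0

if __name__ == "__main__":
    m, k = 2.0, 8.0
    for t in (0.5, 1.3, 4.0):
        exact = (1.0 - math.cos(math.sqrt(k / m) * t)) / k
        print(t, response(step_force, t, m, k), exact)
```

Any other force profile can be passed in place of `step_force`; only the fundamental solution enters the formula, which is the practical content of (17.48).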

When we take account of the important role of the convolution and the funda- 
mental solution just illustrated, it becomes clear that it is desirable to define the 
convolution of generalized functions also. This is done in the theory of distribu- 
tions, but we shall not take the time to do so. We note only that in the case of regular 
distributions the definition of the convolution of generalized functions is equivalent 
to the classical definition of the convolution of functions studied above. 


17.4.5 Problems and Exercises 


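1. a) Verify that convolution is associative: $u * (v * w) = (u * v) * w$.

b) Suppose, as always, that $\Gamma(\alpha)$ is the Euler gamma function and $H(x)$ is the
Heaviside function. We set

$$H_\lambda^\alpha(x) := H(x)\frac{x^{\alpha-1}}{\Gamma(\alpha)}e^{\lambda x}, \quad \text{where } \alpha > 0 \text{ and } \lambda \in \mathbb{C}.$$

Show that $H_\lambda^\alpha * H_\lambda^\beta = H_\lambda^{\alpha+\beta}$.

c) Verify that the function $F = H(x)\frac{x^{n-1}}{(n-1)!}e^{\lambda x}$ is the $n$th convolution power of
$f = H(x)e^{\lambda x}$, that is, $F = \underbrace{f * f * \cdots * f}_{n}$.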

2. The function $G_\sigma(x) = \frac{1}{\sigma\sqrt{2\pi}}e^{-\frac{x^2}{2\sigma^2}}$, $\sigma > 0$, defines the probability density
function for the Gaussian normal distribution.

a) Draw the graph of $G_\sigma(x)$ for different values of the parameter $\sigma$.

b) Verify that the mathematical expectation (mean value) of a random variable
with the probability distribution $G_\sigma$ is zero, that is, $\int_{\mathbb{R}} xG_\sigma(x)\,dx = 0$.

c) Verify that the standard deviation of $x$ (the square root of the variance of $x$)
is $\sigma$, that is, $\bigl(\int_{\mathbb{R}} x^2 G_\sigma(x)\,dx\bigr)^{1/2} = \sigma$.

d) It is proved in probability theory that the probability density of the sum of two
independent random variables is the convolution of the densities of the individual
variables. Verify that $G_\alpha * G_\beta = G_{\sqrt{\alpha^2 + \beta^2}}$.

e) Show that the sum of $n$ independent identically distributed random variables
(for example, $n$ independent measurements of the same object), all distributed
according to the normal law $G_\sigma$, is distributed according to the law $G_{\sigma\sqrt{n}}$. From this it
follows in particular that the expected order of errors for the average of $n$ such
measurements, when taken as the value of the measured quantity, equals $\sigma/\sqrt{n}$, where
$\sigma$ is the probable error of an individual measurement.
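
Before proving the identity in part d) of Problem 2, one can test it numerically. The sketch below is only a sanity check, not a solution: it approximates the convolution integral by the midpoint rule on a truncated interval (the half-width 12 and the values $\alpha = 1$, $\beta = 1.5$ are assumptions chosen for illustration) and compares it with $G_{\sqrt{\alpha^2+\beta^2}}$.

```python
import math

def gauss(sigma, x):
    """Normal density G_sigma(x) with mean zero and standard deviation sigma."""
    return math.exp(-x * x / (2.0 * sigma * sigma)) / (sigma * math.sqrt(2.0 * math.pi))

def convolve_at(alpha, beta, x, half_width=12.0, n=4000):
    """Midpoint approximation of (G_alpha * G_beta)(x) = int G_alpha(y) G_beta(x - y) dy."""
    h = 2.0 * half_width / n
    total = 0.0
    for i in range(n):
        y = -half_width + (i + 0.5) * h
        total += gauss(alpha, y) * gauss(beta, x - y)
    return h * total

if __name__ == "__main__":
    alpha, beta = 1.0, 1.5
    sigma = math.sqrt(alpha ** 2 + beta ** 2)
    for x in (0.0, 0.7, 2.0):
        print(x, convolve_at(alpha, beta, x), gauss(sigma, x))
```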

3. We recall that the function $A(z) = \sum_{n=0}^{\infty} a_n z^n$ is called the generating function
of the sequence $a_0, a_1, \ldots$.

Suppose given two sequences $\{a_k\}$ and $\{b_k\}$. If we assume that $a_k = b_k = 0$ for
$k < 0$, then the convolution of the sequences $\{a_k\}$ and $\{b_k\}$ can be naturally defined
as the sequence $\{c_k = \sum_m a_m b_{k-m}\}$. Show that the generating function of the
convolution of two sequences equals the product of the generating functions of these
sequences.
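
For finite sequences the statement of Problem 3 is the familiar rule for multiplying polynomials, and it can be tried on a small example. The sketch below is a check on one instance, not a proof; the function names and the sample sequences are illustrative choices.

```python
def convolve(a, b):
    """c_k = sum over m of a_m * b_{k-m} for finite sequences indexed from 0."""
    c = [0] * (len(a) + len(b) - 1)
    for i, a_i in enumerate(a):
        for j, b_j in enumerate(b):
            c[i + j] += a_i * b_j
    return c

def generating_function(coeffs, z):
    """Horner evaluation of sum coeffs[n] * z**n (finite sequences only)."""
    value = 0
    for coeff in reversed(coeffs):
        value = value * z + coeff
    return value

if __name__ == "__main__":
    a, b, z = [1, 2, 3], [4, 0, -1, 5], 3
    c = convolve(a, b)
    print(c)
    print(generating_function(c, z), generating_function(a, z) * generating_function(b, z))
```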

4. a) Verify that if the convolution u * v is defined and one of the functions u and 
v is periodic with period T, then u*v is also a function of period T. 

b) Prove the Weierstrass theorem on approximation of a continuous periodic 
function by a trigonometric polynomial (see Remark 5). 

c) Prove the strengthened versions of the Weierstrass approximation theorem 
given in Example 4. 

5. a) Suppose the interior of the compact set $K \subset \mathbb{R}$ contains the closure $\overline{E}$ of the
set $E$ in Proposition 5. Show that in that case $\int_K f(y)\Delta_k(x - y)\,dy \rightrightarrows f(x)$ on $E$.

b) From the expansion $(1 - z)^{-1} = 1 + z + z^2 + \cdots$ deduce that

$$g(\rho, \theta) = \frac{1 + \rho e^{i\theta}}{2(1 - \rho e^{i\theta})} = \frac{1}{2} + \rho e^{i\theta} + \rho^2 e^{i2\theta} + \cdots \quad \text{for } 0 \le \rho < 1.$$

c) Verify that if $0 \le \rho < 1$ and

$$P_\rho(\theta) := \operatorname{Re} g(\rho, \theta) = \frac{1}{2} + \rho\cos\theta + \rho^2\cos 2\theta + \cdots,$$

then the function $P_\rho(\theta)$ has the form

$$P_\rho(\theta) = \frac{1}{2}\,\frac{1 - \rho^2}{1 - 2\rho\cos\theta + \rho^2}$$

and is called the Poisson kernel for the disk.

d) Show that the family of functions $P_\rho(\theta)$ depending on the parameter $\rho$ has the
following set of properties: $P_\rho(\theta) \ge 0$, $\frac{1}{\pi}\int_0^{2\pi} P_\rho(\theta)\,d\theta = 1$, $\int_{\varepsilon > 0}^{2\pi - \varepsilon} P_\rho(\theta)\,d\theta \to 0$
as $\rho \to 1 - 0$.

e) Prove that if $f \in C[0, 2\pi]$, then the function

$$u(\rho, \theta) = \frac{1}{\pi}\int_0^{2\pi} P_\rho(\theta - t)f(t)\,dt$$

is a harmonic function in the disk $\rho < 1$ and $u(\rho, \theta) \rightrightarrows f(\theta)$ as $\rho \to 1 - 0$. Thus,
the Poisson kernel makes it possible to construct a function harmonic in the disk
having prescribed values on the boundary circle.

f) For locally integrable functions $u$ and $v$ that are periodic with the same
period $T$, one can give an unambiguous definition of the convolution (convolution
over a period) as follows:

$$(u \underset{T}{*} v)(x) := \int_a^{a+T} u(y)v(x - y)\,dy.$$

The periodic functions on $\mathbb{R}$ can be interpreted as functions defined on the circle,
so that this operation can naturally be regarded as the convolution of two functions
defined on a circle.

Show that if $f(\theta)$ is a locally integrable $2\pi$-periodic function on $\mathbb{R}$ (or, what is
the same, $f$ is a function on a circle), and the family $P_\rho(\theta)$ of functions depending
on the parameter $\rho$ has the properties of the Poisson kernel enumerated in d), then
$(f * P_\rho)(\theta) \to f(\theta)$ as $\rho \to 1 - 0$ at each point of continuity of $f$.
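
The formulas in parts c) and d) of Problem 5 lend themselves to a quick numerical check. The sketch below is an illustration, not a solution: it compares a truncated version of the cosine series with the closed form of the Poisson kernel and approximates the normalizing integral by the midpoint rule; the number of terms, the grid size, and the sample values of $\rho$ and $\theta$ are arbitrary choices.

```python
import math

def poisson_closed(rho, theta):
    """Closed form (1/2) * (1 - rho^2) / (1 - 2 rho cos(theta) + rho^2)."""
    return 0.5 * (1.0 - rho * rho) / (1.0 - 2.0 * rho * math.cos(theta) + rho * rho)

def poisson_series(rho, theta, terms=200):
    """Partial sum 1/2 + rho cos(theta) + rho^2 cos(2 theta) + ... (truncated)."""
    return 0.5 + sum(rho ** n * math.cos(n * theta) for n in range(1, terms + 1))

def normalized_mass(rho, n=2000):
    """Midpoint approximation of (1/pi) * integral of P_rho over [0, 2 pi]."""
    h = 2.0 * math.pi / n
    return h * sum(poisson_closed(rho, (i + 0.5) * h) for i in range(n)) / math.pi

if __name__ == "__main__":
    rho, theta = 0.6, 1.1
    print(poisson_series(rho, theta), poisson_closed(rho, theta))
    print(normalized_mass(rho))
```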

6. a) Suppose $\varphi(x) := a \cdot \exp\bigl(\frac{1}{|x|^2 - 1}\bigr)$ for $|x| < 1$ and $\varphi(x) := 0$ for $|x| \ge 1$. Let the
constant $a$ be chosen so that $\int_{\mathbb{R}} \varphi(x)\,dx = 1$. Verify that the family of functions
$\varphi_\alpha(x) = \frac{1}{\alpha}\varphi\bigl(\frac{x}{\alpha}\bigr)$ is an approximate identity as $\alpha \to +0$ consisting of functions in
$C_0^{(\infty)}$ on $\mathbb{R}$.

b) For every interval $I \subset \mathbb{R}$ and every $\varepsilon > 0$ construct a function $e(x)$ of class
$C_0^{(\infty)}$ such that $0 \le e(x) \le 1$ on $\mathbb{R}$, $e(x) = 1 \Leftrightarrow x \in I$, and finally, $\operatorname{supp} e \subset I_\varepsilon$,
where $I_\varepsilon$ is the $\varepsilon$-neighborhood (or the $\varepsilon$-inflation) of the set $I$ in $\mathbb{R}$. (Verify that for
a suitable value of $\alpha > 0$ one can take $e(x)$ to be $\chi_I * \varphi_\alpha$.)

c) Prove that for every $\varepsilon > 0$ there exists a countable set $\{e_k\}$ of functions $e_k \in
C_0^{(\infty)}$ (an $\varepsilon$-partition of unity on $\mathbb{R}$) that possesses the following properties: $\forall k \in \mathbb{N}$,
$\forall x \in \mathbb{R}$ $(0 \le e_k(x) \le 1)$; the diameter of the support $\operatorname{supp} e_k$ of every function in the
family is at most $\varepsilon > 0$; every point $x \in \mathbb{R}$ belongs to only a finite number of the
sets $\operatorname{supp} e_k$; $\sum_k e_k(x) \equiv 1$ on $\mathbb{R}$.

d) Show that for every open covering $\{U_\gamma, \gamma \in \Gamma\}$ of the open set $G \subset \mathbb{R}$ and
every function $\varphi \in C^{(\infty)}(G)$ there exists a sequence $\{\varphi_k;\ k \in \mathbb{N}\}$ of functions $\varphi_k \in
C_0^{(\infty)}$ that has the following properties: $\forall k \in \mathbb{N}\ \exists\gamma \in \Gamma$ $(\operatorname{supp}\varphi_k \subset U_\gamma)$; every point
$x \in G$ belongs to only a finite number of sets $\operatorname{supp}\varphi_k$; $\sum_k \varphi_k(x) = \varphi(x)$ on $G$.

e) Prove that the set of functions $C_0^{(\infty)}$ interpreted as generalized functions is
everywhere dense in the corresponding set $C^{(\infty)}(G)$ of regular generalized
functions.

f) Two generalized functions $F_1$ and $F_2$ in $\mathcal{D}'(G)$ are regarded as equal on an
open set $U \subset G$ if $\langle F_1, \varphi\rangle = \langle F_2, \varphi\rangle$ for every function $\varphi \in \mathcal{D}(G)$ whose support is
contained in $U$. Generalized functions $F_1$ and $F_2$ are regarded as locally equal at the
point $x \in G$ if they are equal in some neighborhood $U(x) \subset G$ of that point. Prove
that $(F_1 = F_2) \Leftrightarrow (F_1 = F_2$ locally at each point $x \in G)$.

7. a) Let $\varphi(x) := \exp\bigl(\frac{1}{|x|^2 - 1}\bigr)$ for $|x| < 1$ and $\varphi(x) := 0$ for $|x| \ge 1$. Show that
$\int_{\mathbb{R}} f(x)\varphi_\varepsilon(x)\,dx \to 0$ as $\varepsilon \to +0$ for every function $f$ that is locally integrable on
$\mathbb{R}$, where $\varphi_\varepsilon(x) = \varphi\bigl(\frac{x}{\varepsilon}\bigr)$.

b) Taking account of the preceding result and the fact that $\langle\delta, \varphi_\varepsilon\rangle = \varphi(0) \ne 0$,
prove that the generalized function $\delta$ is not regular.

c) Show that there exists a sequence of regular generalized functions (even
corresponding to functions of class $C_0^{(\infty)}$) that converges in $\mathcal{D}'$ to the generalized
function $\delta$. (In fact every generalized function is the limit of regular generalized
functions corresponding to functions in $\mathcal{D} = C_0^{(\infty)}$. In this sense the regular generalized
functions form an everywhere dense set in $\mathcal{D}'$, just as the rational numbers $\mathbb{Q}$ are
everywhere dense in the real numbers $\mathbb{R}$.)

8. a) Compute the value $\langle F, \varphi\rangle$ of the generalized function $F \in \mathcal{D}'$ on the function
$\varphi \in \mathcal{D}$ if $F = \sin x\,\delta$; $F = 2\cos x\,\delta$; $F = (1 + x^2)\delta$.

b) Verify that the operation $F \mapsto \psi F$ of multiplication by the function $\psi \in
C^{(\infty)}$ is a continuous operation in $\mathcal{D}'$.

c) Verify that linear operations on generalized functions are continuous in $\mathcal{D}'$.

9. a) Show that if $F$ is the regular distribution generated by the function
$f(x) = \begin{cases} 0 & \text{for } x \le 0, \\ x & \text{for } x > 0, \end{cases}$ then $F' = H$, where $H$ is the distribution corresponding to the Heaviside
function.

b) Compute the derivative of the distribution corresponding to the function $|x|$.

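10. a) Verify that the following limiting passages in $\mathcal{D}'$ are correct:

$$\lim_{\alpha \to +0}\frac{\alpha}{x^2 + \alpha^2} = \pi\delta; \quad \lim_{\alpha \to +0}\frac{\alpha x}{\alpha^2 + x^2} = \pi x\delta; \quad \lim_{\alpha \to +0}\frac{x}{x^2 + \alpha^2} = (\ln|x|)'.$$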

b) Show that if $f = f(x)$ is a locally integrable function on $\mathbb{R}$ and $f_\varepsilon =
f(x + \varepsilon)$, then $f_\varepsilon \to f$ in $\mathcal{D}'$ as $\varepsilon \to 0$.

c) Prove that if $\{\Delta_\alpha\}$ is an approximate identity consisting of smooth functions
as $\alpha \to 0$, then $F_\alpha = \int_{-\infty}^{x} \Delta_\alpha(t)\,dt \to H$ as $\alpha \to 0$, where $H$ is the generalized
function corresponding to the Heaviside function.

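11. a) The symbol $\delta(x - a)$ usually denotes the $\delta$-function shifted to the point $a$,
that is, the generalized function acting on a function $\varphi \in \mathcal{D}$ according to the rule
$\langle\delta(x - a), \varphi\rangle = \varphi(a)$. Show that the series $\sum_{k \in \mathbb{Z}} \delta(x - k)$ converges in $\mathcal{D}'$.

b) Find the derivative of the function $[x]$ (the integer part of $x$).

c) A $2\pi$-periodic function on $\mathbb{R}$ is defined in the interval $]0, 2\pi]$ by the formula
$f|_{]0,2\pi]}(x) = \frac{1}{2} - \frac{x}{2\pi}$. Show that $f' = -\frac{1}{2\pi} + \sum_{k \in \mathbb{Z}} \delta(x - 2\pi k)$.

d) Verify that $\delta(x - \varepsilon) \to \delta(x)$ as $\varepsilon \to 0$.

e) As before, denoting the $\delta$-function shifted to the point $\varepsilon$ by $\delta(x - \varepsilon)$, show
by direct computation that $\frac{1}{\varepsilon}\bigl(\delta(x - \varepsilon) - \delta(x)\bigr) \to -\delta'(x) = -\delta'$.

f) Starting from the preceding limiting passage, interpret $-\delta'$ as the distribution
of charges corresponding to a dipole with electric moment $+1$ located at the
point $x = 0$. Verify that $\langle -\delta', 1\rangle = 0$ (the total charge of a dipole is zero) and that
$\langle -\delta', x\rangle = 1$ (its moment is indeed 1).

g) An important property of the $\delta$-function is its homogeneity: $\delta(\lambda x) =
\lambda^{-1}\delta(x)$. Prove this equality.

12. a) For the generalized function $F$ defined as $\langle F, \varphi\rangle = \int_0^{+\infty} \sqrt{x}\,\varphi(x)\,dx$, verify
the following equalities:

$$\langle F', \varphi\rangle = \frac{1}{2}\int_0^{+\infty} \frac{\varphi(x)}{\sqrt{x}}\,dx;$$

$$\langle F'', \varphi\rangle = -\frac{1}{4}\int_0^{+\infty} \frac{\varphi(x) - \varphi(0)}{x^{3/2}}\,dx;$$

$$\langle F''', \varphi\rangle = \frac{3}{8}\int_0^{+\infty} \frac{\varphi(x) - \varphi(0) - x\varphi'(0)}{x^{5/2}}\,dx;$$

$$\langle F^{(n)}, \varphi\rangle = \frac{(-1)^{n-1}(2n - 3)!!}{2^n}\int_0^{+\infty} \frac{\varphi(x) - \varphi(0) - x\varphi'(0) - \cdots - \frac{x^{n-2}}{(n-2)!}\varphi^{(n-2)}(0)}{x^{\frac{2n-1}{2}}}\,dx.$$

b) Show that if $n - 1 < p < n$ and the generalized function $x_+^{-p}$ is defined by
the relation

$$\langle x_+^{-p}, \varphi\rangle := \int_0^{+\infty} \frac{\varphi(x) - \varphi(0) - x\varphi'(0) - \cdots - \frac{x^{n-2}}{(n-2)!}\varphi^{(n-2)}(0)}{x^p}\,dx,$$

then its derivative is the function $-p x_+^{-(p+1)}$ defined by the relation

$$\langle -p x_+^{-(p+1)}, \varphi\rangle = -p\int_0^{+\infty} \frac{\varphi(x) - \varphi(0) - x\varphi'(0) - \cdots - \frac{x^{n-1}}{(n-1)!}\varphi^{(n-1)}(0)}{x^{p+1}}\,dx.$$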


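13. The generalized function defined by the equality

$$\Bigl\langle\mathcal{P}\frac{1}{x}, \varphi\Bigr\rangle := \mathrm{V.P.}\int_{-\infty}^{+\infty} \frac{\varphi(x)}{x}\,dx \quad \Bigl(:= \lim_{\varepsilon \to +0}\Bigl(\int_{-\infty}^{-\varepsilon} + \int_{\varepsilon}^{+\infty}\Bigr)\frac{\varphi(x)}{x}\,dx\Bigr)$$

is denoted $\mathcal{P}\frac{1}{x}$. Show that

a) $\bigl\langle\mathcal{P}\frac{1}{x}, \varphi\bigr\rangle = \int_0^{+\infty} \frac{\varphi(x) - \varphi(-x)}{x}\,dx$.

b) $(\ln|x|)' = \mathcal{P}\frac{1}{x}$.

c) $\bigl\langle\bigl(\mathcal{P}\frac{1}{x}\bigr)', \varphi\bigr\rangle = -\int_0^{+\infty} \frac{\varphi(x) + \varphi(-x) - 2\varphi(0)}{x^2}\,dx$.

d) $\frac{1}{x + i0} := \lim_{y \to +0}\frac{1}{x + iy} = -i\pi\delta + \mathcal{P}\frac{1}{x}$.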

14. Some difficulties may arise with the definition of the multiplication of
generalized functions: for example, the function $|x|^{-2/3}$ is absolutely (improperly)
integrable on $\mathbb{R}$; it generates a corresponding generalized function $\int_{-\infty}^{+\infty} |x|^{-2/3}\varphi(x)\,dx$,
but its square $|x|^{-4/3}$ is no longer an integrable function, even in the improper sense.
The answers to the following questions show that it is theoretically impossible to
define a natural associative and commutative operation of multiplication for any
generalized functions.

a) Show that $f(x)\delta = f(0)\delta$ for every function $f \in C^{(\infty)}$.

b) Verify that $x\mathcal{P}\frac{1}{x} = 1$ in $\mathcal{D}'$.

c) If the operation of multiplication were extended to all pairs of generalized
functions, it would at least not be associative and commutative. Otherwise,

$$0 = 0\,\mathcal{P}\frac{1}{x} = \bigl(x\delta(x)\bigr)\mathcal{P}\frac{1}{x} = \bigl(\delta(x)x\bigr)\mathcal{P}\frac{1}{x} = \delta(x)\Bigl(x\mathcal{P}\frac{1}{x}\Bigr) = \delta(x) \cdot 1 = 1 \cdot \delta(x) = \delta.$$


15. a) Show that a fundamental solution $E$ for the linear operator $A : \mathcal{D}' \to \mathcal{D}'$ is in general ambiguously defined, up to any solution of the homogeneous equation $Af = 0$.

b) Consider the differential operator

$$P\Big(x,\frac{d}{dx}\Big) := \frac{d^n}{dx^n} + a_1(x)\frac{d^{n-1}}{dx^{n-1}} + \cdots + a_n(x).$$

Show that if $u_0 = u_0(x)$ is a solution of the equation $P\big(x,\frac{d}{dx}\big)u_0 = 0$ that satisfies the initial conditions $u_0(0) = \cdots = u_0^{(n-2)}(0) = 0$ and $u_0^{(n-1)}(0) = 1$, then the function $E(x) = H(x)u_0(x)$ (where $H(x)$ is the Heaviside function) is a fundamental solution for the operator $P\big(x,\frac{d}{dx}\big)$.

c) Use this method to find the fundamental solutions for the following opera- 
tors: 



d) Using these results and the convolution, find solutions of the equations $\frac{d^m u}{dx^m} = f$, $\big(\frac{d}{dx} + a\big)^m u = f$, where $f \in C(\mathbb{R}, \mathbb{R})$.


17.5 Multiple Integrals Depending on a Parameter 

In the first two subsections of the present section we shall exhibit properties of 
proper and improper multiple integrals depending on a parameter. The total re- 
sult of these subsections is that the basic properties of multiple integrals depending 
on a parameter do not differ essentially from the corresponding properties of one- 
dimensional integrals depending on a parameter studied above. In the third subsec- 
tion we shall study the case of an improper integral whose singularity itself depends 
on a parameter, which is important in applications. Finally, in the fourth subsection 
we shall study the convolution of functions of several variables and some specifi- 
cally multi-dimensional questions on generalized functions closely connected with 
integrals depending on a parameter and the classical integral formulas of analysis. 
17.5.1 Proper Multiple Integrals Depending on a Parameter

Let $X$ be a measurable subset of $\mathbb{R}^n$, for example, a bounded domain with smooth or piecewise-smooth boundary, and let $Y$ be a subset of $\mathbb{R}^m$.

Consider the following integral depending on a parameter:

$$F(y) = \int_X f(x,y)\,dx, \tag{17.72}$$

where the function $f$ is assumed to be defined on the set $X \times Y$ and integrable on $X$ for each fixed value of $y \in Y$.

The following propositions hold. 

Proposition 1 If $X \times Y$ is a compact subset of $\mathbb{R}^{n+m}$ and $f \in C(X \times Y)$, then $F \in C(Y)$.

Proposition 2 If $Y$ is a domain in $\mathbb{R}^m$, $f \in C(X \times Y)$, and $\frac{\partial f}{\partial y^i} \in C(X \times Y)$, then the function $F$ is differentiable with respect to $y^i$ in $Y$, where $y = (y^1, \ldots, y^i, \ldots, y^m)$, and

$$\frac{\partial F}{\partial y^i}(y) = \int_X \frac{\partial f}{\partial y^i}(x,y)\,dx. \tag{17.73}$$

Proposition 3 If $X$ and $Y$ are measurable compact subsets of $\mathbb{R}^n$ and $\mathbb{R}^m$ respectively, while $f \in C(X \times Y)$, then $F \in C(Y) \subset \mathcal{R}(Y)$, and

$$\int_Y F(y)\,dy := \int_Y dy \int_X f(x,y)\,dx = \int_X dx \int_Y f(x,y)\,dy. \tag{17.74}$$

We note that the values of the function $f$ here may lie in any normed vector space $Z$. The most important special cases occur when $Z$ is $\mathbb{R}$, $\mathbb{C}$, $\mathbb{R}^n$, or $\mathbb{C}^n$. In these cases the verification of Propositions 1-3 obviously reduces to the case $Z = \mathbb{R}$. But for $Z = \mathbb{R}$ the proofs of Propositions 1 and 2 are verbatim repetitions of the proofs of the corresponding propositions for a one-dimensional integral (see Sect. 17.1), and Proposition 3 is a simple corollary of Proposition 1 and Fubini's theorem (Sect. 11.4).
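
As a numerical illustration of Proposition 2, the following sketch compares the two sides of (17.73) for one concrete case. The choices here are assumptions made only for the example and are not taken from the text: $X = [0,1]^2$, $f(x,y) = e^{y(x^1+x^2)}$ with a scalar parameter $y$, a plain midpoint rule, and all function names. For this integrand the exact value of the derivative at $y = 1$ is $2(e-1)$.

```python
import math

def midpoint_2d(g, n=200):
    """Midpoint rule for the integral of g(x1, x2) over the unit square [0,1]^2."""
    h = 1.0 / n
    total = 0.0
    for i in range(n):
        x1 = (i + 0.5) * h
        for j in range(n):
            x2 = (j + 0.5) * h
            total += g(x1, x2)
    return total * h * h

def F(y):
    """F(y) = integral over X of f(x, y) dx with f(x, y) = exp(y * (x1 + x2))."""
    return midpoint_2d(lambda x1, x2: math.exp(y * (x1 + x2)))

def dF_under_integral(y):
    """Right-hand side of (17.73): integrate df/dy = (x1 + x2) * exp(y * (x1 + x2))."""
    return midpoint_2d(lambda x1, x2: (x1 + x2) * math.exp(y * (x1 + x2)))

def dF_finite_difference(y, h=1e-4):
    """Left-hand side of (17.73), approximated by a central difference of F."""
    return (F(y + h) - F(y - h)) / (2.0 * h)
```

Calling `dF_under_integral(1.0)` and `dF_finite_difference(1.0)` should give nearly equal numbers, both close to $2(e-1)$; the agreement is limited only by the quadrature and the difference step.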


17.5.2 Improper Multiple Integrals Depending on a Parameter 

If the set $X \subset \mathbb{R}^n$ or the function $f(x, y)$ in the integral (17.72) is unbounded, the integral is understood as the limit of proper integrals over the sets of a suitable exhaustion of $X$. In studying multiple improper integrals depending on a parameter, as a rule, one is interested in particular exhaustions like those that we studied in the one-dimensional case. In complete accord with the one-dimensional case, we remove the $\varepsilon$-neighborhood of the singularities,⁸ find the integrals over the remaining parts $X_\varepsilon$ of $X$, and then find the limit of the values of the integrals over $X_\varepsilon$ as $\varepsilon \to +0$.

If this limiting passage is uniform with respect to the parameter $y \in Y$, we say that the improper integral (17.72) converges uniformly on $Y$.

Example 1 The integral

$$F(\lambda) = \iint_{\mathbb{R}^2} e^{-\lambda(x^2+y^2)}\,dx\,dy$$

results from the limiting passage

$$\iint_{\mathbb{R}^2} e^{-\lambda(x^2+y^2)}\,dx\,dy := \lim_{\varepsilon\to+0}\iint_{x^2+y^2\le 1/\varepsilon^2} e^{-\lambda(x^2+y^2)}\,dx\,dy$$

and, as one can easily verify using polar coordinates, it converges for $\lambda > 0$. Furthermore, it converges uniformly on the set $E_{\lambda_0} = \{\lambda \in \mathbb{R} \mid \lambda \ge \lambda_0 > 0\}$, since for $\lambda \in E_{\lambda_0}$,

$$0 < \iint_{x^2+y^2\ge 1/\varepsilon^2} e^{-\lambda(x^2+y^2)}\,dx\,dy \le \iint_{x^2+y^2\ge 1/\varepsilon^2} e^{-\lambda_0(x^2+y^2)}\,dx\,dy,$$

and this last integral tends to 0 as $\varepsilon \to 0$ (the original integral $F(\lambda)$ converges at $\lambda = \lambda_0 > 0$).
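
The estimate in Example 1 can be made concrete with a short sketch. In polar coordinates the integral over the disk of radius $R$ equals $\frac{\pi}{\lambda}(1 - e^{-\lambda R^2})$, so the tail over $x^2+y^2 \ge 1/\varepsilon^2$ is $\frac{\pi}{\lambda}e^{-\lambda/\varepsilon^2}$, which decreases in $\lambda$. The function names and the midpoint rule below are illustrative assumptions, not part of the text.

```python
import math

def disk_integral(lam, radius, n=20000):
    """Integral of exp(-lam*(x^2+y^2)) over the disk x^2+y^2 <= radius^2.

    Uses polar coordinates: 2*pi * integral_0^radius exp(-lam*r^2) * r dr,
    evaluated by the midpoint rule.
    """
    h = radius / n
    total = 0.0
    for k in range(n):
        r = (k + 0.5) * h
        total += math.exp(-lam * r * r) * r
    return 2.0 * math.pi * total * h

def tail(lam, eps):
    """Closed form of the integral over x^2+y^2 >= 1/eps^2 (requires lam > 0)."""
    return math.pi / lam * math.exp(-lam / eps ** 2)

def sup_tail_on_grid(lam0, eps, lams):
    """Largest tail over the sample values lam >= lam0; it is attained at lam0."""
    return max(tail(lam, eps) for lam in lams if lam >= lam0)
```

For every sampled $\lambda \ge \lambda_0$ the tail is bounded by `tail(lam0, eps)`, a quantity independent of $\lambda$ that tends to 0 with $\varepsilon$; this is exactly the uniformity asserted in the example.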


Example 2 Suppose, as always, that $B(a, r) = \{x \in \mathbb{R}^n \mid |x - a| < r\}$ is the ball of radius $r$ with center at $a \in \mathbb{R}^n$, and let $y \in \mathbb{R}^n$. Consider the integral

$$F(y) = \int_{B(0,1)} \frac{|x-y|}{(1-|x|)^{\alpha}}\,dx := \lim_{\varepsilon\to+0}\int_{B(0,1-\varepsilon)} \frac{|x-y|}{(1-|x|)^{\alpha}}\,dx.$$

Passing to polar coordinates in $\mathbb{R}^n$, we verify that this integral converges only for $\alpha < 1$. If the value $\alpha < 1$ is fixed, the integral converges uniformly with respect to the parameter $y$ on every compact set $Y \subset \mathbb{R}^n$, since $|x - y| \le M(Y) \in \mathbb{R}$ in that case.

We note that in these examples the set of singularities of the integral was inde- 
pendent of the parameter. Thus, if we adopt the concept given above of uniform 
convergence of an improper integral with a fixed set of singularities, it is clear that 
all the basic properties of such improper multiple integrals depending on a parameter 
can be obtained from the corresponding properties of proper multiple integrals and 
theorems on passage to the limit for families of functions depending on a parameter. 

We shall not take the time to explain these facts again, which are theoretically already familiar to us, preferring instead to use the machinery we have developed to study the following very important and frequently encountered situation in which the singularity of an improper integral (one-dimensional or multi-dimensional) itself depends on a parameter.

⁸ That is, the points in every neighborhood of which the function $f$ is unbounded. If the set $X$ is also unbounded, we remove a neighborhood of infinity from it.


17.5.3 Improper Integrals with a Variable Singularity 

Example 3 As is known, the potential of a unit charge located at the point $x \in \mathbb{R}^3$ is expressed by the formula $U(x, y) = \frac{1}{|x-y|}$, where $y$ is a variable point of $\mathbb{R}^3$. If the charge is now distributed in a bounded region $X \subset \mathbb{R}^3$ with a bounded density $\mu(x)$ (equal to zero outside $X$), the potential of a charge distributed in this way can be written (by virtue of the additivity of potential) as

$$U(y) = \int_{\mathbb{R}^3} U(x,y)\mu(x)\,dx = \int_X \frac{\mu(x)\,dx}{|x-y|}. \tag{17.75}$$

The role of the parameter in this last integral is played by the variable point $y \in \mathbb{R}^3$. If the point $y$ lies in the exterior of the set $X$, the integral (17.75) is a proper integral; but if $y \in \overline{X}$, then $|x - y| \to 0$ as $X \ni x \to y$, and $y$ becomes a singularity of the integral. As $y$ varies, this singularity thus moves.

Since $U(y) = \lim_{\varepsilon\to+0} U_\varepsilon(y)$, where

$$U_\varepsilon(y) = \int_{X\setminus B(y,\varepsilon)} \frac{\mu(x)}{|x-y|}\,dx,$$

it is natural to consider, as before, that the integral (17.75) with a variable singularity converges uniformly on the set $Y$ if $U_\varepsilon(y) \rightrightarrows U(y)$ on $Y$ as $\varepsilon \to +0$.

We have assumed that $|\mu(x)| \le M \in \mathbb{R}$ on $X$, and therefore

$$\Big|\int_{X\cap B(y,\varepsilon)} \frac{\mu(x)\,dx}{|x-y|}\Big| \le M\int_{B(y,\varepsilon)} \frac{dx}{|x-y|} = 2\pi M\varepsilon^2.$$

This estimate shows that $|U(y) - U_\varepsilon(y)| \le 2\pi M\varepsilon^2$ for every $y \in \mathbb{R}^3$, that is, the integral (17.75) converges uniformly on the set $Y = \mathbb{R}^3$.

In particular, if we verify that the function $U_\varepsilon(y)$ is continuous with respect to $y$, we will then be able to deduce from general considerations that the potential $U(y)$ is continuous. But the continuity of $U_\varepsilon(y)$ does not follow formally from Proposition 1 on the continuity of a proper integral depending on a parameter, since in the present case the domain of integration $X\setminus B(y, \varepsilon)$ changes when $y$ changes. For that reason, we need to examine the question of the continuity of $U_\varepsilon(y)$ more closely.

We remark that for $|y - y_0| \le \varepsilon$,

$$U_\varepsilon(y) = \int_{X\setminus B(y_0,2\varepsilon)} \frac{\mu(x)}{|x-y|}\,dx + \int_{(X\setminus B(y,\varepsilon))\cap B(y_0,2\varepsilon)} \frac{\mu(x)}{|x-y|}\,dx.$$

The first of these two integrals is continuous with respect to $y$ assuming $|y - y_0| < \varepsilon$, being a proper integral with a fixed domain of integration. The absolute value of the second does not exceed

$$\int_{B(y_0,2\varepsilon)} \frac{M\,dx}{|x-y|} \le 8\pi M\varepsilon^2.$$

Hence the inequality $|U_\varepsilon(y) - U_\varepsilon(y_0)| < \varepsilon + 16\pi M\varepsilon^2$ holds for all values of $y$ sufficiently close to $y_0$. Since $\varepsilon > 0$ is arbitrary and $U_\varepsilon \rightrightarrows U$, this establishes continuity at the point $y_0 \in \mathbb{R}^3$.

Thus we have shown that the potential $U(y)$ is a continuous function in the whole space $\mathbb{R}^3$.

These examples provide the basis for adopting the following definition. 

Definition 1 Suppose the integral (17.72) is an improper integral that converges for each $y \in Y$. Let $X_\varepsilon$ be the portion of the set $X$ obtained by removing from $X$ the $\varepsilon$-neighborhood of the set of singularities of the integral,⁹ and let $F_\varepsilon(y) = \int_{X_\varepsilon} f(x, y)\,dx$. We shall say that the integral (17.72) converges uniformly on the set $Y$ if $F_\varepsilon(y) \rightrightarrows F(y)$ on $Y$ as $\varepsilon \to +0$.

The following useful proposition is an immediate consequence of this definition and considerations similar to those illustrated in Example 3.

Proposition 4 If the function $f$ in the integral (17.72) admits the estimate $|f(x, y)| \le \frac{M}{|x-y|^{\alpha}}$, where $M \in \mathbb{R}$, $x \in X \subset \mathbb{R}^n$, $y \in Y \subset \mathbb{R}^n$, and $\alpha < n$, then the integral converges uniformly on $Y$.


Example 4 In particular, we conclude on the basis of Proposition 4 that the integral

$$V_i(y) = \int_X \frac{\mu(x)(x^i - y^i)}{|x-y|^3}\,dx,$$

obtained by formal differentiation of the potential (17.75) with respect to the variable $y^i$ ($i = 1, 2, 3$), converges uniformly on $Y = \mathbb{R}^3$, since $\Big|\frac{\mu(x)(x^i - y^i)}{|x-y|^3}\Big| \le \frac{M}{|x-y|^2}$.

As in Example 3, it follows from this that the function $V_i(y)$ is continuous on $\mathbb{R}^3$.

Let us now verify that the function $U(y)$, the potential (17.75), really does have a partial derivative $\frac{\partial U}{\partial y^i}$ and that $\frac{\partial U}{\partial y^i}(y) = V_i(y)$.

To do this it obviously suffices to verify that

$$\int_a^b V_i(y^1, y^2, y^3)\,dy^i = U(y^1, y^2, y^3)\Big|_{y^i=a}^{b}.$$

J a 


9 See the footnote on p. 473. 
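But in fact,

$$\int_a^b V_i(y)\,dy^i = \int_a^b dy^i \int_X \frac{\mu(x)(x^i - y^i)}{|x-y|^3}\,dx = \int_X \mu(x)\,dx \int_a^b \frac{x^i - y^i}{|x-y|^3}\,dy^i =$$

$$= \int_X \mu(x)\,dx \int_a^b \frac{\partial}{\partial y^i}\Big(\frac{1}{|x-y|}\Big)\,dy^i = \int_X \frac{\mu(x)\,dx}{|x-y|}\Big|_{y^i=a}^{b} = U(y)\Big|_{y^i=a}^{b}.$$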

The only nontrivial point in this computation is the reversal of the order of inte- 
gration. In general, in order to reverse the order of improper integrals, it suffices to 
have a multiple integral that converges absolutely with respect to the whole set of 
variables. This condition holds in the present case, so that the interchange is justi- 
fied. Of course, it could also be justified directly due to the simplicity of the function 
involved. 

Thus, we have shown that the potential U (y) generated by a charge distributed 
in R 3 with a bounded density is continuously differentiable in the entire space. 
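
A concrete instance may help fix the conclusion of Examples 3 and 4. The sketch below treats a ball of radius $R$ with constant density 1 and uses the classical shell formula from electrostatics (a spherical shell of radius $r$ contributes $4\pi r^2/\max(r,s)$ to the potential at distance $s$ from the centre). That formula, the closed forms, and all names are assumptions imported for illustration; they are not derived in the text.

```python
import math

def potential_closed_form(s, R=1.0):
    """Potential of a ball of radius R with density 1 at distance s >= 0 from the centre."""
    if s <= R:
        return 2.0 * math.pi * R ** 2 - 2.0 * math.pi * s ** 2 / 3.0
    return 4.0 * math.pi * R ** 3 / (3.0 * s)

def potential_radial_quadrature(s, R=1.0, n=20000):
    """Midpoint rule for integral_0^R 4*pi*r^2 / max(r, s) dr (sum of shell contributions)."""
    h = R / n
    total = 0.0
    for k in range(n):
        r = (k + 0.5) * h
        total += 4.0 * math.pi * r * r / max(r, s)
    return total * h

def radial_derivative(s, R=1.0, h=1e-5):
    """Central-difference approximation of dU/ds from the closed form."""
    return (potential_closed_form(s + h, R) - potential_closed_form(s - h, R)) / (2.0 * h)
```

Both the potential and its radial derivative match across the sphere $s = R$ (the one-sided derivatives both equal $-\frac{4\pi}{3}R$), in agreement with the continuous differentiability established above, while the second derivative has a jump there.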

The techniques and reasoning used in Examples 3 and 4 enable us to study the 
following more general situation in a very similar way. 

Let

$$F(y) = \int_X K\big(y - \varphi(x)\big)\psi(x, y)\,dx, \tag{17.76}$$

where $X$ is a bounded measurable domain in $\mathbb{R}^n$, the parameter $y$ ranges over the domain $Y \subset \mathbb{R}^m$, with $n \le m$, $\varphi : X \to \mathbb{R}^m$ is a smooth mapping satisfying $\operatorname{rank}\varphi'(x) = n$ and $\|\varphi'(x)\| \ge c > 0$, that is, $\varphi$ defines an $n$-dimensional parametrized surface, or, more precisely, an $n$-path in $\mathbb{R}^m$. Here $K \in C(\mathbb{R}^m\setminus 0, \mathbb{R})$, that is, the function $K(z)$ is continuous everywhere in $\mathbb{R}^m$ except at $z = 0$, near which it may be unbounded; and $\psi : X \times Y \to \mathbb{R}$ is a bounded continuous function. We shall assume that for each $y \in Y$ the integral (17.76) (which in general is an improper integral) exists.

In the integral (17.75) that we considered above, in particular, we had $n = m$, $\varphi(x) = x$, $\psi(x, y) = \mu(x)$, $K(z) = |z|^{-1}$.

It is not difficult to verify that under these restrictions on the function $\varphi$, Definition 1 of uniform convergence of the integral (17.76) means that for every $\alpha > 0$ one can choose $\varepsilon > 0$ such that

$$\Big|\int_{|y-\varphi(x)|<\varepsilon} K\big(y - \varphi(x)\big)\psi(x, y)\,dx\Big| < \alpha, \tag{17.77}$$

where the integral is taken over the set¹⁰ $\{x \in X \mid |y - \varphi(x)| < \varepsilon\}$.

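The following propositions hold for the integral (17.76).

Proposition 5 If the integral (17.76) converges uniformly on $Y$ under the hypotheses described above on the functions $\varphi$, $\psi$, and $K$, then $F \in C(Y, \mathbb{R})$.

Proposition 6 If it is known in addition that the function $\psi$ in the integral (17.76) is independent of the parameter $y$ (that is, $\psi(x, y) = \psi(x)$) and $K \in C^{(1)}(\mathbb{R}^m\setminus 0, \mathbb{R})$, then if the integral

$$\int_X \frac{\partial K}{\partial y^i}\big(y - \varphi(x)\big)\psi(x)\,dx$$

converges uniformly on the set $y \in Y$, one can say that the function $F$ has a continuous partial derivative $\frac{\partial F}{\partial y^i}$, and

$$\frac{\partial F}{\partial y^i}(y) = \int_X \frac{\partial K}{\partial y^i}\big(y - \varphi(x)\big)\psi(x)\,dx. \tag{17.78}$$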


The proofs of these propositions, as stated, are completely analogous to those in 
Examples 3 and 4, and so we shall not take the time to give them. 

We note only that the convergence of an improper integral (under an arbitrary 
exhaustion) implies its absolute convergence. In Examples 3 and 4 the hypothesis 
of absolute convergence was used in the estimates and in reversing the order of 
integration. As an illustration of the possible uses of Propositions 5 and 6, let us 
consider another example from potential theory. 

Example 5 Suppose a charge is distributed on a smooth compact surface $S \subset \mathbb{R}^3$ with surface density $\nu(x)$. The potential of such a charge distribution is called a single-layer potential and is obviously represented by the surface integral

$$U(y) = \int_S \frac{\nu(x)\,d\sigma(x)}{|x-y|}. \tag{17.79}$$

Suppose $\nu$ is a bounded function. Then for $y \notin S$ this integral is proper, and the function $U(y)$ is infinitely differentiable outside $S$.

But if $y \in S$, the integral has an integrable singularity at the point $y$. The singularity is integrable because the surface $S$ is smooth and differs by little from a piece of the plane $\mathbb{R}^2$ near the point $y \in S$, and we know that a singularity of type $1/r^{\alpha}$ is integrable in the plane if $\alpha < 2$. Using Proposition 5, we can turn this general consideration into a formal proof. If we represent $S$ locally in a neighborhood $V_y$ of the point $y \in S$ in the form $x = \varphi(t)$, where $t \in V_t \subset \mathbb{R}^2$ and $\operatorname{rank}\varphi' = 2$, then

$$\int_{V_y} \frac{\nu(x)\,d\sigma(x)}{|x-y|} = \int_{V_t} \frac{\nu(\varphi(t))}{|y - \varphi(t)|}\sqrt{\det\Big\langle\frac{\partial\varphi}{\partial t^i}, \frac{\partial\varphi}{\partial t^j}\Big\rangle(t)}\,dt,$$

and, applying Proposition 5, we also verify that the integral (17.79) represents a function $U(y)$ that is continuous on the entire space $\mathbb{R}^3$.

¹⁰ Here we are assuming that the set $X$ itself is bounded in $\mathbb{R}^n$. Otherwise one must supplement inequality (17.77) with the analogous inequality in which the integral is taken over the set $\{x \in X \mid |x| > 1/\varepsilon\}$.

Outside the support of the charge, as already noted, the three-dimensional potential (17.75) and the single-layer potential (17.79) are infinitely differentiable. Carrying out this differentiation under the integral sign, we verify in a unified manner that outside the support of the charge the potential, like the function $1/|x - y|$, satisfies Laplace's equation $\Delta U = 0$ in $\mathbb{R}^3$, that is, it is a harmonic function in this domain.


17.5.4 * Convolution, the Fundamental Solution, and Generalized Functions in the Multidimensional Case

a. Convolution in $\mathbb{R}^n$

Definition 2 The convolution $u * v$ of real- or complex-valued functions $u$ and $v$ defined in $\mathbb{R}^n$ is defined by the relation

$$(u * v)(x) := \int_{\mathbb{R}^n} u(y)v(x - y)\,dy. \tag{17.80}$$

Example 6 Comparing formulas (17.75) and (17.80), we can conclude, for example, that the potential $U$ of a charge distributed in $\mathbb{R}^3$ with density $\mu(x)$ is the convolution $(\mu * E)$ of the function $\mu$ and the potential $E$ of a unit charge located at the origin of $\mathbb{R}^3$.

Relation (17.80) is a direct generalization of the definition of convolution given in Sect. 17.4. For that reason, all the properties of the convolution considered in Sect. 17.4 for the case $n = 1$ and their proofs remain valid if $\mathbb{R}$ is replaced by $\mathbb{R}^n$.

An approximate identity in $\mathbb{R}^n$ is defined just as in $\mathbb{R}$ with $\mathbb{R}$ replaced by $\mathbb{R}^n$ and $U(0)$ understood to be a neighborhood of the point $0 \in \mathbb{R}^n$ in $\mathbb{R}^n$.

The concept of uniform continuity of a function $f : G \to \mathbb{C}$ on a set $E \subset G$, and with it the basic Proposition 5 of Sect. 17.4 on convergence of the convolution $f * \Delta_\alpha$ to $f$, also carry over in all details to the multi-dimensional case.

We note only that in Example 3 and in the proof of Corollary 1 of Sect. 17.4, $x$ must be replaced by $|x|$ in the definition of the functions $\Delta_n(x)$ and $\varphi(x)$. Only minor changes are needed in the approximate identity given in Example 4 of Sect. 17.4 for the proof of the Weierstrass theorem on approximation of periodic functions by trigonometric polynomials. In this case it is a question of approximating a function $f(x^1, \ldots, x^n)$ that is continuous and periodic with periods $T_1, T_2, \ldots, T_n$ respectively in the variables $x^1, x^2, \ldots, x^n$.

The assertion amounts to the statement that for every $\varepsilon > 0$ one can exhibit a trigonometric polynomial in $n$ variables with the respective periods $T_1, T_2, \ldots, T_n$ that approximates $f$ on $\mathbb{R}^n$ within $\varepsilon$.

We confine ourselves to these remarks. An independent verification of the properties of the convolution (17.80) for $n \in \mathbb{N}$, which were proved for the case $n = 1$ in Sect. 17.4, will be an easy but useful exercise for the reader, helping to promote an adequate understanding of what was said in Sect. 17.4.


b. Generalized Functions of Several Variables 

We now take up certain multi-dimensional aspects of the concepts connected with 
generalized functions, which were introduced in Sect. 17.4. 

As before, let $C^{(\infty)}(G)$ and $C_0^{(\infty)}(G)$ denote respectively the set of infinitely differentiable functions in the domain $G \subset \mathbb{R}^n$ and the set of infinitely differentiable functions of compact support in $G$. If $G = \mathbb{R}^n$, we shall use the respective abbreviations $C^{(\infty)}$ and $C_0^{(\infty)}$. Let $m := (m_1, \ldots, m_n)$ be a multi-index and

$$\varphi^{(m)} := \Big(\frac{\partial}{\partial x^1}\Big)^{m_1}\cdots\Big(\frac{\partial}{\partial x^n}\Big)^{m_n}\varphi.$$

In $C_0^{(\infty)}(G)$ we introduce convergence of functions. As in Definition 7 of Sect. 17.4, we consider that $\varphi_k \to \varphi$ in $C_0^{(\infty)}(G)$ as $k \to \infty$ if the supports of all the functions of the sequence $\{\varphi_k\}$ are contained in one compact subset of $G$ and $\varphi_k^{(m)} \rightrightarrows \varphi^{(m)}$ on $G$ for every multi-index $m$ as $k \to \infty$, that is, the functions converge uniformly, and so do all of their partial derivatives.

Given this, we adopt the following definition.

Definition 3 The vector space $C_0^{(\infty)}(G)$ with this convergence is denoted $\mathcal{D}(G)$ (and simply $\mathcal{D}$ if $G = \mathbb{R}^n$) and is called the space of fundamental or test functions.

Continuous linear functionals on $\mathcal{D}(G)$ are called generalized functions or distributions. They form the vector space of generalized functions, denoted $\mathcal{D}'(G)$ (or $\mathcal{D}'$ when $G = \mathbb{R}^n$).

Convergence in $\mathcal{D}'(G)$, as in the one-dimensional case, is defined as weak (pointwise) convergence of functionals (see Definition 6 of Sect. 17.4).

The definition of a regular generalized function carries over verbatim to the multi-dimensional case.

The definitions of the $\delta$-function and the $\delta$-function shifted to the point $x_0 \in G$ (denoted $\delta(x_0)$, or more often, but not always happily, $\delta(x - x_0)$) also remain the same.

Now let us consider some examples. 
Example 7 Set

$$\Delta_t(x) := \frac{1}{(2a\sqrt{\pi t})^n}\,e^{-\frac{|x|^2}{4a^2t}},$$

where $a > 0$, $t > 0$, $x \in \mathbb{R}^n$. We shall show that these functions, regarded as regular distributions in $\mathbb{R}^n$, converge to the $\delta$-function on $\mathbb{R}^n$ as $t \to +0$.

For the proof it suffices to verify that the family of functions $\Delta_t$ is an approximate identity in $\mathbb{R}^n$ as $t \to +0$.

Using a change of variable, reduction of the multiple integral to an iterated integral, and the value of the Euler-Poisson integral, we find

$$\int_{\mathbb{R}^n} \Delta_t(x)\,dx = \frac{1}{(\sqrt{\pi})^n}\int_{\mathbb{R}^n} e^{-\left|\frac{x}{2a\sqrt{t}}\right|^2}\,d\Big(\frac{x}{2a\sqrt{t}}\Big) = \Big(\frac{1}{\sqrt{\pi}}\int_{-\infty}^{+\infty} e^{-u^2}\,du\Big)^n = 1.$$

Next, for any fixed value of $r > 0$ we have

$$\int_{B(0,r)} \Delta_t(x)\,dx = \frac{1}{(\sqrt{\pi})^n}\int_{B\left(0,\frac{r}{2a\sqrt{t}}\right)} e^{-|\xi|^2}\,d\xi \to 1,$$

as $t \to +0$.

Finally, taking account of the fact that $\Delta_t(x)$ is nonnegative, we conclude that these functions indeed constitute an approximate identity in $\mathbb{R}^n$.
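
The two properties verified in Example 7 can be checked numerically in the case $n = 1$. The sketch below is only an illustration: the midpoint rule, the function names, and the use of $\cos x$ as a stand-in for a test function (it is smooth and bounded but not compactly supported) are assumptions made here. For $n = 1$ the mass of $\Delta_t$ in $B(0,r)$ equals $\operatorname{erf}\big(r/(2a\sqrt{t})\big)$, and $\int \Delta_t(x)\cos x\,dx = e^{-a^2 t}$, which tends to $\cos 0 = 1$ as $t \to +0$.

```python
import math

def delta_t(x, t, a=1.0):
    """The kernel of Example 7 for n = 1: exp(-x^2 / (4 a^2 t)) / (2 a sqrt(pi t))."""
    return math.exp(-x * x / (4.0 * a * a * t)) / (2.0 * a * math.sqrt(math.pi * t))

def midpoint(g, lo, hi, n=40000):
    """Midpoint rule for the integral of g over [lo, hi]."""
    h = (hi - lo) / n
    return h * sum(g(lo + (k + 0.5) * h) for k in range(n))

def mass_in_ball(r, t, a=1.0):
    """Integral of delta_t over the interval B(0, r) = (-r, r)."""
    return midpoint(lambda x: delta_t(x, t, a), -r, r)

def pairing_with_cos(t, a=1.0, cutoff=10.0):
    """Integral of delta_t(x) * cos(x), truncated to [-cutoff, cutoff]."""
    return midpoint(lambda x: delta_t(x, t, a) * math.cos(x), -cutoff, cutoff)
```

As $t$ decreases, `mass_in_ball(r, t)` approaches 1 for any fixed $r > 0$ and `pairing_with_cos(t)` approaches the value of the function at the origin, which is the behaviour expected of an approximate identity.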

Example 8 A generalization of the $\delta$-function (corresponding, for example, to a unit charge located at the origin in $\mathbb{R}^n$) is the following generalized function $\delta_S$ (corresponding to a distribution of charge over a piecewise-smooth surface $S$ with unit surface density). The effect of $\delta_S$ on the function $\varphi \in \mathcal{D}$ is defined by the relation

$$\langle \delta_S, \varphi\rangle := \int_S \varphi(x)\,d\sigma.$$

Like the distribution $\delta$, the distribution $\delta_S$ is not a regular generalized function.

Multiplication of a distribution by a function in $\mathcal{D}$ is defined in $\mathbb{R}^n$ just as in the one-dimensional case.

Example 9 If $\mu \in \mathcal{D}$, then $\mu\delta_S$ is a generalized function acting according to the rule

$$\langle \mu\delta_S, \varphi\rangle = \int_S \varphi(x)\mu(x)\,d\sigma. \tag{17.81}$$

If the function $\mu(x)$ were defined only on the surface $S$, Eq. (17.81) could be regarded as the definition of the generalized function $\mu\delta_S$. By natural analogy, the generalized function introduced in this way is called a single layer on the surface $S$ with density $\mu$.

Differentiation of generalized functions in the multi-dimensional case is defined 
by the same principle as in the one-dimensional case, but has a few peculiarities. 



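If $F \in \mathcal{D}'(G)$ and $G \subset \mathbb{R}^n$, the generalized function $\frac{\partial F}{\partial x^i}$ is defined by the relation

$$\Big\langle \frac{\partial F}{\partial x^i}, \varphi\Big\rangle := -\Big\langle F, \frac{\partial\varphi}{\partial x^i}\Big\rangle.$$

It follows that

$$\langle F^{(m)}, \varphi\rangle = (-1)^{|m|}\langle F, \varphi^{(m)}\rangle, \tag{17.82}$$

where $m = (m_1, \ldots, m_n)$ is a multi-index and $|m| = \sum_{i=1}^{n} m_i$.

It is natural to verify the relation $\frac{\partial^2 F}{\partial x^i\partial x^j} = \frac{\partial^2 F}{\partial x^j\partial x^i}$. But that follows from the equality of the right-hand sides in the relations

$$\Big\langle \frac{\partial^2 F}{\partial x^i\partial x^j}, \varphi\Big\rangle = \Big\langle F, \frac{\partial^2\varphi}{\partial x^j\partial x^i}\Big\rangle, \qquad \Big\langle \frac{\partial^2 F}{\partial x^j\partial x^i}, \varphi\Big\rangle = \Big\langle F, \frac{\partial^2\varphi}{\partial x^i\partial x^j}\Big\rangle,$$

which follows from the classical equality $\frac{\partial^2\varphi}{\partial x^i\partial x^j} = \frac{\partial^2\varphi}{\partial x^j\partial x^i}$, which holds for every function $\varphi \in \mathcal{D}$.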

Example 10 Now consider an operator $D = \sum_m a_m D^m$, where $m = (m_1, \ldots, m_n)$ is a multi-index, $D^m = \big(\frac{\partial}{\partial x^1}\big)^{m_1}\cdots\big(\frac{\partial}{\partial x^n}\big)^{m_n}$, $a_m$ are numerical coefficients, and the sum extends over a finite set of multi-indices. This is a differential operator.

The transpose or adjoint of $D$ is the operator usually denoted ${}^tD$ or $D^*$ and defined by the relation

$$\langle DF, \varphi\rangle =: \langle F, {}^tD\varphi\rangle,$$

which must hold for all $\varphi \in \mathcal{D}$ and $F \in \mathcal{D}'$. Starting from Eq. (17.82), we can now write the explicit formula

$${}^tD = \sum_m (-1)^{|m|} a_m D^m$$

for the adjoint of the differential operator $D$.

In particular, if all the values of $|m|$ are even, the operator $D$ is self-adjoint, that is, ${}^tD = D$.

It is clear that the operation of differentiation in $\mathcal{D}'(\mathbb{R}^n)$ preserves all the properties of differentiation in $\mathcal{D}'(\mathbb{R})$. However, let us consider the following important example, which is specific to the multi-dimensional case.

Example 11 Let $S$ be a smooth $(n-1)$-dimensional submanifold of $\mathbb{R}^n$, that is, $S$ is a smooth hypersurface. Assume that the function $f$ defined on $\mathbb{R}^n \setminus S$ is infinitely differentiable and that all its partial derivatives have a limit at every point $x \in S$ under one-sided approach to $x$ from either (local) side of the surface $S$.

The difference between these two limits will be the jump $\smallint\frac{\partial f}{\partial x^i}$ of the partial derivative under consideration at the point $x$ corresponding to a particular direction of passage across the surface $S$ at $x$. The sign of the jump changes if that direction is reversed. The jump can thus be regarded as a function defined on an oriented surface if, for example, we make the convention that the direction of passage is given by an orienting normal to the surface.

The function $\frac{\partial f}{\partial x^i}$ is defined, continuous, and locally bounded outside $S$, and by the assumptions just made $f$ is locally ultimately bounded upon approach to the surface $S$ itself. Since $S$ is a submanifold of $\mathbb{R}^n$, no matter how we complete the definition of $\frac{\partial f}{\partial x^i}$ on $S$, we obtain a function with possible discontinuities only on $S$, and hence locally integrable in $\mathbb{R}^n$. But integrable functions that differ on a set of measure zero have equal integrals, and therefore, without worrying about the values on $S$, we may assume that $\frac{\partial f}{\partial x^i}$ generates some regular generalized function $\big\{\frac{\partial f}{\partial x^i}\big\}$ according to the rule

$$\Big\langle \Big\{\frac{\partial f}{\partial x^i}\Big\}, \varphi\Big\rangle = \int_{\mathbb{R}^n} \Big(\frac{\partial f}{\partial x^i}\cdot\varphi\Big)(x)\,dx.$$

We shall now show that if $f$ is regarded as a generalized function, then the following important formula holds in the sense of differentiation of generalized functions:

$$\frac{\partial f}{\partial x^i} = \Big\{\frac{\partial f}{\partial x^i}\Big\} + (\smallint f)_S\cos\alpha_i\,\delta_S, \tag{17.83}$$

where the last term is understood in the sense of Eq. (17.81), $(\smallint f)_S$ is the jump of the function $f$ at $x \in S$ corresponding to either of the two possible directions of the unit normal $\mathbf{n}$ to $S$ at $x$, and $\cos\alpha_i$ is the projection of $\mathbf{n}$ onto the $x^i$-axis (that is, $\mathbf{n} = (\cos\alpha_1, \ldots, \cos\alpha_n)$).


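Proof Formula (17.83) generalizes Eq. (17.64), which we use to derive it. For definiteness we consider the case $i = 1$. Then

$$\Big\langle \frac{\partial f}{\partial x^1}, \varphi\Big\rangle := -\Big\langle f, \frac{\partial\varphi}{\partial x^1}\Big\rangle = -\int_{\mathbb{R}^n}\Big(f\cdot\frac{\partial\varphi}{\partial x^1}\Big)(x)\,dx = -\int\cdots\int_{x^2\cdots x^n} dx^2\cdots dx^n \int_{-\infty}^{+\infty} f\,\frac{\partial\varphi}{\partial x^1}\,dx^1 =$$

$$= \int\cdots\int_{x^2\cdots x^n} dx^2\cdots dx^n\Big[(\smallint f)\varphi + \int_{-\infty}^{+\infty}\frac{\partial f}{\partial x^1}\varphi\,dx^1\Big] = \int_{\mathbb{R}^n}\frac{\partial f}{\partial x^1}\varphi\,dx + \int\cdots\int_{x^2\cdots x^n}(\smallint f)\varphi\,dx^2\cdots dx^n.$$

Here the jump $\smallint f$ of $f$ is taken at the point $x = (x^1, x^2, \ldots, x^n) \in S$ as one passes through the surface at that point in the direction of the positive $x^1$-axis. The value of the function $\varphi$ in computing the product $(\smallint f)\varphi$ is taken at the same point. Hence, this last integral can be written as a surface integral of first kind

$$\int_S (\smallint f)\varphi\cos\alpha_1\,d\sigma,$$

where $\alpha_1$ is the angle between the direction of the positive $x^1$-axis and the normal to $S$ at $x$, directed so that in passing through $x$ in the direction of that normal the function $f$ has precisely the jump $\smallint f$. This means only that $\cos\alpha_1 \ge 0$. It remains only to remark that if we choose the other direction for the normal, the sign of the jump and the sign of the cosine would both reverse simultaneously; hence the product $(\smallint f)\cos\alpha_1$ does not change. □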

Remark 1 As can be seen from this proof, formula (17.83) holds once the jump $(\smallint f)_S$ of $f$ is defined at each point $x \in S$, and a locally integrable partial derivative $\frac{\partial f}{\partial x^i}$ exists outside of $S$ in $\mathbb{R}^n$, perhaps as an improper integral generating a regular generalized function $\big\{\frac{\partial f}{\partial x^i}\big\}$.

Remark 2 At points $x \in S$ at which the direction of the $x^1$-axis is not transversal to $S$, that is, it is tangent to $S$, difficulties may arise in the definition of the jump $\smallint f$ in the given direction. But it can be seen from the proof of (17.83) that its last term is obtained from the integral

$$\int\cdots\int_{x^2\cdots x^n}(\smallint f)\varphi\,dx^2\cdots dx^n.$$

The projection of the set $E$ of such points on the $x^2, \ldots, x^n$-hyperplane has $(n-1)$-dimensional measure zero and therefore has no effect on the value of the integral. Hence we can regard the formula (17.83) as having meaning and being valid always if $(\smallint f)_S\cos\alpha_i$ is given the value 0 when $\cos\alpha_i = 0$.

Remark 3 Similar considerations make it possible to neglect sets of area zero; therefore one can regard formula (17.83) as proved for piecewise-smooth surfaces.

As our next example we shall show how the classical Gauss-Ostrogradskii for- 
mula can be obtained directly from the differential relation (17.83), and in a form 
that is maximally free of the extra analytic requirements that we informed the reader 
of previously. 

Example 12 Let $G$ be a finite domain in $\mathbb{R}^n$ bounded by a piecewise-smooth surface $S$. Let $\mathbf{A} = (A^1, \ldots, A^n)$ be a vector field that is continuous in $\overline{G}$ and such that the function $\operatorname{div}\mathbf{A} = \sum_{i=1}^{n}\frac{\partial A^i}{\partial x^i}$ is defined in $G$ and integrable on $G$, possibly in the improper sense.

If we regard the field $\mathbf{A}$ as zero outside $\overline{G}$, then the jump of this field at each point $x$ of the boundary $S$ of the domain $G$ when leaving $G$ is $-\mathbf{A}(x)$. Assuming that $\mathbf{n}$ is a unit outward normal vector to $S$, applying formula (17.83) to each component $A^i$ of the field $\mathbf{A}$ and summing these equalities, we arrive at the relation

$$\operatorname{div}\mathbf{A} = \{\operatorname{div}\mathbf{A}\} - (\mathbf{A}\cdot\mathbf{n})\delta_S, \tag{17.84}$$

in which $\mathbf{A}\cdot\mathbf{n}$ is the inner product of the vectors $\mathbf{A}$ and $\mathbf{n}$ at the corresponding point $x \in S$.

Relation (17.84) is an equality of generalized functions. Let us apply it to the function $\psi \in C_0^{(\infty)}$ equal to 1 on $\overline{G}$ (the existence and construction of such a function has been discussed more than once previously). Since for every function $\varphi \in \mathcal{D}$

$$\langle \operatorname{div}\mathbf{A}, \varphi\rangle = -\int_{\mathbb{R}^n}(\mathbf{A}\cdot\nabla\varphi)\,dx \tag{17.85}$$

(which follows immediately from the definition of the derivative of a generalized function), for the field $\mathbf{A}$ and the function $\psi$ we obviously have $\langle \operatorname{div}\mathbf{A}, \psi\rangle = 0$. But, when we take account of Eq. (17.84), this gives the relation

$$0 = \langle\{\operatorname{div}\mathbf{A}\}, \psi\rangle - \langle(\mathbf{A}\cdot\mathbf{n})\delta_S, \psi\rangle,$$

which in classical notation

$$0 = \int_G \operatorname{div}\mathbf{A}\,dx - \int_S (\mathbf{A}\cdot\mathbf{n})\,d\sigma \tag{17.86}$$

is the same as the Gauss-Ostrogradskii formula.

Let us now consider several important examples connected with differentiation 
of generalized functions. 

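Example 13 We consider the vector field $\mathbf{A} = \frac{x}{|x|^3}$ defined in $\mathbb{R}^3\setminus 0$ and show that in the space $\mathcal{D}'(\mathbb{R}^3)$ of generalized functions we have the equality

$$\operatorname{div}\frac{x}{|x|^3} = 4\pi\delta. \tag{17.87}$$

We remark first that for $x \ne 0$ we have $\operatorname{div}\frac{x}{|x|^3} = 0$ in the classical sense.

Now, using successively the definition of $\operatorname{div}\mathbf{A}$ in the form (17.85), the definition of an improper integral, the equality $\operatorname{div}\frac{x}{|x|^3} = 0$ for $x \ne 0$, the Gauss-Ostrogradskii formula (17.86), and the fact that $\varphi$ has compact support, we obtain

$$\Big\langle \operatorname{div}\frac{x}{|x|^3}, \varphi\Big\rangle = -\int_{\mathbb{R}^3}\Big(\frac{x}{|x|^3}\cdot\nabla\varphi(x)\Big)\,dx = \lim_{\varepsilon\to+0}\Big(-\int_{\varepsilon<|x|<1/\varepsilon}\Big(\frac{x}{|x|^3}\cdot\nabla\varphi(x)\Big)\,dx\Big) =$$

$$= \lim_{\varepsilon\to+0}\int_{|x|=\varepsilon}\varphi(x)\frac{(x\cdot\mathbf{n})}{|x|^3}\,d\sigma = 4\pi\varphi(0) = \langle 4\pi\delta, \varphi\rangle.$$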
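
Relation (17.87) together with the Gauss-Ostrogradskii formula implies that the flux of $\frac{x}{|x|^3}$ through any closed piecewise-smooth surface enclosing the origin equals $4\pi$. The sketch below checks this for the surface of the cube $[-1,1]^3$, which is a choice made here purely for illustration, as are the function name and the midpoint rule. On the face $x^1 = 1$ the normal component of the field is $(1 + u^2 + v^2)^{-3/2}$ with $u = x^2$, $v = x^3$, and by symmetry all six faces carry the same flux.

```python
import math

def flux_through_cube_face(n=400):
    """Flux of A = x/|x|^3 through the face x1 = 1 of the cube [-1, 1]^3.

    On that face A.n = 1 / (1 + u^2 + v^2)^(3/2); the double integral over
    (u, v) in [-1, 1]^2 is evaluated by the midpoint rule.
    """
    h = 2.0 / n
    total = 0.0
    for i in range(n):
        u = -1.0 + (i + 0.5) * h
        for j in range(n):
            v = -1.0 + (j + 0.5) * h
            total += (1.0 + u * u + v * v) ** -1.5
    return total * h * h

def total_flux_through_cube(n=400):
    """Sum over the six congruent faces."""
    return 6.0 * flux_through_cube_face(n)
```

The total agrees with $4\pi$ to within the quadrature error, the same value that the small sphere $|x| = \varepsilon$ yields in the limit computed above.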

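For the operator $A : \mathcal{D}'(G) \to \mathcal{D}'(G)$, as before, we define a fundamental solution to be a generalized function $E \in \mathcal{D}'(G)$ for which $A(E) = \delta$.

Example 14 We verify that the regular generalized function $E(x) = -\frac{1}{4\pi|x|}$ in $\mathcal{D}'(\mathbb{R}^3)$ is a fundamental solution of the Laplacian $\Delta = \big(\frac{\partial}{\partial x^1}\big)^2 + \big(\frac{\partial}{\partial x^2}\big)^2 + \big(\frac{\partial}{\partial x^3}\big)^2$.

Indeed, $\Delta = \operatorname{div}\operatorname{grad}$, and $\operatorname{grad}E(x) = \frac{x}{4\pi|x|^3}$ for $x \ne 0$, and therefore the equality $\operatorname{div}\operatorname{grad}E = \delta$ follows from relation (17.87).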

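As in Example 13, one can verify that for any $n \in \mathbb{N}$, $n \ge 2$, we have the following relation in $\mathbb{R}^n$:

$$\operatorname{div}\frac{x}{|x|^n} = \sigma_n\delta, \tag{17.87'}$$

where $\sigma_n = \frac{2\pi^{n/2}}{\Gamma(n/2)}$ is the area of the unit sphere in $\mathbb{R}^n$.

Hence we can conclude upon taking account of the relation $\Delta = \operatorname{div}\operatorname{grad}$ that

$$\Delta\ln|x| = 2\pi\delta \quad\text{in } \mathbb{R}^2$$

and

$$\Delta\frac{1}{|x|^{n-2}} = -(n-2)\sigma_n\delta \quad\text{in } \mathbb{R}^n,\ n > 2.$$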
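
The constants and the harmonicity away from the origin in the relations above are easy to check numerically. The following sketch is illustrative only (the function names and the finite-difference step are assumptions): it evaluates $\sigma_n$ with the Gamma function and applies a second-order difference approximation of the Laplacian to $\ln|x|$ for $n = 2$ and to $|x|^{2-n}$ for $n > 2$ at a point $x \ne 0$. It does not, of course, detect the $\delta$-term concentrated at the origin.

```python
import math

def sphere_area(n):
    """Area sigma_n = 2 pi^(n/2) / Gamma(n/2) of the unit sphere in R^n."""
    return 2.0 * math.pi ** (n / 2.0) / math.gamma(n / 2.0)

def newton_kernel(x):
    """ln|x| for n = 2 and |x|^(2-n) for n > 2, where n = len(x) and x != 0."""
    n = len(x)
    r = math.sqrt(sum(c * c for c in x))
    return math.log(r) if n == 2 else r ** (2 - n)

def laplacian_fd(f, x, h=1e-3):
    """Central second-difference approximation of the Laplacian of f at the point x."""
    total = 0.0
    for i in range(len(x)):
        xp = list(x)
        xm = list(x)
        xp[i] += h
        xm[i] -= h
        total += f(xp) - 2.0 * f(x) + f(xm)
    return total / (h * h)
```

For example, `sphere_area(2)` and `sphere_area(3)` reproduce the familiar values $2\pi$ and $4\pi$, and `laplacian_fd(newton_kernel, x)` is close to zero at any point away from the origin.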


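Example 15 Let us verify that the function

$$E(x, t) = \frac{H(t)}{(2a\sqrt{\pi t})^n}\,e^{-\frac{|x|^2}{4a^2t}},$$

where $x \in \mathbb{R}^n$, $t \in \mathbb{R}$, and $H$ is the Heaviside function (that is, we set $E(x, t) = 0$ when $t < 0$), satisfies the equation

$$\Big(\frac{\partial}{\partial t} - a^2\Delta\Big)E = \delta.$$

Here $\Delta$ is the Laplacian with respect to $x$ in $\mathbb{R}^n$, and $\delta = \delta(x, t)$ is the $\delta$-function in $\mathbb{R}^n_x \times \mathbb{R}_t = \mathbb{R}^{n+1}$.

When $t > 0$, the function $E$ is infinitely differentiable, and by direct differentiation we verify that

$$\Big(\frac{\partial}{\partial t} - a^2\Delta\Big)E = 0 \quad\text{when } t > 0.$$

Taking this fact into account along with the result of Example 7, we obtain for any function $\varphi \in \mathcal{D}(\mathbb{R}^{n+1})$

$$\Big\langle\Big(\frac{\partial}{\partial t} - a^2\Delta\Big)E, \varphi\Big\rangle = -\Big\langle E, \Big(\frac{\partial}{\partial t} + a^2\Delta\Big)\varphi\Big\rangle = -\int_0^{+\infty} dt\int_{\mathbb{R}^n} E(x, t)\Big(\frac{\partial\varphi}{\partial t} + a^2\Delta\varphi\Big)\,dx =$$

$$= -\lim_{\varepsilon\to+0}\int_\varepsilon^{+\infty} dt\int_{\mathbb{R}^n} E(x, t)\Big(\frac{\partial\varphi}{\partial t} + a^2\Delta\varphi\Big)\,dx =$$

$$= \lim_{\varepsilon\to+0}\Big[\int_{\mathbb{R}^n} E(x, \varepsilon)\varphi(x, \varepsilon)\,dx + \int_\varepsilon^{+\infty} dt\int_{\mathbb{R}^n}\Big(\frac{\partial E}{\partial t} - a^2\Delta E\Big)\varphi\,dx\Big] =$$

$$= \lim_{\varepsilon\to+0}\Big[\int_{\mathbb{R}^n} E(x, \varepsilon)\varphi(x, 0)\,dx + \int_{\mathbb{R}^n} E(x, \varepsilon)\big(\varphi(x, \varepsilon) - \varphi(x, 0)\big)\,dx\Big] =$$

$$= \lim_{\varepsilon\to+0}\int_{\mathbb{R}^n} E(x, \varepsilon)\varphi(x, 0)\,dx = \varphi(0, 0) = \langle\delta, \varphi\rangle.$$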


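Example 16 Let us show that the function

$$E(x, t) = \frac{1}{2a}H(at - |x|),$$

where $a > 0$, $x \in \mathbb{R}^1_x$, $t \in \mathbb{R}^1_t$, and $H$ is the Heaviside function, satisfies the equation

$$\Big(\frac{\partial^2}{\partial t^2} - a^2\frac{\partial^2}{\partial x^2}\Big)E = \delta,$$

in which $\delta = \delta(x, t)$ is the $\delta$-function in the space $\mathcal{D}'(\mathbb{R}^1_x \times \mathbb{R}^1_t) = \mathcal{D}'(\mathbb{R}^2)$.

Let $\varphi \in \mathcal{D}(\mathbb{R}^2)$. Using the abbreviation $\Box_a := \frac{\partial^2}{\partial t^2} - a^2\frac{\partial^2}{\partial x^2}$, we find

$$\langle\Box_a E, \varphi\rangle = \langle E, \Box_a\varphi\rangle = \int_{\mathbb{R}} dx\int_{\mathbb{R}} E(x, t)\Box_a\varphi(x, t)\,dt =$$

$$= \frac{1}{2a}\int_{-\infty}^{+\infty} dx\int_{|x|/a}^{+\infty}\frac{\partial^2\varphi}{\partial t^2}\,dt - \frac{a}{2}\int_0^{+\infty} dt\int_{-at}^{at}\frac{\partial^2\varphi}{\partial x^2}\,dx =$$

$$= -\frac{1}{2a}\int_{-\infty}^{+\infty}\frac{\partial\varphi}{\partial t}\Big(x, \frac{|x|}{a}\Big)\,dx - \frac{a}{2}\int_0^{+\infty}\Big[\frac{\partial\varphi}{\partial x}(at, t) - \frac{\partial\varphi}{\partial x}(-at, t)\Big]\,dt =$$

$$= -\frac{1}{2}\int_0^{+\infty}\frac{d}{dt}\big[\varphi(at, t)\big]\,dt - \frac{1}{2}\int_0^{+\infty}\frac{d}{dt}\big[\varphi(-at, t)\big]\,dt = \frac{1}{2}\varphi(0, 0) + \frac{1}{2}\varphi(0, 0) = \varphi(0, 0) = \langle\delta, \varphi\rangle.$$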


In Sect. 17.4 we have discussed in detail the role of the system function of the operator and the role of the convolution in the problem of determining the input $u$ from the output $\tilde{u}$ of a translation-invariant linear operator $Au = \tilde{u}$. Everything that has been discussed on that score carries over to the multi-dimensional case without any changes. Hence, if we know the fundamental solution $E$ of the operator $A$, that is, if $AE = \delta$, then one can present the solution $u$ of the equation $Au = f$ as the convolution $u = f * E$.

Example 17 Using the function E(x, t) of Example 16, one can thus present the 
solution 



of the equation

$$\frac{\partial^2 u}{\partial t^2} - a^2\frac{\partial^2 u}{\partial x^2} = f,$$

which is the convolution $f * E$ of the functions $f$ and $E$ and necessarily exists
under the assumption, for example, that the function $f$ is continuous. By direct
differentiation of the resulting integral with respect to the parameters, one can easily
verify that $u(x,t)$ is indeed a solution of the equation $\square_a u = f$.

Example 18 Similarly, on the basis of the result of Example 15 we find the solution 



of the equation $\frac{\partial u}{\partial t} - a^2\Delta u = f$, for example, under the assumption that the function
$f$ is continuous and bounded, which guarantees the existence of the convolution
$f * E$. We note that these assumptions are made only for example, and are far from
obligatory. Thus, from the point of view of generalized functions one could pose
the question of the solution of the equation $\frac{\partial u}{\partial t} - a^2\Delta u = f$ taking as $f(x,t)$ the
generalized function $\varphi(x)\cdot\delta(t)$, where $\varphi \in \mathcal{D}(\mathbb{R}^n)$ and $\delta \in \mathcal{D}'(\mathbb{R})$.

The formal substitution of such a function $f$ under the integral sign leads to the
relation

$$u(x,t) = \int_{\mathbb{R}^n} \varphi(\xi)E(x-\xi,t)\,d\xi = (2a\sqrt{\pi t})^{-n}\int_{\mathbb{R}^n} \varphi(\xi)\,e^{-\frac{|x-\xi|^2}{4a^2t}}\,d\xi.$$

Applying the rule for differentiating an integral depending on a parameter one
can verify that this function is a solution of the equation $\frac{\partial u}{\partial t} - a^2\Delta u = 0$ for $t > 0$.
We note that $u(x,t) \to \varphi(x)$ as $t \to +0$. This follows from the result of Example 7,
where it was established that the family of functions encountered here is an
approximate identity.

Example 19 Finally, recalling the fundamental solution of the Laplace operator
obtained in Example 14, we find the solution

$$u(x) = \int_{\mathbb{R}^3} \frac{f(\xi)\,d\xi}{|x-\xi|}$$

of the Poisson equation $\Delta u = -4\pi f$, which up to notation and relabeling is the
same as the potential (17.75) for a charge distributed with density $f$, which we
considered earlier.

If the function $f$ is taken as $\nu(x)\delta_S$, where $S$ is a piecewise smooth surface in
$\mathbb{R}^3$, formal substitution into the integral leads to the function

$$u(x) = \int_S \frac{\nu(\xi)\,d\sigma(\xi)}{|x-\xi|},$$

which, as we know, is a single-layer potential; more precisely, the potential of a
charge distributed over the surface $S \subset \mathbb{R}^3$ with surface density $\nu(x)$.


17.5.5 Problems and Exercises 

1. a) Reasoning as in Example 3, where the continuity of the three-dimensional 
potential (17.75) was established, show that the single-layer potential (17.79) is 
continuous. 

b) Verify the full proof of Propositions 4 and 5. 

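2. a) Show that for every set $M \subset \mathbb{R}^n$ and every $\varepsilon > 0$ one can construct a function
$f$ of class $C^{(\infty)}(\mathbb{R}^n, \mathbb{R})$ satisfying the following three conditions simultaneously:
$\forall x \in \mathbb{R}^n$ $(0 \le f(x) \le 1)$; $\forall x \in M$ $(f(x) = 1)$; $\operatorname{supp} f \subset M_\varepsilon$, where $M_\varepsilon$ is the
$\varepsilon$-blowup (that is, the $\varepsilon$-neighborhood) of the set $M$.

b) Prove that for every closed set $M$ in $\mathbb{R}^n$ there exists a nonnegative function
$f \in C^{(\infty)}(\mathbb{R}^n, \mathbb{R})$ such that $(f(x) = 0) \Leftrightarrow (x \in M)$.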

3. a) Solve Problems 6 and 7 of Sect. 17.4 in the context of a space $\mathbb{R}^n$ of arbitrary
dimension.

b) Show that the generalized function $\delta_S$ (single layer) is not regular.

4. Using convolution, prove the following versions of the Weierstrass approximation
theorem.

a) Any continuous function $f : I \to \mathbb{R}$ on a compact $n$-dimensional interval
$I \subset \mathbb{R}^n$ can be uniformly approximated by an algebraic polynomial in $n$ variables.

b) The preceding assertion remains valid even if $I$ is replaced by an arbitrary
compact set $K \subset \mathbb{R}^n$ and we assume that $f \in C(K, \mathbb{C})$.

c) For every open set $G \subset \mathbb{R}^n$ and every function $f \in C^{(m)}(G, \mathbb{R})$ there exists
a sequence $\{P_k\}$ of algebraic polynomials in $n$ variables such that $P_k^{(\alpha)} \rightrightarrows f^{(\alpha)}$ on
each compact set $K \subset G$ as $k \to \infty$ for every multi-index $\alpha = (\alpha_1, \ldots, \alpha_n)$ such
that $|\alpha| \le m$.

d) If $G$ is a bounded open subset of $\mathbb{R}^n$ and $f \in C^{(\infty)}(\overline{G}, \mathbb{R})$, there exists a
sequence $\{P_k\}$ of algebraic polynomials in $n$ variables such that $P_k^{(\alpha)} \rightrightarrows f^{(\alpha)}$ on $\overline{G}$ for
every $\alpha = (\alpha_1, \ldots, \alpha_n)$ as $k \to \infty$.

e) Every periodic function $f \in C(\mathbb{R}^n, \mathbb{R})$ with periods $T_1, T_2, \ldots, T_n$ in the
variables $x^1, \ldots, x^n$ can be uniformly approximated in $\mathbb{R}^n$ by trigonometric
polynomials in $n$ variables having the same periods $T_1, T_2, \ldots, T_n$ in the corresponding
variables.

5. This problem contains further information on the averaging action of convolu- 
tion. 

a) Previously we obtained the integral Minkowski inequality

$$\Bigl(\int_X |a(x) + b(x)|^p\,dx\Bigr)^{1/p} \le \Bigl(\int_X |a|^p(x)\,dx\Bigr)^{1/p} + \Bigl(\int_X |b|^p(x)\,dx\Bigr)^{1/p}$$

for $p \ge 1$ on the basis of the numerical Minkowski inequality.

The integral inequality in turn enables us to predict the following generalized
integral Minkowski inequality:

$$\Bigl(\int_X \Bigl|\int_Y f(x,y)\,dy\Bigr|^p dx\Bigr)^{1/p} \le \int_Y \Bigl(\int_X |f|^p(x,y)\,dx\Bigr)^{1/p} dy.$$

Prove this inequality, assuming that $p \ge 1$, that $X$ and $Y$ are measurable subsets
(for example, intervals in $\mathbb{R}^m$ and $\mathbb{R}^n$ respectively), and that the right-hand side of
the inequality is finite.

b) By applying the generalized Minkowski inequality to the convolution $f * g$,
show that the relation $\|f * g\|_p \le \|f\|_1 \cdot \|g\|_p$ holds for $p \ge 1$, where, as always,
$\|u\|_p = \bigl(\int_{\mathbb{R}^n} |u|^p(x)\,dx\bigr)^{1/p}$.

c) Let $\varphi \in C_0^{(\infty)}(\mathbb{R}^n, \mathbb{R})$ with $0 \le \varphi(x) \le 1$ on $\mathbb{R}^n$ and $\int_{\mathbb{R}^n} \varphi(x)\,dx = 1$. Assume
that $\varphi_\varepsilon(x) := \frac{1}{\varepsilon^n}\varphi\bigl(\frac{x}{\varepsilon}\bigr)$ and $f_\varepsilon := f * \varphi_\varepsilon$ for $\varepsilon > 0$. Show that if $f \in \mathcal{R}_p(\mathbb{R}^n)$ (that is,
if the integral $\int_{\mathbb{R}^n} |f|^p(x)\,dx$ exists), then $f_\varepsilon \in C^{(\infty)}(\mathbb{R}^n, \mathbb{R})$ and $\|f_\varepsilon\|_p \le \|f\|_p$.

We note that the function $f_\varepsilon$ is often called the average of the function $f$ with
kernel $\varphi_\varepsilon$.

d) Preserving the preceding notation, verify that the relation

$$\|f_\varepsilon - f\|_{p,I} \le \sup_{|h| < \varepsilon} \|\tau_h f - f\|_{p,I}$$

holds on every interval $I \subset \mathbb{R}^n$, where $\|u\|_{p,I} = \bigl(\int_I |u|^p(x)\,dx\bigr)^{1/p}$ and $\tau_h f(x) = f(x - h)$.

e) Show that if $f \in \mathcal{R}_p(\mathbb{R}^n)$, then $\|\tau_h f - f\|_{p,I} \to 0$ as $h \to 0$.

f) Prove that $\|f_\varepsilon\|_p \le \|f\|_p$ and $\|f_\varepsilon - f\|_p \to 0$ as $\varepsilon \to +0$ for every function
$f \in \mathcal{R}_p(\mathbb{R}^n)$, $p \ge 1$.

g) Let $\mathcal{R}_p(G)$ be the normed vector space of functions that are absolutely
integrable on the open set $G \subset \mathbb{R}^n$ with the norm $\|\ \|_{p,G}$. Show that the functions of
class $C^{(\infty)}(G) \cap \mathcal{R}_p(G)$ form an everywhere-dense subset of $\mathcal{R}_p(G)$ and the same
is true for the set $C_0^{(\infty)}(G) \cap \mathcal{R}_p(G)$.

h) The following proposition can be compared with the case $p = \infty$ in the
preceding problem: Every continuous function on $G$ can be uniformly approximated
on $G$ by functions of class $C^{(\infty)}(G)$.

i) If $f$ is a $T$-periodic locally absolutely integrable function on $\mathbb{R}$, then, setting
$\|f\|_{p,T} = \bigl(\int_a^{a+T} |f|^p(x)\,dx\bigr)^{1/p}$, we shall denote the vector space with this norm
by $\mathcal{R}_p^T$. Prove that $\|f_\varepsilon - f\|_{p,T} \to 0$ as $\varepsilon \to +0$.

j) Using the fact that the convolution of two functions, one of which is periodic,
is itself periodic, show that the smooth periodic functions of class $C^{(\infty)}$ are
everywhere dense in $\mathcal{R}_p^T$.

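6. a) Preserving the notation of Example 11 and using formula (17.83), verify that
if $f \in C^{(1)}(\mathbb{R}^n \setminus S)$, then

$$\frac{\partial^2 f}{\partial x^i\,\partial x^j} = \Bigl\{\frac{\partial^2 f}{\partial x^i\,\partial x^j}\Bigr\}
+ \frac{\partial}{\partial x^j}\bigl((\smallint f)_S \cos\alpha_i\,\delta_S\bigr)
+ \Bigl(\smallint \frac{\partial f}{\partial x^i}\Bigr)_S \cos\alpha_j\,\delta_S.$$

b) Show that the sum $\sum_{i=1}^n \bigl(\smallint \frac{\partial f}{\partial x^i}\bigr)_S \cos\alpha_i$ equals the jump $\bigl(\smallint \frac{\partial f}{\partial n}\bigr)_S$ of the
normal derivative of the function $f$ at the corresponding point $x \in S$, this jump being
independent of the direction of the normal and equal to the sum $\bigl(\frac{\partial f}{\partial n_1} + \frac{\partial f}{\partial n_2}\bigr)(x)$ of
the normal derivatives of $f$ at the point $x$ from the two sides of the surface $S$.

c) Verify the relation

$$\Delta f = \{\Delta f\} + \Bigl(\smallint \frac{\partial f}{\partial n}\Bigr)_S \delta_S + \frac{\partial}{\partial n}\bigl((\smallint f)_S\,\delta_S\bigr),$$

where $\frac{\partial}{\partial n}$ is the normal derivative, that is, $\bigl\langle \frac{\partial F}{\partial n}, \varphi\bigr\rangle := -\bigl\langle F, \frac{\partial\varphi}{\partial n}\bigr\rangle$, and $(\smallint f)_S$ is the
jump of the function $f$ at the point $x \in S$ in the direction of the normal $n$.

d) Using the expression just obtained for $\Delta f$, prove the classical Green's formula

$$\int_G (f\Delta\varphi - \varphi\Delta f)\,dx = \int_S \Bigl(f\frac{\partial\varphi}{\partial n} - \varphi\frac{\partial f}{\partial n}\Bigr)\,d\sigma$$

under the assumption that $G$ is a finite domain in $\mathbb{R}^n$ bounded by a piecewise-smooth
surface $S$; $f$ and $\varphi$ belong to $C^{(1)}(\overline{G}) \cap C^{(2)}(G)$, and the integral on the
left-hand side exists, possibly as an improper integral.

e) Show that if the $\delta$-function corresponds to a unit charge located at the origin
0 in $\mathbb{R}^n$, and the function $-\frac{\partial\delta}{\partial x^1}$ corresponds to a dipole with electric moment $+1$
located at 0 and oriented along the $x^1$-axis (see Problem 11e) of Sect. 17.4) and
the function $\nu(x)\delta_S$ is the single layer corresponding to a charge distribution over
the surface $S$ with surface density $\nu(x)$, then the function $-\frac{\partial}{\partial n}(\nu(x)\delta_S)$, called the
double layer, corresponds to a distribution of dipoles over the surface $S$ oriented by
the normal $n$ and having surface density moment $\nu(x)$.

f) Setting $\varphi = \frac{1}{|x|^{n-2}}$ in Green's formula and using the result of Example 14,
show that every harmonic function $f$ in the domain $G$ in the class $C^{(1)}(\overline{G})$ can be
represented as the sum of a single-layer and a double-layer potential located on the
boundary $S$ of $G$.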

7. a) The function $\frac{1}{|x|}$ is the potential of the electric field intensity $A = -\frac{x}{|x|^3}$
created in $\mathbb{R}^3$ by a unit charge located at the origin. We also know that

$$\operatorname{div}\Bigl(\frac{x}{|x|^3}\Bigr) = 4\pi\delta, \qquad \operatorname{div}\Bigl(\frac{qx}{|x|^3}\Bigr) = 4\pi q\delta, \qquad \operatorname{div}\operatorname{grad}\Bigl(\frac{q}{|x|}\Bigr) = -4\pi q\delta.$$

Starting from this, explain why it was necessary to assume that the function
$U(x) = \int_{\mathbb{R}^3} \frac{\mu(\xi)\,d\xi}{|x-\xi|}$ must satisfy the equation $\Delta U = -4\pi\mu$. Verify that it does indeed
satisfy the Poisson equation written here.

b) A physical corollary of the Gauss-Ostrogradskii formula, known in electromagnetic
field theory as Gauss' theorem, is that the flux across a closed surface $S$ of
the intensity of the electric field created by charges distributed in $\mathbb{R}^3$ equals $Q/\varepsilon_0$
(see pp. 279 and 280), where $Q$ is the total charge in the region bounded by the
surface $S$. Prove this theorem of Gauss.

8. Verify the following equalities, understood in the sense of the theory of generalized
functions.

a) $\Delta E = \delta$, if

$$E(x) = \begin{cases}
\dfrac{1}{2\pi}\ln|x| & \text{for } x \in \mathbb{R}^2, \\[2mm]
-\dfrac{\Gamma(n/2)}{2\pi^{n/2}(n-2)}\,|x|^{-(n-2)} & \text{for } x \in \mathbb{R}^n,\ n > 2.
\end{cases}$$

b) $(\Delta + k^2)E = \delta$, if $E(x) = -\frac{e^{ik|x|}}{4\pi|x|}$ or if $E(x) = -\frac{e^{-ik|x|}}{4\pi|x|}$ and $x \in \mathbb{R}^3$.

c) $\square_a E = \delta$, where $\square_a = \frac{\partial^2}{\partial t^2} - a^2\bigl[\bigl(\frac{\partial}{\partial x^1}\bigr)^2 + \cdots + \bigl(\frac{\partial}{\partial x^n}\bigr)^2\bigr]$, and
$E = \frac{H(at - |x|)}{2\pi a\sqrt{a^2t^2 - |x|^2}}$ for $x \in \mathbb{R}^2$ or $E = \frac{H(t)}{4\pi a^2 t}\,\delta_{S_{at}} \equiv \frac{H(t)}{2\pi a}\,\delta(a^2t^2 - |x|^2)$ for $x \in \mathbb{R}^3$,
$t \in \mathbb{R}$. Here $H(t)$ is the Heaviside function, $S_{at} = \{x \in \mathbb{R}^3 \mid |x| = at\}$ is a sphere, and $a > 0$.

d) Using the preceding results, present the solution of the equation $Au = f$ for
the corresponding differential operator $A$ in the form of the convolution $f * E$ and
verify, for example, assuming the function $f$ continuous, that the integrals depending
on a parameter that you have obtained indeed satisfy the equation $Au = f$.


9. Differentiation of an integral over a liquid volume. 

Space is filled with a moving substance (a liquid). Let $v = v(t, x)$ and $\rho = \rho(t, x)$
be respectively the velocity of displacement and the density of the substance at
time $t$ at the point $x$. We observe the motion of a portion of the substance filling the
domain $\Omega_0$ at the initial moment of time.

a) Express the mass of the substance filling the domain $\Omega_t$ obtained from $\Omega_0$ at
time $t$ and write the law of conservation of mass.

b) By differentiating the integral $F(t) = \int_{\Omega_t} f(t, x)\,d\omega$ with variable domain
of integration $\Omega_t$ (the volume of liquid), show that
$F'(t) = \int_{\Omega_t} \frac{\partial f}{\partial t}\,d\omega + \int_{\partial\Omega_t} f\langle v, n\rangle\,d\sigma$,
where $\Omega_t$, $\partial\Omega_t$, $d\omega$, $d\sigma$, $n$, $v$, $\langle\,,\rangle$ are respectively the domain, its
boundary, the element of volume, the element of area, the unit outward normal, the
flow velocity at time $t$ at corresponding points, and the inner product.

c) Show that $F'(t)$ in problem b) can be represented in the form
$F'(t) = \int_{\Omega_t} \bigl(\frac{\partial f}{\partial t} + \operatorname{div}(fv)\bigr)\,d\omega$.

d) Comparing the results of problems a), b), and c), obtain the equation of continuity
$\frac{\partial\rho}{\partial t} + \operatorname{div}(\rho v) = 0$. (In this connection, see also Sect. 14.4.2.)

e) Let $|\Omega_t|$ be the volume of the domain $\Omega_t$. Show that $\frac{d|\Omega_t|}{dt} = \int_{\Omega_t} \operatorname{div} v\,d\omega$.

f) Show that the velocity field $v$ of the flow of an incompressible liquid is
divergence-free ($\operatorname{div} v = 0$) and that this condition is the mathematical expression
of the incompressibility (conservation of volume) of any portion of the evolving
medium.

g) The phase velocity field $(\dot{p}, \dot{q})$ of a Hamiltonian system of classical mechanics
satisfies the Hamilton equations $\dot{p} = -\frac{\partial H}{\partial q}$, $\dot{q} = \frac{\partial H}{\partial p}$, where $H = H(p, q)$ is
the Hamiltonian of the system. Following Liouville, show that a Hamiltonian flow
preserves the phase volume. Verify also that the Hamiltonian $H$ (energy) is constant
along the streamlines (trajectories).


Chapter 18 

Fourier Series and the Fourier Transform 


18.1 Basic General Concepts Connected with Fourier Series 

18.1.1 Orthogonal Systems of Functions 

a. Expansion of a Vector in a Vector Space 

During this course of analysis we have mentioned several times that certain classes
of functions form vector spaces in relation to the standard arithmetic operations.
Such, for example, are the basic classes of analysis, which consist of smooth, continuous,
or integrable real-, complex-, or vector-valued functions on a domain $X \subset \mathbb{R}^n$.
From the point of view of algebra the equality

$$f = \alpha_1 f_1 + \cdots + \alpha_n f_n,$$

where $f, f_1, \ldots, f_n$ are functions of the given class and $\alpha_i$ are coefficients from
$\mathbb{R}$ or $\mathbb{C}$, simply means that the vector $f$ is a linear combination of the vectors
$f_1, \ldots, f_n$ of the vector space under consideration.

In analysis, as a rule, it is necessary to consider "infinite linear combinations", that is,
series of functions of the form

$$\sum_{k=1}^{\infty} \alpha_k f_k. \tag{18.1}$$

The definition of the sum of the series requires that some topology (in particular,
a metric) be defined in the vector space in question, making it possible to judge
whether the difference $f - S_n$ tends to zero or not, where $S_n = \sum_{k=1}^{n} \alpha_k f_k$.

The main device used in classical analysis to introduce a metric on a vector space
is to define some norm of a vector or inner product of vectors in that space.
Section 10.1 was devoted to a discussion of these concepts.

We are now going to consider only spaces endowed with an inner product (which,
as before, we shall denote $\langle\,,\rangle$). In such spaces one can speak of orthogonal vectors,
orthogonal systems of vectors, and orthogonal bases, just as in the case of
three-dimensional Euclidean space familiar from analytic geometry.

Definition 1 The vectors $x$ and $y$ in a vector space endowed with an inner product
$\langle\,,\rangle$ are orthogonal (with respect to that inner product) if $\langle x, y\rangle = 0$.

Definition 2 The system of vectors $\{x_k;\ k \in K\}$ is orthogonal if the vectors in it
corresponding to different values of the index $k$ are pairwise orthogonal.

Definition 3 The system of vectors $\{e_k;\ k \in K\}$ is orthonormalized (or orthonormal)
if $\langle e_i, e_j\rangle = \delta_{ij}$ for every pair of indices $i, j \in K$, where $\delta_{ij}$ is the Kronecker
symbol, that is, $\delta_{ij} = 1$ if $i = j$ and $\delta_{ij} = 0$ if $i \ne j$.

Definition 4 A finite system of vectors $x_1, \ldots, x_n$ is linearly independent if the
equality $\alpha_1 x_1 + \alpha_2 x_2 + \cdots + \alpha_n x_n = 0$ is possible only when $\alpha_1 = \alpha_2 = \cdots = \alpha_n = 0$
(in the first equality 0 is the zero vector and in the second it is the zero of the
coefficient field).

An arbitrary system of vectors of a vector space is a system of linearly indepen- 
dent vectors if every finite subsystem of it is linearly independent. 

The main question that will interest us now is the question of expanding a vector 
in a given system of linearly independent vectors. 

Having in mind later applications to spaces of functions (which may be infinite- 
dimensional as well) we must reckon with the fact that such an expansion may, in 
particular, lead to a series of the type (18.1). That is precisely where analysis enters 
into the study of the fundamental and essentially algebraic question we have posed. 

As is known from analytic geometry, expansions in orthogonal and orthonormal 
systems have many technical advantages over expansions in arbitrary linearly inde- 
pendent systems. (The coefficients of the expansion are easy to compute; it is easy to 
compute the inner product of two vectors from their coefficients in an orthonormal 
basis, and so on.) 

It is for that reason that we shall be mainly interested in expansions in orthonormal
systems. In function spaces these will be expansions in orthogonal systems of
functions or Fourier¹ series, to the study of which this chapter is devoted.


¹J.-B.J. Fourier (1768-1830) - French mathematician. His most important work Théorie
analytique de la chaleur (1822) contained the heat equation derived by Fourier and the method of
separation of variables (the Fourier method) of solving it (see p. 510). The key element in the Fourier
method is the expansion of a function in a trigonometric (Fourier) series. Many outstanding
mathematicians later undertook the study of the possibility of such a representation. This, in particular,
led to the creation of the theory of functions of a real variable and set theory, and it helped to
promote the development of the very concept of a function.

b. Examples of Orthogonal Systems of Functions

Extending Example 12 of Sect. 10.1, we introduce an inner product

$$\langle f, g\rangle := \int_X (f \cdot \bar{g})(x)\,dx \tag{18.2}$$

on the vector space $\mathcal{R}_2(X, \mathbb{C})$ consisting of functions on the set $X \subset \mathbb{R}^n$ that are
locally square-integrable (as proper or improper integrals).

Since $|f \cdot \bar{g}| \le \frac{1}{2}\bigl(|f|^2 + |g|^2\bigr)$, the integral in (18.2) converges and hence defines
$\langle f, g\rangle$ unambiguously.

If we are discussing real-valued functions, relation (18.2) in the real space
$\mathcal{R}_2(X, \mathbb{R})$ reduces to the equality

$$\langle f, g\rangle := \int_X (f \cdot g)(x)\,dx. \tag{18.3}$$

Relying on properties of the integral, one can easily verify that all the axioms for
an inner product listed in Sect. 10.1 are satisfied in this case, provided we identify
two functions that differ only on a set of $n$-dimensional measure zero. Throughout
the following, in the text portion of the section, inner products of functions will be
understood in the sense of Eqs. (18.2) and (18.3).

Example 1 We recall that for integers $m$ and $n$ (taken nonnegative in (18.5)-(18.7))

$$\int_{-\pi}^{\pi} e^{imx} \cdot e^{-inx}\,dx = \begin{cases} 0, & \text{if } m \ne n, \\ 2\pi, & \text{if } m = n; \end{cases} \tag{18.4}$$

$$\int_{-\pi}^{\pi} \cos mx \cos nx\,dx = \begin{cases} 0, & \text{if } m \ne n, \\ \pi, & \text{if } m = n \ne 0, \\ 2\pi, & \text{if } m = n = 0; \end{cases} \tag{18.5}$$

$$\int_{-\pi}^{\pi} \cos mx \sin nx\,dx = 0; \tag{18.6}$$

$$\int_{-\pi}^{\pi} \sin mx \sin nx\,dx = \begin{cases} 0, & \text{if } m \ne n, \\ \pi, & \text{if } m = n \ne 0. \end{cases} \tag{18.7}$$

These relations show that $\{e^{inx};\ n \in \mathbb{Z}\}$ is an orthogonal system of vectors in
the space $\mathcal{R}_2([-\pi, \pi], \mathbb{C})$ relative to the inner product (18.2) and the trigonometric
system $\{1, \cos nx, \sin nx;\ n \in \mathbb{N}\}$ is orthogonal in $\mathcal{R}_2([-\pi, \pi], \mathbb{R})$. If we regard the
trigonometric system as a set of vectors in $\mathcal{R}_2([-\pi, \pi], \mathbb{C})$, that is, if we allow
linear combinations of them to have complex coefficients, then by Euler's formulas
$e^{inx} = \cos nx + i\sin nx$, $\cos nx = \frac{1}{2}(e^{inx} + e^{-inx})$, $\sin nx = \frac{1}{2i}(e^{inx} - e^{-inx})$, we see
that these two systems can be expressed linearly in terms of each other, that is, they
are algebraically equivalent. For that reason the exponential system $\{e^{inx};\ n \in \mathbb{Z}\}$ is
also called the trigonometric system or more precisely the trigonometric system in
complex notation.

Relations (18.4)-(18.7) show that these systems are orthogonal, but not normalized,
while the systems $\bigl\{\frac{1}{\sqrt{2\pi}}e^{inx};\ n \in \mathbb{Z}\bigr\}$ and

$$\Bigl\{\frac{1}{\sqrt{2\pi}},\ \frac{1}{\sqrt{\pi}}\cos nx,\ \frac{1}{\sqrt{\pi}}\sin nx;\ n \in \mathbb{N}\Bigr\}$$

are orthonormal.

If the closed interval $[-\pi, \pi]$ is replaced by an arbitrary closed interval $[-l, l] \subset \mathbb{R}$,
then by a change of variable one can obtain the analogous systems $\{e^{i\frac{\pi}{l}nx};\ n \in \mathbb{Z}\}$
and $\{1, \cos\frac{\pi}{l}nx, \sin\frac{\pi}{l}nx;\ n \in \mathbb{N}\}$, which are orthogonal in the spaces $\mathcal{R}_2([-l, l], \mathbb{C})$
and $\mathcal{R}_2([-l, l], \mathbb{R})$, and also the corresponding orthonormal systems

$$\Bigl\{\frac{1}{\sqrt{2l}}e^{i\frac{\pi}{l}nx};\ n \in \mathbb{Z}\Bigr\} \quad\text{and}\quad \Bigl\{\frac{1}{\sqrt{2l}},\ \frac{1}{\sqrt{l}}\cos\frac{\pi}{l}nx,\ \frac{1}{\sqrt{l}}\sin\frac{\pi}{l}nx;\ n \in \mathbb{N}\Bigr\}.$$
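Relations (18.4)-(18.7) of Example 1 are easy to spot-check numerically. The following is a minimal sketch, assuming a plain midpoint rule on $[-\pi, \pi]$; the helper names (`integrate`, `inner_exp`, `inner_real`, `cos_k`, `sin_k`) are introduced here for illustration only.

```python
import cmath
import math

def integrate(g, lo, hi, steps=2000):
    """Composite midpoint rule; g may be real- or complex-valued."""
    h = (hi - lo) / steps
    return h * sum(g(lo + (i + 0.5) * h) for i in range(steps))

def inner_exp(m, n):
    """Inner product (18.2) of e^{imx} and e^{inx} on [-pi, pi]."""
    return integrate(lambda x: cmath.exp(1j * m * x) * cmath.exp(-1j * n * x),
                     -math.pi, math.pi)

def inner_real(u, v):
    """Inner product (18.3) of two real functions on [-pi, pi]."""
    return integrate(lambda x: u(x) * v(x), -math.pi, math.pi)

def cos_k(k):
    return lambda x: math.cos(k * x)

def sin_k(k):
    return lambda x: math.sin(k * x)

# Squared norms, from which the normalizing factors 1/sqrt(2*pi) and 1/sqrt(pi) follow.
norm_sq_exp = inner_exp(3, 3).real            # close to 2*pi
norm_sq_one = inner_real(cos_k(0), cos_k(0))  # close to 2*pi
norm_sq_cos = inner_real(cos_k(2), cos_k(2))  # close to pi
norm_sq_sin = inner_real(sin_k(2), sin_k(2))  # close to pi
```

Dividing each function by the square root of its squared norm gives the orthonormal systems listed above.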

Example 2 Let $I_x$ be an interval in $\mathbb{R}^m$ and $I_y$ an interval in $\mathbb{R}^n$, and let $\{f_i(x)\}$ be
an orthogonal system of functions in $\mathcal{R}_2(I_x, \mathbb{R})$ and $\{g_j(y)\}$ an orthogonal system
of functions in $\mathcal{R}_2(I_y, \mathbb{R})$. Then, as follows from Fubini's theorem, the system of
functions $\{u_{ij}(x, y) := f_i(x)g_j(y)\}$ is orthogonal in $\mathcal{R}_2(I_x \times I_y, \mathbb{R})$.


Example 3 We remark that for $\alpha \ne \beta$

$$\int_0^l \sin\alpha x \sin\beta x\,dx = \frac{1}{2}\Bigl(\frac{\sin(\alpha - \beta)l}{\alpha - \beta} - \frac{\sin(\alpha + \beta)l}{\alpha + \beta}\Bigr)
= \cos\alpha l \cos\beta l \cdot \frac{\beta\tan\alpha l - \alpha\tan\beta l}{\alpha^2 - \beta^2}.$$

Hence, if $\alpha$ and $\beta$ are such that $\frac{\tan\alpha l}{\alpha} = \frac{\tan\beta l}{\beta}$, the original integral equals zero.

Consequently, if $\xi_1 < \xi_2 < \cdots < \xi_n < \cdots$ is a sequence of roots of the equation
$\tan\xi l = c\xi$, where $c$ is an arbitrary constant, then the system of functions
$\{\sin(\xi_n x);\ n \in \mathbb{N}\}$ is orthogonal on the interval $[0, l]$. In particular, for $c = 0$, we
obtain the familiar system $\{\sin(\frac{\pi}{l}nx);\ n \in \mathbb{N}\}$.

Example 4 Consider the equation

$$\Bigl(-\frac{d^2}{dx^2} + q(x)\Bigr)u(x) = \lambda u(x),$$

where $q \in C^{(\infty)}([a, b], \mathbb{R})$ and $\lambda$ is a numerical coefficient. Let us assume that the
functions $u_1, u_2, \ldots$ are of class $C^{(2)}([a, b], \mathbb{R})$ and vanish at the endpoints of the
closed interval $[a, b]$ and that each of them satisfies the given equation with particular
values $\lambda_1, \lambda_2, \ldots$ of the coefficient $\lambda$. We shall show that if $\lambda_i \ne \lambda_j$, then the
functions $u_i$ and $u_j$ are orthogonal on $[a, b]$.

Indeed, integrating by parts, we find that

$$\int_a^b \Bigl[\Bigl(-\frac{d^2}{dx^2} + q(x)\Bigr)u_i(x)\Bigr]u_j(x)\,dx = \int_a^b u_i(x)\Bigl[\Bigl(-\frac{d^2}{dx^2} + q(x)\Bigr)u_j(x)\Bigr]\,dx.$$

According to the equation, we obtain from this the relation

$$\lambda_i\langle u_i, u_j\rangle = \lambda_j\langle u_i, u_j\rangle,$$

and, since $\lambda_i \ne \lambda_j$, we now conclude that $\langle u_i, u_j\rangle = 0$.

In particular, if $q(x) \equiv 0$ on $[a, b]$ and $[a, b] = [0, \pi]$ we again find that the
system $\{\sin nx;\ n \in \mathbb{N}\}$ is orthogonal on $[0, \pi]$.


Further examples, including examples of orthogonal systems of importance in 
mathematical physics, will be found in the problems at the end of this section. 


c. Orthogonalization 

It is well-known that in a finite-dimensional Euclidean space, starting with a
linearly independent system of vectors, there is a canonical way of constructing an
orthogonal and even orthonormal system of vectors equivalent to the given system,
using the Gram²-Schmidt³ orthogonalization process. By the same method one can
obviously orthonormalize any linearly independent system of vectors $\psi_1, \psi_2, \ldots$ in
any vector space having an inner product.

We recall that the orthogonalization process leading to the orthonormal system
$\varphi_1, \varphi_2, \ldots$ is described by the following relations:

$$\varphi_1 = \frac{\psi_1}{\|\psi_1\|}, \qquad
\varphi_2 = \frac{\psi_2 - \langle\psi_2, \varphi_1\rangle\varphi_1}{\|\psi_2 - \langle\psi_2, \varphi_1\rangle\varphi_1\|}, \qquad \ldots, \qquad
\varphi_n = \frac{\psi_n - \sum_{k=1}^{n-1}\langle\psi_n, \varphi_k\rangle\varphi_k}{\bigl\|\psi_n - \sum_{k=1}^{n-1}\langle\psi_n, \varphi_k\rangle\varphi_k\bigr\|}.$$
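The orthogonalization process can be tried out on the monomials $1, x, x^2, x^3$ with the inner product (18.3) on $[-1, 1]$. The sketch below is an illustration under one stated simplification: it performs the orthogonalization step without the final division by the norm, so that all arithmetic stays exact in rational numbers; the normalizing factors can be read off from the squared norms. Polynomials are stored as coefficient lists (lowest degree first), and the function names are introduced here for illustration.

```python
from fractions import Fraction

def inner(p, q):
    """Integral of p(x) * q(x) over [-1, 1] for coefficient lists p and q."""
    total = Fraction(0)
    for i, a in enumerate(p):
        for j, b in enumerate(q):
            if (i + j) % 2 == 0:                 # odd powers integrate to zero
                total += a * b * Fraction(2, i + j + 1)
    return total

def subtract_multiple(p, c, q):
    """Return the coefficient list of p - c * q."""
    n = max(len(p), len(q))
    p = p + [Fraction(0)] * (n - len(p))
    q = q + [Fraction(0)] * (n - len(q))
    return [a - c * b for a, b in zip(p, q)]

def orthogonalize_monomials(count):
    """Orthogonalize 1, x, ..., x^(count-1) without normalizing."""
    basis = []
    for n in range(count):
        psi = [Fraction(0)] * n + [Fraction(1)]  # the monomial x^n
        q = psi
        for phi in basis:
            q = subtract_multiple(q, inner(psi, phi) / inner(phi, phi), phi)
        basis.append(q)
    return basis

polys = orthogonalize_monomials(4)
norms_squared = [inner(p, p) for p in polys]
```

Here `polys[2]` and `polys[3]` come out as $x^2 - \frac{1}{3}$ and $x^3 - \frac{3}{5}x$; dividing each polynomial by the square root of the corresponding entry of `norms_squared` gives the orthonormal system.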

Example 5 The process of orthogonalizing the linearly independent system $\{1, x,
x^2, \ldots\}$ in $\mathcal{R}_2([-1, 1], \mathbb{R})$ leads to the system of orthogonal polynomials known as
the Legendre polynomials. We note that the name Legendre polynomials is often
given not to the orthonormal system, but to a system of polynomials proportional
to these polynomials. The proportionality factor can be chosen from various
considerations, for example, requiring the leading coefficient to be 1 or requiring the
polynomial to have the value 1 at $x = 1$. The orthogonality of the system is unaffected
by these requirements, but in general orthonormality is lost.


²J.P. Gram (1850-1916) - Danish mathematician who continued the research of P.L. Chebyshev
and exhibited the connection between orthogonal series expansions and the problem of best
least-squares approximation (see Fourier series below). It was in these investigations that the
orthogonalization process and the famous Gram matrix arose (see p. 187 and the system (18.18) on p. 504).

³E. Schmidt (1876-1959) - German mathematician who studied the geometry of Hilbert space in
connection with integral equations and described it in the language of Euclidean geometry.

We have already encountered the standard Legendre polynomials, which are defined
by Rodrigues' formula

$$P_n(x) = \frac{1}{n!\,2^n}\,\frac{d^n(x^2 - 1)^n}{dx^n}.$$

For these polynomials $P_n(1) = 1$. Let us write out the first few Legendre polynomials,
normalized by requiring the leading coefficient to be 1:

$$\tilde{P}_0(x) \equiv 1, \qquad \tilde{P}_1(x) = x, \qquad \tilde{P}_2(x) = x^2 - \frac{1}{3}, \qquad \tilde{P}_3(x) = x^3 - \frac{3}{5}x.$$

The orthonormalized Legendre polynomials have the form

$$\hat{P}_n(x) = \sqrt{\frac{2n + 1}{2}}\,P_n(x),$$

where $n = 0, 1, 2, \ldots$.

One can verify by direct computation that these polynomials are orthogonal on
the closed interval $[-1, 1]$. Taking Rodrigues' formula as the definition of the
polynomial $P_n(x)$, let us verify that the system of Legendre polynomials $\{P_n(x)\}$ is
orthogonal on the closed interval $[-1, 1]$. To do this, it suffices to verify that $P_n(x)$
is orthogonal to $1, x, \ldots, x^{n-1}$, since all polynomials $P_k$ of degree $k < n$ are linear
combinations of these.

Integrating by parts for $k < n$, we indeed find that

$$\int_{-1}^{1} x^k P_n(x)\,dx = \frac{(-1)^{k+1}}{n!\,2^n}\int_{-1}^{1} \frac{d^{k+1}x^k}{dx^{k+1}} \cdot \frac{d^{n-k-1}(x^2 - 1)^n}{dx^{n-k-1}}\,dx = 0.$$

A certain picture of the origin of orthogonal systems of functions in analysis will 
be given in the last subsection of this section and in the problems at the end of the 
section. At present we shall return to the fundamental general problems connected 
with the expansion of a vector in terms of vectors of a given system of vectors in a 
vector space with inner product. 


d. Continuity of the Inner Product and the Pythagorean Theorem 

We shall have to work not only with finite sums of vectors but also with infinite sums
(series). In this connection we note that the inner product is a continuous function,
enabling us to extend the ordinary algebraic properties of the inner product to the
case of series.

Let $X$ be a vector space with an inner product $\langle\,,\rangle$ and the norm it induces $\|x\| :=
\sqrt{\langle x, x\rangle}$ (see Sect. 10.1). Convergence of a series $\sum_{i=1}^{\infty} x_i = x$ of vectors $x_i \in X$
to the vector $x \in X$ will be understood in the sense of convergence in this norm.

Lemma 1 (Continuity of the inner product) Let $\langle\,,\rangle : X^2 \to \mathbb{C}$ be an inner product in
the complex vector space $X$. Then

a) the function $(x, y) \mapsto \langle x, y\rangle$ is continuous jointly in the two variables;

b) if $x = \sum_{i=1}^{\infty} x_i$, then $\langle x, y\rangle = \sum_{i=1}^{\infty} \langle x_i, y\rangle$;

c) if $e_1, e_2, \ldots$ is an orthonormal system of vectors in $X$ and $x = \sum_{i=1}^{\infty} x^i e_i$
and $y = \sum_{i=1}^{\infty} y^i e_i$, then $\langle x, y\rangle = \sum_{i=1}^{\infty} x^i \bar{y}^i$.

Proof Assertion a) follows from the Cauchy-Bunyakovskii inequality (see
Sect. 10.1):

$$|\langle x - x_0, y - y_0\rangle|^2 \le \|x - x_0\|^2 \cdot \|y - y_0\|^2.$$

Assertion b) follows from a), since

$$\langle x, y\rangle = \sum_{i=1}^{n} \langle x_i, y\rangle + \Bigl\langle \sum_{i=n+1}^{\infty} x_i,\ y\Bigr\rangle$$

and $\sum_{i=n+1}^{\infty} x_i \to 0$ as $n \to \infty$.

Assertion c) follows by repeated application of b), taking account of the relation
$\langle x, y\rangle = \overline{\langle y, x\rangle}$. □

The following result is an immediate consequence of the lemma.

Theorem 1 (Pythagoras⁴)

a) If $\{x_i\}$ is a system of mutually orthogonal vectors and $x = \sum_i x_i$, then
$\|x\|^2 = \sum_i \|x_i\|^2$.

b) If $\{e_i\}$ is an orthonormalized system of vectors and $x = \sum_i x^i e_i$, then
$\|x\|^2 = \sum_i |x^i|^2$.



18.1.2 Fourier Coefficients and Fourier Series 

a. Definition of the Fourier Coefficients and the Fourier Series 

Let $\{e_i\}$ be an orthonormal system and $\{l_i\}$ an orthogonal system of vectors in a
space $X$ with inner product $\langle\,,\rangle$.


⁴Pythagoras of Samos (conjectured to be 580-500 BCE) - famous ancient Greek mathematician
and idealist philosopher, founder of the Pythagorean school, which, in particular, made the
discovery that the side and diagonal of a square are incommensurable, a discovery that disturbed the
ancients. The classical Pythagorean theorem itself was known in a number of countries long before
Pythagoras (possibly without proof, to be sure).

Suppose that $x = \sum_i x^i l_i$. The coefficients $x^i$ in this expansion of the vector $x$
can be found directly:

$$x^i = \frac{\langle x, l_i\rangle}{\langle l_i, l_i\rangle}.$$

If $l_i = e_i$, the expression becomes even simpler:

$$x^i = \langle x, e_i\rangle.$$

We remark that the formulas for $x^i$ make sense and are completely determined
if the vector $x$ itself and the orthogonal system $\{l_i\}$ (or $\{e_i\}$) are given. The equality
$x = \sum_i x^i l_i$ (or $x = \sum_i x^i e_i$) is no longer needed to compute $x^i$ from these
formulas.

Definition 5 The numbers $\bigl\{\frac{\langle x, l_i\rangle}{\langle l_i, l_i\rangle}\bigr\}$ are the Fourier coefficients of the vector $x \in X$
in the orthogonal system $\{l_i\}$.

If the system $\{e_i\}$ is orthonormal, the Fourier coefficients have the form $\{\langle x, e_i\rangle\}$.

From the geometric point of view the $i$th Fourier coefficient $\langle x, e_i\rangle$ of the vector
$x \in X$ is the projection of that vector in the direction of the unit vector $e_i$. In the
familiar case of three-dimensional Euclidean space $E^3$ with a given orthonormal
frame $e_1, e_2, e_3$, the Fourier coefficients $x^i = \langle x, e_i\rangle$, $i = 1, 2, 3$, are the coordinates
of the vector $x$ in the basis $e_1, e_2, e_3$, appearing in the expansion $x = x^1 e_1 + x^2 e_2 + x^3 e_3$.

If we were given only the two vectors $e_1$ and $e_2$ instead of all three $e_1, e_2, e_3$, the
expansion $x = x^1 e_1 + x^2 e_2$ in this system would certainly not be valid for all vectors
$x \in E^3$. Nevertheless, the Fourier coefficients $x^i = \langle x, e_i\rangle$, $i = 1, 2$, would be
defined in this case and the vector $x_e = x^1 e_1 + x^2 e_2$ would be the orthogonal projection
of the vector $x$ onto the plane $L$ of the vectors $e_1$ and $e_2$. Among all the vectors
in that plane, the vector $x_e$ is distinguished by being closest to $x$ in the sense that
$\|x - y\| \ge \|x - x_e\|$ for any vector $y \in L$. This is the remarkable extremal property
of the Fourier coefficients, to which we shall return below in the general situation.
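The three-dimensional picture can be made concrete with a small computation. The sketch below is only an illustration: the particular vectors `e1`, `e2`, `x` and the sampling grid are assumptions chosen for the example, and vectors of $E^3$ are represented by tuples of floats.

```python
import math

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def norm(u):
    return math.sqrt(dot(u, u))

def sub(u, v):
    return tuple(a - b for a, b in zip(u, v))

def combine(c1, u, c2, v):
    """Return the vector c1 * u + c2 * v."""
    return tuple(c1 * a + c2 * b for a, b in zip(u, v))

# Two orthonormal vectors spanning the plane L (an assumed example).
e1 = (1 / math.sqrt(2), 1 / math.sqrt(2), 0.0)
e2 = (0.0, 0.0, 1.0)
x = (3.0, 1.0, 2.0)

c1, c2 = dot(x, e1), dot(x, e2)    # Fourier coefficients of x in the system {e1, e2}
x_e = combine(c1, e1, c2, e2)      # orthogonal projection of x onto L
h = sub(x, x_e)                    # the remainder, orthogonal to L

# Distances from x to a grid of other points y = s*e1 + t*e2 of the plane L.
grid = [k / 4 for k in range(-20, 21)]
distances = [norm(sub(x, combine(s, e1, t, e2))) for s in grid for t in grid]
```

In this example `x_e` is (up to rounding) the vector $(2, 2, 2)$, the remainder `h` is orthogonal to both `e1` and `e2`, and no sampled point of the plane is closer to `x` than `x_e`.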

Definition 6 If $X$ is a vector space with inner product $\langle\,,\rangle$ and $l_1, l_2, \ldots, l_n, \ldots$ is
an orthogonal system of nonzero vectors in $X$, then for each vector $x \in X$ one can
form the series

$$x \sim \sum_{k=1}^{\infty} \frac{\langle x, l_k\rangle}{\langle l_k, l_k\rangle}\,l_k. \tag{18.8}$$

This series is the Fourier series of $x$ in the orthogonal system $\{l_k\}$.

If the system $\{l_k\}$ is finite, the Fourier series reduces to its finite sum.

In the case of an orthonormal system $\{e_k\}$ the Fourier series of a vector $x \in X$
has a particularly simple expression:

$$x \sim \sum_{k=1}^{\infty} \langle x, e_k\rangle e_k. \tag{18.8'}$$

Example 6 Let $X = \mathcal{R}_2([-\pi, \pi], \mathbb{R})$. Consider the orthogonal system

$$\{1, \cos kx, \sin kx;\ k \in \mathbb{N}\}$$

of Example 1. To the function $f \in \mathcal{R}_2([-\pi, \pi], \mathbb{R})$ there corresponds a Fourier
series

$$f \sim \frac{a_0(f)}{2} + \sum_{k=1}^{\infty} a_k(f)\cos kx + b_k(f)\sin kx$$

in this system. The coefficient $\frac{1}{2}$ is included in the zeroth term so as to give a unified
appearance to the following formulas, which follow from the definition of the
Fourier coefficients:

$$a_k(f) = \frac{1}{\pi}\int_{-\pi}^{\pi} f(x)\cos kx\,dx, \quad k = 0, 1, 2, \ldots \tag{18.9}$$

$$b_k(f) = \frac{1}{\pi}\int_{-\pi}^{\pi} f(x)\sin kx\,dx, \quad k = 1, 2, \ldots. \tag{18.10}$$

Let us set $f(x) = x$. Then $a_k = 0$, $k = 0, 1, 2, \ldots$, and $b_k = (-1)^{k+1}\frac{2}{k}$, $k =
1, 2, \ldots$. Hence in this case we obtain

$$f(x) = x \sim \sum_{k=1}^{\infty} (-1)^{k+1}\frac{2}{k}\sin kx.$$
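The coefficients found for $f(x) = x$ can be compared with a direct numerical evaluation of formulas (18.9) and (18.10). This is a minimal sketch, assuming a midpoint rule for the integrals; the names `integrate`, `a_coeff`, and `b_coeff` are introduced here for illustration.

```python
import math

def integrate(g, lo, hi, steps=20000):
    """Composite midpoint rule on [lo, hi]."""
    h = (hi - lo) / steps
    return h * sum(g(lo + (i + 0.5) * h) for i in range(steps))

def a_coeff(f, k):
    """Formula (18.9): cosine coefficient of f on [-pi, pi]."""
    return integrate(lambda x: f(x) * math.cos(k * x), -math.pi, math.pi) / math.pi

def b_coeff(f, k):
    """Formula (18.10): sine coefficient of f on [-pi, pi]."""
    return integrate(lambda x: f(x) * math.sin(k * x), -math.pi, math.pi) / math.pi

f = lambda x: x

a_numeric = [a_coeff(f, k) for k in range(0, 6)]
b_numeric = [b_coeff(f, k) for k in range(1, 6)]
b_closed_form = [(-1) ** (k + 1) * 2 / k for k in range(1, 6)]
```

The entries of `a_numeric` vanish up to rounding, and `b_numeric` agrees with the closed form $(-1)^{k+1}\frac{2}{k}$ to within the quadrature error.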


Example 7 Let us consider the orthogonal system $\{e^{ikx};\ k \in \mathbb{Z}\}$ of Example 1 in
the space $\mathcal{R}_2([-\pi, \pi], \mathbb{C})$. Let $f \in \mathcal{R}_2([-\pi, \pi], \mathbb{C})$. According to Definition 5 and
relations (18.4), the Fourier coefficients $\{c_k(f)\}$ of $f$ in the system $\{e^{ikx}\}$ are
expressed by the following formula:

$$c_k(f) = \frac{1}{2\pi}\int_{-\pi}^{\pi} f(x)e^{-ikx}\,dx. \tag{18.11}$$

Comparing Eqs. (18.9), (18.10), and (18.11) and taking account of Euler's formula
$e^{i\varphi} = \cos\varphi + i\sin\varphi$, we obtain the following relations between the Fourier
coefficients of a given function in the trigonometric systems written in real and
complex forms:

$$c_k = \begin{cases} \frac{1}{2}(a_k - ib_k), & \text{if } k \ge 0, \\ \frac{1}{2}(a_{-k} + ib_{-k}), & \text{if } k < 0. \end{cases} \tag{18.12}$$
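Relations (18.12) can be checked on a concrete function. The sketch below is an illustration under stated assumptions: the sample function $f(x) = x^2 + x$ is chosen here so that both the cosine and the sine coefficients are nonzero, the integrals are approximated by a midpoint rule, and $b_0$ is taken to be $0$ (which is what formula (18.10) gives for $k = 0$). All function names are hypothetical helpers.

```python
import cmath
import math

def integrate(g, lo, hi, steps=4000):
    """Composite midpoint rule; g may be real- or complex-valued."""
    h = (hi - lo) / steps
    return h * sum(g(lo + (i + 0.5) * h) for i in range(steps))

def a_coeff(f, k):
    return integrate(lambda x: f(x) * math.cos(k * x), -math.pi, math.pi) / math.pi

def b_coeff(f, k):
    return integrate(lambda x: f(x) * math.sin(k * x), -math.pi, math.pi) / math.pi

def c_coeff(f, k):
    """Complex Fourier coefficient of f in the system {e^{ikx}}."""
    return integrate(lambda x: f(x) * cmath.exp(-1j * k * x),
                     -math.pi, math.pi) / (2 * math.pi)

def c_from_real(f, k):
    """Right-hand side of (18.12), with b_0 computed as 0."""
    if k >= 0:
        return 0.5 * (a_coeff(f, k) - 1j * b_coeff(f, k))
    return 0.5 * (a_coeff(f, -k) + 1j * b_coeff(f, -k))

f = lambda x: x * x + x   # assumed sample function

differences = [abs(c_coeff(f, k) - c_from_real(f, k)) for k in range(-3, 4)]
```

All entries of `differences` are at the level of rounding error, and `c_coeff(f, 0)` is close to the mean value $\pi^2/3$ of $f$ over $[-\pi, \pi]$, that is, to $a_0/2$.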


In order that the case k — 0 not be an exception in formulas (18.9) and (18.12), 
it is customary to use <7o to denote not the Fourier coefficient itself, but rather its 
double, as was done above. 
b. Basic General Properties of Fourier Coefficients and Series

The following geometric observation is key in this section.

Lemma (Orthogonal complement) Let $\{l_k\}$ be a finite or countable system of
nonzero pairwise orthogonal vectors in $X$, and suppose the Fourier series of $x \in X$
in the system $\{l_k\}$ converges to $x_l \in X$.

Then in the representation $x = x_l + h$ the vector $h$ is orthogonal to $x_l$; moreover,
$h$ is orthogonal to the entire linear subspace generated by the system of vectors $\{l_k\}$,
and also to its closure in $X$.

Proof Taking account of the properties of the inner product, we see that it suffices
to verify that $\langle h, l_m \rangle = 0$ for every $l_m \in \{l_k\}$.
We are given that

$$h = x - x_l = x - \sum_k \frac{\langle x, l_k \rangle}{\langle l_k, l_k \rangle} l_k.$$

Hence

$$\langle h, l_m \rangle = \langle x, l_m \rangle - \sum_k \frac{\langle x, l_k \rangle}{\langle l_k, l_k \rangle} \langle l_k, l_m \rangle = \langle x, l_m \rangle - \frac{\langle x, l_m \rangle}{\langle l_m, l_m \rangle} \langle l_m, l_m \rangle = 0.$$

□


Geometrically this lemma is transparent, and we have already essentially pointed 
it out when we considered a system of two orthogonal vectors in three-dimensional 
Euclidean space in Sect. 18.1.2a. 
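The picture in three-dimensional Euclidean space can be reproduced in a few lines. In this sketch the vectors `l1`, `l2` and `x` are arbitrary illustrative choices, and `fourier_sum` is a hypothetical helper that forms the finite Fourier sum of $x$ in an orthogonal system.

```python
def dot(u, v):
    """Standard inner product in R^n."""
    return sum(a * b for a, b in zip(u, v))


def fourier_sum(x, system):
    """Sum of <x, l>/<l, l> * l over the vectors l of an orthogonal system."""
    result = [0.0] * len(x)
    for l in system:
        coeff = dot(x, l) / dot(l, l)
        result = [r + coeff * li for r, li in zip(result, l)]
    return result


# Two orthogonal vectors spanning the coordinate plane z = 0.
l1 = [1.0, 1.0, 0.0]
l2 = [1.0, -1.0, 0.0]
x = [3.0, 5.0, 7.0]

x_l = fourier_sum(x, [l1, l2])          # projection of x onto the plane
h = [xi - yi for xi, yi in zip(x, x_l)]  # the remainder h = x - x_l

print(x_l, h)
print(dot(h, l1), dot(h, l2), dot(h, x_l))
```

Here $x_l$ is the projection of $x$ onto the plane spanned by the system, and all three printed inner products are zero, as the lemma asserts.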

On the basis of this lemma we can draw a number of important general conclu- 
sions on the properties of Fourier coefficients and Fourier series. 


Bessel’s Inequality 

Taking account of the orthogonality of the vectors $x_l$ and $h$ in the decomposition
$x = x_l + h$, we find by the Pythagorean theorem that $\|x\|^2 = \|x_l\|^2 + \|h\|^2 \geq \|x_l\|^2$
(the hypotenuse is never smaller than the leg). This relation, written in terms of
Fourier coefficients, is called Bessel's inequality.

Let us write it out. By the Pythagorean theorem

$$\|x_l\|^2 = \left\| \sum_k \frac{\langle x, l_k \rangle}{\langle l_k, l_k \rangle} l_k \right\|^2 = \sum_k \left| \frac{\langle x, l_k \rangle}{\langle l_k, l_k \rangle} \right|^2 \langle l_k, l_k \rangle. \tag{18.13}$$

Hence

$$\sum_k \frac{|\langle x, l_k \rangle|^2}{\langle l_k, l_k \rangle} \leq \|x\|^2. \tag{18.14}$$

This is Bessel's inequality. It has a particularly simple appearance for an
orthonormal system of vectors $\{e_k\}$:

$$\sum_k |\langle x, e_k \rangle|^2 \leq \|x\|^2. \tag{18.15}$$

In terms of the Fourier coefficients $\alpha_k$ themselves Bessel's inequality (18.14) can
be written as $\sum_k |\alpha_k|^2 \|l_k\|^2 \leq \|x\|^2$, which in the case of an orthonormal system
reduces to $\sum_k |\alpha_k|^2 \leq \|x\|^2$.

We have included the absolute value sign in the Fourier coefficient, since we are
allowing complex vector spaces $X$. In this case the Fourier coefficient may assume
complex values.

We note that in deriving Bessel's inequality we made use of the assumption that
the vector $x_l$ exists and that Eq. (18.13) holds. But if the system $\{l_k\}$ is finite, there
is no doubt that the vector $x_l$ does exist (that is, that the Fourier series converges
in $X$). Hence inequality (18.14) holds for every finite subsystem of $\{l_k\}$, and then it
must hold for the whole system as well.


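Example 8 For the trigonometric system (see formulas (18.9) and (18.10)) Bessel's
inequality has the form

$$\frac{|a_0(f)|^2}{2} + \sum_{k=1}^{\infty} |a_k(f)|^2 + |b_k(f)|^2 \leq \frac{1}{\pi} \int_{-\pi}^{\pi} |f|^2(x) \, dx. \tag{18.16}$$

For the system $\{e^{ikx};\ k \in \mathbb{Z}\}$ (see formula (18.11)) Bessel's inequality can be
written in a particularly elegant form:

$$\sum_{-\infty}^{+\infty} |c_k(f)|^2 \leq \frac{1}{2\pi} \int_{-\pi}^{\pi} |f|^2(x) \, dx. \tag{18.17}$$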
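As an illustration of (18.16), take $f(x) = x$ from Example 6, for which $a_k = 0$ and $|b_k| = 2/k$, while the right-hand side equals $\frac{1}{\pi}\int_{-\pi}^{\pi} x^2\,dx = \frac{2\pi^2}{3}$. The sketch below (the function name `bessel_partial_sum` is hypothetical) prints partial sums of the left-hand side next to this bound.

```python
import math


def bessel_partial_sum(n_terms):
    """Partial sum of |b_k|^2 = (2/k)^2 for f(x) = x on [-pi, pi]."""
    return sum((2.0 / k) ** 2 for k in range(1, n_terms + 1))


# Right-hand side of (18.16) for f(x) = x.
right_hand_side = 2 * math.pi ** 2 / 3

for n in (1, 10, 100, 1000):
    print(n, bessel_partial_sum(n), right_hand_side)
```

Every partial sum stays below the bound, and the gap shrinks as more terms are added, which is consistent with the completeness of the trigonometric system discussed later in this section.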


Convergence of Fourier Series in a Complete Space 

Suppose $\sum_k x^k e_k = \sum_k \langle x, e_k \rangle e_k$ is the Fourier series of the vector $x \in X$ in the
orthonormal system $\{e_k\}$. By Bessel's inequality (18.15) the series $\sum_k |x^k|^2$
converges. By the Pythagorean theorem

$$\|x^m e_m + \cdots + x^n e_n\|^2 = |x^m|^2 + \cdots + |x^n|^2.$$

By the Cauchy convergence criterion for a series, the right-hand side of this
equality becomes less than any $\varepsilon > 0$ for all sufficiently large values of $m$ and $n > m$.
Hence we then have

$$\|x^m e_m + \cdots + x^n e_n\| < \sqrt{\varepsilon}.$$

Consequently, the Fourier series $\sum_k x^k e_k$ satisfies the hypotheses of the Cauchy
convergence criterion for series and therefore converges provided the original space
$X$ is complete in the metric induced by the norm $\|x\| = \sqrt{\langle x, x \rangle}$.

To simplify the writing we have carried out the reasoning for a Fourier series in 
an orthonormal system. But everything can be repeated for a Fourier series in any 
orthogonal system. 


The Extremal Property of the Fourier Coefficients 

We shall show that if the Fourier series $\sum_k x^k e_k = \sum_k \langle x, e_k \rangle e_k$ of the vector $x \in X$
in the orthonormal system $\{e_k\}$ converges to a vector $x_l \in X$, then the vector $x_l$
is precisely the one that gives the best approximation of $x$ among all vectors
$y = \sum_{k=1}^{\infty} \alpha_k e_k$ of the space $L$ spanned by $\{e_k\}$, that is, for every $y \in L$,

$$\|x - x_l\| \leq \|x - y\|,$$

and equality holds only for $y = x_l$.

Indeed, by the orthogonal complement lemma and the Pythagorean theorem,

$$\|x - y\|^2 = \|(x - x_l) + (x_l - y)\|^2 = \|h + (x_l - y)\|^2 = \|h\|^2 + \|x_l - y\|^2 \geq \|h\|^2 = \|x - x_l\|^2.$$


Example 9 Digressing slightly from our main purpose, which is the study of
expansions in orthogonal systems, let us assume that we have an arbitrary system of
linearly independent vectors $x_1, \ldots, x_n$ in $X$ and are seeking the best approximation
of a given vector $x \in X$ by linear combinations $\sum_{k=1}^{n} \alpha_k x_k$ of vectors of the
system. Since we can use the orthogonalization process to construct an orthonormal
system $e_1, \ldots, e_n$ that generates the same space $L$ that is generated by the vectors
$x_1, \ldots, x_n$, we can conclude from the extremal property of the Fourier coefficients
that there exists a unique vector $x_l \in L$ such that $\|x - x_l\| = \inf_{y \in L} \|x - y\|$. Since
the vector $h = x - x_l$ is orthogonal to the space $L$, from the equality $x_l + h = x$ we
obtain the system of equations

$$\begin{cases} \langle x_1, x_1 \rangle \alpha_1 + \cdots + \langle x_n, x_1 \rangle \alpha_n = \langle x, x_1 \rangle, \\ \qquad \cdots \\ \langle x_1, x_n \rangle \alpha_1 + \cdots + \langle x_n, x_n \rangle \alpha_n = \langle x, x_n \rangle \end{cases} \tag{18.18}$$

for the coefficients $\alpha_1, \ldots, \alpha_n$ of the expansion $x_l = \sum_{k=1}^{n} \alpha_k x_k$ of the unknown
vector $x_l$ in terms of the vectors of the system $x_1, \ldots, x_n$. The existence and
uniqueness of the solution of this system follow from the existence and uniqueness of the
vector $x_l$. In particular, it follows from this by Cramer's theorem that the determinant
of this system is nonzero. In other words, we have shown as a by-product that
the Gram determinant of a system of linearly independent vectors is nonzero.

This approximation problem and the system of Eqs. (18.18) corresponding to it
arise, as we have already noted, for example, in processing experimental data by
Gauss' least-squares method. (See also Problem 1.)
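The system (18.18) can be assembled and solved mechanically. The sketch below does this in $\mathbb{R}^3$ with exact rational arithmetic; the function names `solve` and `best_approximation` and the sample vectors are assumptions made for illustration only, and the solver is plain Gauss-Jordan elimination rather than any particular library routine.

```python
from fractions import Fraction


def dot(u, v):
    """Standard inner product in R^n."""
    return sum(a * b for a, b in zip(u, v))


def solve(matrix, rhs):
    """Gauss-Jordan elimination with exact rational arithmetic."""
    n = len(rhs)
    rows = [[Fraction(v) for v in row] + [Fraction(b)] for row, b in zip(matrix, rhs)]
    for col in range(n):
        # Pick a row with a nonzero pivot (exists when the matrix is nonsingular).
        pivot = next(r for r in range(col, n) if rows[r][col] != 0)
        rows[col], rows[pivot] = rows[pivot], rows[col]
        rows[col] = [v / rows[col][col] for v in rows[col]]
        for r in range(n):
            if r != col:
                factor = rows[r][col]
                rows[r] = [vr - factor * vc for vr, vc in zip(rows[r], rows[col])]
    return [row[n] for row in rows]


def best_approximation(x, vectors):
    """Coefficients alpha_k solving the Gram system (18.18)."""
    gram = [[dot(xj, xi) for xj in vectors] for xi in vectors]
    rhs = [dot(x, xi) for xi in vectors]
    return solve(gram, rhs)


# Two linearly independent, non-orthogonal vectors and a target vector.
x1, x2 = [1, 1, 0], [1, 0, 1]
x = [1, 2, 3]
alphas = best_approximation(x, [x1, x2])
x_l = [alphas[0] * p + alphas[1] * q for p, q in zip(x1, x2)]
h = [xi - yi for xi, yi in zip(x, x_l)]
print(alphas, dot(h, x1), dot(h, x2))
```

For these vectors the coefficients are $\alpha_1 = 2/3$, $\alpha_2 = 5/3$, and the remainder $h = x - x_l$ is orthogonal to both $x_1$ and $x_2$.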

c. Complete Orthogonal Systems and Parseval’s Equality 

Definition 7 The system $\{x_\alpha;\ \alpha \in A\}$ of vectors of a normed space $X$ is complete
with respect to the set $E \subset X$ (or complete in $E$) if every vector $x \in E$ can be
approximated with arbitrary accuracy in the sense of the norm of $X$ by finite linear
combinations of vectors of the system.

If we denote by $L\{x_\alpha\}$ the linear span in $X$ of the vectors of the system (that is,
the set of all finite linear combinations of vectors of the system), Definition 7 can be
restated as follows:

The system $\{x_\alpha\}$ is complete with respect to the set $E \subset X$ if $E$ is contained in
the closure $\overline{L}\{x_\alpha\}$ of the linear span of the vectors of the system.

Example 10 If $X = E^3$ and $e_1, e_2, e_3$ is a basis in $E^3$, then the system $\{e_1, e_2, e_3\}$ is
complete in $X$. The system $\{e_1, e_2\}$ is not complete in $X$, but it is complete relative
to the set $L\{e_1, e_2\}$ or any subset $E$ of it.

Example 11 Let us regard the sequence of functions $1, x, x^2, \ldots$ as a system of
vectors $\{x^k;\ k = 0, 1, 2, \ldots\}$ in the space $\mathcal{R}_2([a, b], \mathbb{R})$ or $\mathcal{R}_2([a, b], \mathbb{C})$. If $C[a, b]$
is the subspace of continuous functions, then this system is complete with respect
to the set $C[a, b]$.

Proof Indeed, for any function $f \in C[a, b]$ and for every number $\varepsilon > 0$, the
Weierstrass approximation theorem implies that there exists an algebraic polynomial $P(x)$
such that $\max_{x \in [a, b]} |f(x) - P(x)| < \varepsilon$. But then

$$\|f - P\| = \sqrt{\int_a^b |f - P|^2(x) \, dx} < \varepsilon \sqrt{b - a},$$

and hence one can approximate the function $f$ in the sense of the norm of the space
$\mathcal{R}_2([a, b])$ with arbitrary accuracy. □

We note that, in contrast to the situation in Example 9, in the present case not
every continuous function on the closed interval $[a, b]$ is a finite linear combination
of the functions of this system; rather, such a function can only be approximated
by such linear combinations. Thus $C[a, b] \subset \overline{L}\{x^k\}$ in the sense of the norm of the
space $\mathcal{R}_2[a, b]$.


Example 12 If we remove one function, for example the function 1, from the system
$\{1, \cos kx, \sin kx;\ k \in \mathbb{N}\}$, the remaining system $\{\cos kx, \sin kx;\ k \in \mathbb{N}\}$ is no longer
complete in $\mathcal{R}_2([-\pi, \pi], \mathbb{C})$ or $\mathcal{R}_2([-\pi, \pi], \mathbb{R})$.

Proof Indeed, by the extremal property of the Fourier coefficients the best
approximation of the function $f(x) \equiv 1$ among all the finite linear combinations

$$T_n(x) = \sum_{k=1}^{n} (a_k \cos kx + b_k \sin kx)$$

of any length $n$ is given by the trigonometric polynomial $T_n(x)$ in which $a_k$ and $b_k$
are the Fourier coefficients of the function 1 with respect to the orthogonal system
$\{\cos kx, \sin kx;\ k \in \mathbb{N}\}$. But by relations (18.5), such a polynomial of best
approximation must be zero. Hence we always have

$$\|1 - T_n\| \geq \|1\| = \sqrt{\int_{-\pi}^{\pi} 1 \, dx} = \sqrt{2\pi} > 0,$$

and it is impossible to approximate 1 more closely than $\sqrt{2\pi}$ by linear combinations
of functions of this system. □


Theorem (Completeness conditions for an orthogonal system) Let $X$ be a vector
space with inner product $\langle\,,\rangle$, and $l_1, l_2, \ldots, l_n, \ldots$ a finite or countable system of
nonzero pairwise orthogonal vectors in $X$. Then the following conditions are
equivalent:

a) the system $\{l_k\}$ is complete with respect to the set$^5$ $E \subset X$;

b) for every vector $x \in E \subset X$ the following (Fourier series) expansion holds:

$$x = \sum_k \frac{\langle x, l_k \rangle}{\langle l_k, l_k \rangle} l_k; \tag{18.19}$$

c) for every vector $x \in E \subset X$ Parseval's$^6$ equality holds:

$$\|x\|^2 = \sum_k \frac{|\langle x, l_k \rangle|^2}{\langle l_k, l_k \rangle}. \tag{18.20}$$

Equations (18.19) and (18.20) have a particularly simple form in the case of an
orthonormal system $\{e_k\}$. In that case

$$x = \sum_k \langle x, e_k \rangle e_k \tag{18.19'}$$

and

$$\|x\|^2 = \sum_k |\langle x, e_k \rangle|^2. \tag{18.20'}$$

5 The set E may, in particular, consist of a single vector that is of interest for one reason or another.

6 M.A. Parseval (1755-1836) - French mathematician who discovered this relation for the
trigonometric system in 1799.

Thus the important Parseval equality (18.20) or (18.20') is the Pythagorean
theorem written in terms of the Fourier coefficients.

Let us now prove this theorem.

Proof a) $\Rightarrow$ b) by virtue of the extremal property of Fourier coefficients;

b) $\Rightarrow$ c) by the Pythagorean theorem;

c) $\Rightarrow$ a) since by the lemma on the orthogonal complement (see Sect. b) above)
the Pythagorean theorem implies

$$\left\| x - \sum_{k=1}^{n} \frac{\langle x, l_k \rangle}{\langle l_k, l_k \rangle} l_k \right\|^2 = \|x\|^2 - \left\| \sum_{k=1}^{n} \frac{\langle x, l_k \rangle}{\langle l_k, l_k \rangle} l_k \right\|^2 = \|x\|^2 - \sum_{k=1}^{n} \frac{|\langle x, l_k \rangle|^2}{\langle l_k, l_k \rangle}.$$

□


Remark We note that Parseval's equality implies the following simple necessary
condition for completeness of an orthogonal system with respect to a set $E \subset X$: $E$
does not contain a nonzero vector orthogonal to all the vectors in the system.

As a useful supplement to this theorem and the remark just made, we prove the
following general proposition.

Proposition Let $X$ be a vector space with an inner product and $x_1, x_2, \ldots$ a system
of linearly independent vectors in $X$. In order for the system $\{x_k\}$ to be complete
in $X$,

a) a necessary condition is that there be no nonzero vector in $X$ orthogonal to
all the vectors in the system;

b) if $X$ is a complete (Hilbert) space, it suffices that $X$ contain no nonzero vector
orthogonal to all the vectors in the system.

Proof a) If the vector $h$ is orthogonal to all the vectors in the system $\{x_k\}$, we
conclude by the Pythagorean theorem that no linear combination of vectors in the
system can differ from $h$ by less than $\|h\|$. Hence, if the system is complete, then
$\|h\| = 0$.

b) By the orthogonalization process we can obtain an orthonormal system $\{e_k\}$
whose linear span $L\{e_k\}$ is the same as the linear span $L\{x_k\}$ of the original system.

We now take an arbitrary vector $x \in X$. Since the space $X$ is complete, the Fourier
series of $x$ in the system $\{e_k\}$ converges to a vector $x_e \in X$. By the lemma on the
orthogonal complement, the vector $h = x - x_e$ is orthogonal to the space $L\{e_k\} = L\{x_k\}$.
By hypothesis $h = 0$, so that $x = x_e$, and the Fourier series converges to the
vector $x$ itself. Thus the vector $x$ can be approximated arbitrarily closely by finite
linear combinations of vectors of the system $\{e_k\}$ and hence also by finite linear
combinations of the vectors of the system $\{x_k\}$. □

The hypothesis of completeness in part b) of this proposition is essential, as the
following example shows.

Example 13 Consider the space $l_2$ (see Sect. 10.1) of real sequences $a = (a^1, a^2, \ldots)$
for which $\sum_{j=1}^{\infty} (a^j)^2 < \infty$. We define the inner product of the vectors
$a = (a^1, a^2, \ldots)$ and $b = (b^1, b^2, \ldots)$ in $l_2$ in the standard way: $\langle a, b \rangle := \sum_{j=1}^{\infty} a^j b^j$.

Now consider the orthonormal system $e_k = (\underbrace{0, \ldots, 0}_{k}, 1, 0, 0, \ldots)$, $k = 1, 2, \ldots$.
The vector $e_0 = (1, 0, 0, \ldots)$ does not belong to this system. We now add to the
system $\{e_k;\ k \in \mathbb{N}\}$ the vector $e = (1, 1/2, 1/2^2, 1/2^3, \ldots)$ and consider the linear
span $L\{e, e_1, e_2, \ldots\}$ of these vectors. We can regard this linear span as a vector
space $X$ (a subspace of $l_2$) with the inner product from $l_2$.

We note that the vector $e_0 = (1, 0, 0, \ldots)$ obviously cannot be obtained as a finite
linear combination of vectors in the system $e, e_1, e_2, \ldots$, and therefore it does not
belong to $X$. At the same time, it can be approximated as closely as desired in $l_2$ by
such linear combinations, since $e - \sum_{k=1}^{n} \frac{1}{2^k} e_k = (1, 0, \ldots, 0, \frac{1}{2^{n+1}}, \ldots)$.
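The rate of this approximation can be made explicit. The following short computation is added here as an illustration; it uses only the definitions of $e$, $e_0$ and $e_k$ given in this example and the sum of a geometric series.

```latex
\begin{align*}
\Bigl\| e_0 - \Bigl( e - \sum_{k=1}^{n} \frac{1}{2^k} e_k \Bigr) \Bigr\|^2
  &= \Bigl\| \Bigl( 0, \ldots, 0, \frac{1}{2^{n+1}}, \frac{1}{2^{n+2}}, \ldots \Bigr) \Bigr\|^2 \\
  &= \sum_{j=n+1}^{\infty} \frac{1}{4^j}
   = \frac{1}{3 \cdot 4^n} \longrightarrow 0 \quad \text{as } n \to \infty .
\end{align*}
```

Thus the distance from $e_0$ to the space $X$ is zero, although $e_0 \notin X$.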

Hence we have established simultaneously that $X$ is not closed in $l_2$ (and
therefore $X$, in contrast to $l_2$, is not a complete metric space) and that the closure of $X$ in
$l_2$ coincides with $l_2$, since the system $e_0, e_1, e_2, \ldots$ generates the entire space $l_2$.

We now observe that in $X = L\{e, e_1, e_2, \ldots\}$ there is no nonzero vector
orthogonal to all the vectors $e_1, e_2, \ldots$.

Indeed, let $x \in X$, that is, $x = \alpha e + \sum_{k=1}^{n} \alpha_k e_k$, and let $\langle x, e_k \rangle = 0$, $k = 1, 2, \ldots$.
Then $\langle x, e_{n+1} \rangle = \frac{\alpha}{2^{n+1}} = 0$, that is, $\alpha = 0$. But then $\alpha_k = \langle x, e_k \rangle = 0$, $k = 1, \ldots, n$.

Hence we have constructed the required example: the orthogonal system
$e_1, e_2, \ldots$ is not complete in $X$, since it is not complete in the closure of $X$, which
coincides with $l_2$.

This example is of course typically infinite-dimensional. Figure 18.1 represents 
an attempt to illustrate what is going on. 

We note that in the infinite-dimensional case (which is so characteristic of analy- 
sis) the possibility of approximating a vector arbitrarily closely by linear combina- 
tions of vectors of a system and the possibility of expanding the vector in a series of 
vectors of the system are in general different properties of the system. 

A discussion of this problem and the concluding Example 14 will clarify the 
particular role of orthogonal systems and Fourier series for which these properties 
hold or do not hold simultaneously (as the theorem proved above shows). 

Definition 8 The system $x_1, x_2, \ldots, x_n, \ldots$ of vectors of a normed vector space $X$
is a basis of $X$ if every finite subsystem of it consists of linearly independent vectors
and every vector $x \in X$ can be represented as $x = \sum_k \alpha_k x_k$, where $\alpha_k$ are
coefficients from the scalar field of $X$ and the convergence (when the sum is infinite) is
understood in the sense of the norm on $X$.

How is the completeness of a system of vectors related to the property of being 
a basis? 

In a finite-dimensional space X completeness of a system of vectors in X, as 
follows from considerations of compactness and continuity, is obviously equivalent 
to being a basis in X. In the infinite-dimensional case that is in general not so. 

Example 14 Consider the set $C([-1, 1], \mathbb{R})$ of real-valued functions that are
continuous on $[-1, 1]$ as a vector space over the field $\mathbb{R}$ with the standard inner product
defined by (18.3). We denote this space by $C_2([-1, 1], \mathbb{R})$ and consider the system
of linearly independent vectors $1, x, x^2, \ldots$ in it.

This system is complete in $C_2([-1, 1], \mathbb{R})$ (see Example 11), but is not a basis.

Proof We first show that if the series $\sum_{k=0}^{\infty} \alpha_k x^k$ converges in $C_2([-1, 1], \mathbb{R})$, that
is, in the mean-square sense on $[-1, 1]$, then, regarded as a power series, it
converges pointwise on the open interval $]-1, 1[$.

Indeed, by the necessary condition for convergence of a series, we have
$\|\alpha_k x^k\| \to 0$ as $k \to \infty$. But

$$\|\alpha_k x^k\|^2 = \int_{-1}^{1} (\alpha_k x^k)^2 \, dx = \alpha_k^2 \frac{2}{2k + 1}.$$

Hence $|\alpha_k| < \sqrt{2k + 1}$ for all sufficiently large values of $k$. In that case the power
series $\sum_{k=0}^{\infty} \alpha_k x^k$ definitely converges on the interval $]-1, 1[$.

We now denote the sum of this power series on $]-1, 1[$ by $\varphi$. We remark that
on every closed interval $[a, b] \subset \,]-1, 1[$ the power series converges uniformly to
$\varphi|_{[a, b]}$. Consequently it also converges in the sense of mean-square deviation.

It now follows that if a continuous function $f$ is the sum of this series in the sense
of convergence in $C_2([-1, 1], \mathbb{R})$, then $f$ and $\varphi$ are equal on $]-1, 1[$. But the
function $\varphi$ is infinitely differentiable. Hence if we take any function in $C_2([-1, 1], \mathbb{R})$
that is not infinitely differentiable on $]-1, 1[$ it cannot be expanded in a series in the
system $\{x^k;\ k = 0, 1, \ldots\}$. □

Thus, if we take, for example, the function $f(x) = |x|$ and the sequence of numbers
$\{\varepsilon_n = \frac{1}{n};\ n \in \mathbb{N}\}$, we can construct a sequence $\{P_n(x);\ n \in \mathbb{N}\}$ of finite linear
combinations $P_n(x) = \alpha_0 + \alpha_1 x + \cdots + \alpha_n x^n$ of elements of the system $\{x^k;\ k \in \mathbb{N}\}$ such
that $\|f - P_n\| < \frac{1}{n}$, that is, $P_n \to f$ as $n \to \infty$. If necessary, one could assume that
in each such linear combination $P_n(x)$ the coefficients have been
chosen in the unique best-possible way (see Example 9). Nevertheless, the
expansion $f = \sum_{k=0}^{\infty} \alpha_k x^k$ will not arise since in passing from $P_n(x)$ to $P_{n+1}(x)$, not only
the coefficient $\alpha_{n+1}$ changes, but also possibly the coefficients $\alpha_0, \ldots, \alpha_n$.

If the system is orthogonal, this does not happen ($\alpha_0, \ldots, \alpha_n$ do not change)
because of the extremal property of Fourier coefficients.

For example, one could pass from the system of monomials $\{x^k\}$ to the
orthogonal system of Legendre polynomials and expand $f(x) = |x|$ in a Fourier series in
that system.
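The change of the monomial coefficients can be observed exactly for $f(x) = |x|$. The sketch below is an illustration under stated assumptions: the helper names are hypothetical, the best coefficients are found from the Gram system of Example 9 by Gauss-Jordan elimination, and the inner products $\langle x^i, x^j \rangle$ and $\langle |x|, x^j \rangle$ on $[-1, 1]$ are entered in closed form. The Legendre polynomials are taken in the usual normalization, $P_2(x) = (3x^2 - 1)/2$ and $P_4(x) = (35x^4 - 30x^2 + 3)/8$.

```python
from fractions import Fraction


def solve(matrix, rhs):
    """Gauss-Jordan elimination with exact rational arithmetic."""
    n = len(rhs)
    rows = [list(row) + [b] for row, b in zip(matrix, rhs)]
    for col in range(n):
        pivot = next(r for r in range(col, n) if rows[r][col] != 0)
        rows[col], rows[pivot] = rows[pivot], rows[col]
        rows[col] = [v / rows[col][col] for v in rows[col]]
        for r in range(n):
            if r != col:
                factor = rows[r][col]
                rows[r] = [vr - factor * vc for vr, vc in zip(rows[r], rows[col])]
    return [row[n] for row in rows]


def monomial_inner(i, j):
    """<x^i, x^j> on [-1, 1]: 2/(i+j+1) if i+j is even, otherwise 0."""
    return Fraction(2, i + j + 1) if (i + j) % 2 == 0 else Fraction(0)


def abs_inner(j):
    """<|x|, x^j> on [-1, 1]: 2/(j+2) if j is even, otherwise 0."""
    return Fraction(2, j + 2) if j % 2 == 0 else Fraction(0)


def best_monomial_coefficients(n):
    """Coefficients of the best mean-square approximation of |x| of degree <= n."""
    gram = [[monomial_inner(i, j) for j in range(n + 1)] for i in range(n + 1)]
    rhs = [abs_inner(i) for i in range(n + 1)]
    return solve(gram, rhs)


def legendre_coefficient(p):
    """Fourier coefficient <|x|, P>/<P, P> for a polynomial P given by its coefficients."""
    numerator = sum(pj * abs_inner(j) for j, pj in enumerate(p))
    denominator = sum(pi * pj * monomial_inner(i, j)
                      for i, pi in enumerate(p) for j, pj in enumerate(p))
    return numerator / denominator


P0 = [Fraction(1)]
P2 = [Fraction(-1, 2), Fraction(0), Fraction(3, 2)]
P4 = [Fraction(3, 8), Fraction(0), Fraction(-30, 8), Fraction(0), Fraction(35, 8)]

print(best_monomial_coefficients(2))  # constant term 3/16
print(best_monomial_coefficients(4))  # constant term changes to 15/128
print(legendre_coefficient(P0), legendre_coefficient(P2), legendre_coefficient(P4))
```

Raising the degree from 2 to 4 changes the constant term of the best monomial combination from $3/16$ to $15/128$, whereas the Legendre coefficients $1/2$, $5/8$, $-3/16$ are computed one at a time and are unaffected by how many further terms are included.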


18.1.3 *An Important Source of Orthogonal Systems of Functions 
in Analysis 


We now give an idea as to how various orthogonal systems of functions and Fourier 
series in those systems arise in specific problems. 


Example 15 (The Fourier method) Let us regard the closed interval $[0, l]$ as the
equilibrium position of a homogeneous elastic string fastened at the endpoints of
this interval, but otherwise free and capable of making small transverse oscillations
about this equilibrium position. Let $u(x, t)$ be a function that describes these
oscillations, that is, at each fixed instant of time $t = t_0$ the graph of the function $u(x, t_0)$
over the closed interval $0 \leq x \leq l$ gives the shape of the string at time $t_0$. This, in
particular, means that $u(0, t) = u(l, t) = 0$ at every instant $t$, since the ends of the
string are clamped.

It is known (see for example Sect. 14.4) that the function $u(x, t)$ satisfies the
equation

$$\frac{\partial^2 u}{\partial t^2} = a^2 \frac{\partial^2 u}{\partial x^2}, \tag{18.21}$$

where the positive coefficient $a$ depends on the density and elastic constant of the
string.

Equation (18.21) alone is of course insufficient to determine the function $u(x, t)$.
From experiment we know that the motion $u(x, t)$ is uniquely determined if, for
example, we prescribe the position $u(x, 0) = \varphi(x)$ of the string at some time $t = 0$
(which we shall call the initial instant) and the velocity $\frac{\partial u}{\partial t}(x, 0) = \psi(x)$ of the
points of the string at that time. Thus, if we stretch the string into the shape $\varphi(x)$
and let it go, then $\psi(x) \equiv 0$.

Hence the problem of free oscillations of the string$^7$ that is fixed at the ends of
the closed interval $[0, l]$ has been reduced to finding a solution $u(x, t)$ of Eq. (18.21)
together with the boundary conditions

$$u(0, t) = u(l, t) = 0 \tag{18.22}$$


7 We note that the foundations of the mathematical investigation of the oscillations of a string were 
laid by Brook Taylor. 
and the initial conditions

$$u(x, 0) = \varphi(x), \qquad \frac{\partial u}{\partial t}(x, 0) = \psi(x). \tag{18.23}$$


To solve such problems there exists a very natural procedure called the method
of separation of variables or the Fourier method in mathematics. It consists of the
following. The solution $u(x, t)$ is sought in the form of a series $\sum_{n=1}^{\infty} X_n(x) T_n(t)$
whose terms $X(x)T(t)$ are solutions of an equation of special form (with variables
separated) and satisfy the boundary conditions. In the present case, as we see, this
is equivalent to expanding the oscillations $u(x, t)$ into a sum of simple harmonic
oscillations (more precisely a sum of standing waves).

Indeed, if the function $X(x)T(t)$ satisfies Eq. (18.21), then $X(x)T''(t) = a^2 X''(x)T(t)$,
that is,

$$\frac{T''(t)}{a^2 T(t)} = \frac{X''(x)}{X(x)}. \tag{18.24}$$

In Eq. (18.24) the independent variables $x$ and $t$ are on opposite sides of the
equation (they have been separated), and therefore both sides actually represent the
same constant, which we write as $-\lambda$ (the sign is chosen so that the oscillatory solutions
below correspond to $\lambda > 0$). If we also take into account the boundary conditions
$X(0)T(t) = X(l)T(t) = 0$ that the solution of stationary type must satisfy, we see that finding
such a solution reduces to solving simultaneously the two equations

$$T''(t) = -\lambda a^2 T(t), \tag{18.25}$$

$$X''(x) = -\lambda X(x) \tag{18.26}$$

under the condition that $X(0) = X(l) = 0$.

It is easy to write the general solution of each of these equations individually:

$$T(t) = A \cos \sqrt{\lambda}\, a t + B \sin \sqrt{\lambda}\, a t, \tag{18.27}$$

$$X(x) = C \cos \sqrt{\lambda}\, x + D \sin \sqrt{\lambda}\, x. \tag{18.28}$$

If we attempt to satisfy the conditions $X(0) = X(l) = 0$, we find that for $\lambda \neq 0$ we
must have $C = 0$, and, rejecting the trivial solution $D = 0$, we find that $\sin \sqrt{\lambda}\, l = 0$,
from which we find $\sqrt{\lambda} = \pm n\pi/l$, $n \in \mathbb{N}$.

Thus it turns out that the number $\lambda$ in Eqs. (18.25) and (18.26) can be chosen only
among a certain special series of numbers (the so-called eigenvalues of the problem),
$\lambda_n = (n\pi/l)^2$, where $n \in \mathbb{N}$. Substituting these values of $\lambda$ into the expressions
(18.27) and (18.28), we obtain a series of special solutions

$$u_n(x, t) = \sin n\frac{\pi}{l} x \left( A_n \cos n\frac{\pi a}{l} t + B_n \sin n\frac{\pi a}{l} t \right) \tag{18.29}$$

satisfying the boundary conditions $u_n(0, t) = u_n(l, t) = 0$ (and describing a
standing wave of the form $\Phi(x) \cdot \sin(\omega t + \theta)$, in which each point $x \in [0, l]$ undergoes
simple harmonic oscillations with its own amplitude $\Phi(x)$ but the same frequency
$\omega$ for all points).




The quantities $\omega_n = n\frac{\pi a}{l}$, $n \in \mathbb{N}$, are called, for natural reasons, the natural
frequencies of the string, and its simplest harmonic oscillations (18.29) are called the
natural oscillations of the string. The oscillation $u_1(x, t)$ with smallest natural
frequency is often called the fundamental tone of the string and the other natural
oscillations $u_2(x, t), u_3(x, t), \ldots$ are called overtones (it is the overtones that form
the sound quality, called the timbre, characteristic of each particular musical
instrument).

We now wish to represent the oscillation $u(x, t)$ we are seeking as a sum
$\sum_{n=1}^{\infty} u_n(x, t)$ of the natural oscillations of the string. The boundary conditions
(18.22) are automatically satisfied in this case, and we need worry only about the
initial conditions (18.23), which mean that

$$\varphi(x) = \sum_{n=1}^{\infty} A_n \sin n\frac{\pi}{l} x \tag{18.30}$$

and

$$\psi(x) = \sum_{n=1}^{\infty} n\frac{\pi a}{l} B_n \sin n\frac{\pi}{l} x. \tag{18.31}$$

Thus the problem has been reduced to finding the coefficients $A_n$ and $B_n$, which
up to now have been free, or, what is the same, to expanding the functions $\varphi$ and $\psi$
in Fourier series in the system $\{\sin n\frac{\pi}{l} x;\ n \in \mathbb{N}\}$, which is orthogonal on the interval
$[0, l]$.
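To make the last step concrete, the coefficients $A_n$ can be computed as Fourier coefficients in the sine system, using $\int_0^l \sin^2 n\frac{\pi}{l}x\,dx = \frac{l}{2}$. The sketch below assumes, purely for illustration, the initial shape $\varphi(x) = x(l - x)$, zero initial velocity (so that all $B_n = 0$), and $l = a = 1$; the helper names are hypothetical and the integrals are approximated by Simpson's rule.

```python
import math


def integrate(g, a, b, n=2000):
    """Composite Simpson rule on [a, b] with n (even) subintervals."""
    h = (b - a) / n
    total = g(a) + g(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * g(a + i * h)
    return total * h / 3


def sine_coefficient(phi, n, length):
    """Fourier coefficient of phi in the system sin(n*pi*x/length) on [0, length]."""
    integral = integrate(lambda x: phi(x) * math.sin(n * math.pi * x / length), 0.0, length)
    return 2.0 * integral / length


def string_shape(x, t, coeffs, length, a):
    """Partial sum of the standing waves (18.29) with all B_n = 0."""
    return sum(A * math.sin(n * math.pi * x / length) * math.cos(n * math.pi * a * t / length)
               for n, A in enumerate(coeffs, start=1))


length, a = 1.0, 1.0                  # illustrative values of l and a


def phi(x):
    # Hypothetical initial shape; it vanishes at both ends of the string.
    return x * (length - x)


coeffs = [sine_coefficient(phi, n, length) for n in range(1, 20)]
print(coeffs[0], 8 / math.pi ** 3)    # A_1 next to its closed form 8/pi^3
print(string_shape(0.5, 0.0, coeffs, length, a), phi(0.5))
```

For this $\varphi$ the even-numbered coefficients vanish and $A_n = 8l^2/(n\pi)^3$ for odd $n$, so already a few terms reproduce the initial shape closely.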


It is useful to remark that the functions $\{\sin n\frac{\pi}{l} x;\ n \in \mathbb{N}\}$, which arose from
Eq. (18.26), can be regarded as eigenvectors of the linear operator $A = \frac{d^2}{dx^2}$
corresponding to the eigenvalues $-\left(n\frac{\pi}{l}\right)^2$, which in turn arose from the condition that the
operator $A$ acts on the space of functions in $C^{(2)}[0, l]$ that vanish at the endpoints
of the closed interval $[0, l]$. Hence Eqs. (18.30) and (18.31) can be interpreted as
expansions in eigenvectors of this linear operator.

The linear operators connected with particular problems are one of the main 
sources of orthogonal systems of functions in analysis. 

We now recall another fact known from algebra, which reveals the reason why 
such systems are orthogonal. 

Let $Z$ be a vector space with inner product $\langle\,,\rangle$, and let $E$ be a subspace (possibly
equal to $Z$ itself) that is dense in $Z$. A linear operator $A : E \to Z$ is symmetric
if $\langle Ax, y \rangle = \langle x, Ay \rangle$ for every pair of vectors $x, y \in E$. Then: eigenvectors of a
symmetric operator corresponding to different eigenvalues are orthogonal.


Proof Indeed, if $Au = \alpha u$ and $Av = \beta v$, and $\alpha \neq \beta$, then

$$\alpha \langle u, v \rangle = \langle Au, v \rangle = \langle u, Av \rangle = \beta \langle u, v \rangle,$$

from which it follows that $\langle u, v \rangle = 0$. □
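A finite-dimensional analogue may help to see this fact at work. The sketch below is an assumption-laden illustration, not part of the argument above: it replaces the second derivative by the symmetric tridiagonal second-difference matrix with zero boundary values, whose eigenvectors are sampled sine functions. The helper names and the size `n` are hypothetical choices.

```python
import math


def dot(u, v):
    """Standard inner product in R^n."""
    return sum(a * b for a, b in zip(u, v))


def apply_second_difference(v):
    """Apply the symmetric tridiagonal matrix with rows (1, -2, 1), zero boundary values."""
    n = len(v)
    return [(v[j - 1] if j > 0 else 0.0) - 2.0 * v[j] + (v[j + 1] if j < n - 1 else 0.0)
            for j in range(n)]


def sine_vector(k, n):
    """Samples of sin(k*pi*x) at the n interior grid points j/(n+1), j = 1..n."""
    return [math.sin(k * math.pi * (j + 1) / (n + 1)) for j in range(n)]


n = 8
u, v = sine_vector(2, n), sine_vector(5, n)

# u is an eigenvector with eigenvalue 2*cos(2*pi/(n+1)) - 2.
eigenvalue = 2.0 * math.cos(2 * math.pi / (n + 1)) - 2.0
residual = max(abs(au - eigenvalue * uj) for au, uj in zip(apply_second_difference(u), u))

print(residual)    # zero up to rounding: A u = eigenvalue * u
print(dot(u, v))   # zero up to rounding: different eigenvalues, orthogonal eigenvectors
```

The two sampled sine vectors belong to different eigenvalues of a symmetric matrix, and their inner product vanishes up to rounding error.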
It is now useful to look at Example 3 from this point of view. There we were
essentially considering the eigenfunctions of the operator $A = \left( \frac{d^2}{dx^2} + q(x) \right)$
operating on the space of functions in $C^{(2)}[a, b]$ that vanish at the endpoints of the closed
interval $[a, b]$. Through integration by parts one can verify that this operator is
symmetric on this space (with respect to the standard inner product (18.4)), so that the
result of Example 4 is a particular manifestation of this algebraic fact.

In particular, when $q(x) \equiv 0$ the operator $A$ becomes $\frac{d^2}{dx^2}$, which for $[a, b] = [0, l]$
occurred in the last example (Example 15).

We note also that in this example the question reduced to expanding the functions
$\varphi$ and $\psi$ (see relations (18.30) and (18.31)) in a series of eigenfunctions of the
operator $A = \frac{d^2}{dx^2}$. Here of course the question arises whether it is theoretically possible
to form such an expansion, and this question is equivalent, as we now understand,
to the question of the completeness of the system of eigenfunctions for the operator
in question in the given space of functions.

The completeness of the trigonometric system (and certain other particular
systems of orthogonal functions) in $\mathcal{R}_2[-\pi, \pi]$ seems to have been stated explicitly
for the first time by Lyapunov.$^8$ The completeness of the trigonometric system
in particular was implicitly present in the work of Dirichlet devoted to studying 
the convergence of trigonometric series. Parseval’s equality, which is equivalent to 
completeness for the trigonometric system, as already noted, was discovered by 
Parseval at the turn of the nineteenth century. In its general form, the question of 
completeness of orthogonal systems and their application in the problems of math- 
ematical physics were one of the main subjects of the research of Steklov, 9 who 
introduced the very concept of completeness (closedness) of an orthogonal sys- 
tem into mathematics. In studying completeness problems, by the way, he made 
active use of the method of integral averaging (smoothing) of a function (see 
Sects. 17.4 and 17.5), which for that reason is often called the Steklov averaging 
method. 


18.1.4 Problems and Exercises 

1. The method of least squares. The dependence $y = f(x_1, \ldots, x_n)$ of the quantity
$y$ on the quantities $x_1, \ldots, x_n$ is studied experimentally. As a result of $m$ ($> n$)

8 A.M. Lyapunov (1857-1918) - Russian mathematician and specialist in mechanics, a brilliant 
representative of the Chebyshev school, creator of the theory of stability of motion. He successfully 
studied various areas of mathematics and mechanics. 

9 V.A. Steklov (1864-1926) - Russian/Soviet mathematician, a representative of the Petersburg 
mathematical school founded by Chebyshev and founder of the school of mathematical physics in 
the USSR. The Mathematical Institute of the Russian Academy of Sciences bears his name. 
experiments, a table was obtained

$$\begin{array}{cccc|c}
x_1 & x_2 & \cdots & x_n & y \\ \hline
a_1^1 & a_2^1 & \cdots & a_n^1 & b^1 \\
\vdots & \vdots & & \vdots & \vdots \\
a_1^m & a_2^m & \cdots & a_n^m & b^m
\end{array}$$

each of whose rows contains a set $(a_1^i, a_2^i, \ldots, a_n^i)$ of values of the parameters
$x_1, x_2, \ldots, x_n$ and the value $b^i$ of the quantity $y$ corresponding to them, measured by
some device with a certain precision. From these experimental data we would like to
obtain an empirical formula of the form $y = \sum_{i=1}^{n} \alpha_i x_i$ convenient for computation.
The coefficients $\alpha_1, \alpha_2, \ldots, \alpha_n$ of the required linear function are to be chosen so
as to minimize the quantity $\sqrt{\sum_{k=1}^{m} \left( b^k - \sum_{i=1}^{n} \alpha_i a_i^k \right)^2}$, which is the mean-square
deviation of the data obtained using the empirical formula from the results obtained
in the experiments.

Interpret this problem as the problem of best approximation of the vector
$(b^1, \ldots, b^m)$ by linear combinations of the vectors $(a_i^1, \ldots, a_i^m)$, $i = 1, \ldots, n$, and
show that the question reduces to solving a system of linear equations of the same
type as Eq. (18.18).

2. a) Let $C[a, b]$ be the vector space of functions that are continuous on the closed
interval $[a, b]$ with the metric of uniform convergence and $C_2[a, b]$ the same vector
space but with the metric of mean-square deviation on that closed interval (that is,
$d(f, g) = \sqrt{\int_a^b |f - g|^2(x) \, dx}$). Show that if functions converge in $C[a, b]$, they
also converge in $C_2[a, b]$, but not conversely, and that the space $C_2[a, b]$ is not
complete, in contrast to $C[a, b]$.

b) Explain why the system of functions $\{1, x, x^2, \ldots\}$ is linearly independent
and complete in $C_2[a, b]$, but is not a basis in that space.

c) Explain why the Legendre polynomials are a complete orthogonal system and
also a basis in $C_2[-1, 1]$.

d) Find the first four terms of the Fourier expansion of the function $\sin \pi x$ on
the interval $[-1, 1]$ in the system of Legendre polynomials.

e) Show that the square of the norm $\|P_n\|$ in $C_2[-1, 1]$ of the $n$th Legendre
polynomial is

$$\frac{2}{2n + 1} \left( = (-1)^n \frac{(n + 1)(n + 2) \cdots 2n}{n! \, 2^{2n}} \int_{-1}^{1} (x^2 - 1)^n \, dx \right).$$

f) Prove that among all polynomials of given degree $n$ with leading coefficient 1,
the Legendre polynomial $P_n(x)$ is the one closest to zero on the interval $[-1, 1]$.

g) Explain why the equality

$$\int_{-1}^{1} |f|^2(x) \, dx = \sum_{n=0}^{\infty} \left( n + \frac{1}{2} \right) \left| \int_{-1}^{1} f(x) P_n(x) \, dx \right|^2,$$

where $\{P_0, P_1, \ldots\}$ is the system of Legendre polynomials, necessarily holds for
every function $f \in C_2([-1, 1], \mathbb{C})$.

3. a) Show that if the system $\{x_1, x_2, \ldots\}$ of vectors is complete in the space $X$ and
$X$ is an everywhere-dense subset of $Y$, then $\{x_1, x_2, \ldots\}$ is also complete in $Y$.

b) Prove that the vector space $C[a, b]$ of functions that are continuous on the
closed interval $[a, b]$ is everywhere dense in the space $\mathcal{R}_2[a, b]$. (It was asserted in
Problem 5g of Sect. 17.5 that this is true even for infinitely differentiable functions
of compact support on $[a, b]$.)

c) Using the Weierstrass approximation theorem, prove that the trigonometric
system $\{1, \cos kx, \sin kx;\ k \in \mathbb{N}\}$ is complete in $\mathcal{R}_2[-\pi, \pi]$.

d) Show that the systems $\{1, x, x^2, \ldots\}$ and $\{1, \cos kx, \sin kx;\ k \in \mathbb{N}\}$ are both
complete in $\mathcal{R}_2[-\pi, \pi]$, but the first is not a basis in this space and the second is.

e) Explain why Parseval's equality

$$\frac{1}{\pi} \int_{-\pi}^{\pi} |f|^2(x) \, dx = \frac{|a_0|^2}{2} + \sum_{k=1}^{\infty} |a_k|^2 + |b_k|^2$$

holds, where the numbers $a_k$ and $b_k$ are defined by (18.9) and (18.10).

f) Using the result of Example 8, now show that $1 + \frac{1}{2^2} + \frac{1}{3^2} + \cdots = \frac{\pi^2}{6}$.

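4. Orthogonality with a weight function.

a) Let $p_0, p_1, \ldots, p_n$ be continuous functions that are positive in the domain $D$.
Verify that the formula

$$\langle f, g \rangle = \sum_{k=0}^{n} \int_D p_k(x) f^{(k)}(x) \bar{g}^{(k)}(x) \, dx$$

defines an inner product in $C^{(n)}(D, \mathbb{C})$.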

b) Show that when functions that differ only on sets of measure zero are
identified, the inner product

$$\langle f, g \rangle = \int_D p(x) f(x) \bar{g}(x) \, dx$$

involving a positive continuous function $p$ can be introduced in the space $\mathcal{R}_2(D, \mathbb{C})$.

The function $p$ here is called a weight function, and if $\langle f, g \rangle = 0$, we say that the
functions $f$ and $g$ are orthogonal with weight $p$.

c) Let $\varphi : D \to G$ be a diffeomorphism of the domain $D \subset \mathbb{R}^n$ onto the domain
$G \subset \mathbb{R}^n$, and let $\{u_k(y);\ k \in \mathbb{N}\}$ be a system of functions in $G$ that is orthogonal
with respect to the standard inner product (18.2) or (18.3). Construct a system of
functions that are orthogonal in $D$ with weight $p(x) = |\det \varphi'(x)|$ and also a system
of functions that are orthogonal in $D$ in the sense of the standard inner product.

d) Show that the system of functions $\{e_{m,n}(x, y) = e^{i(mx + ny)};\ m, n \in \mathbb{N}\}$ is
orthogonal on the square $I = \{(x, y) \in \mathbb{R}^2 \mid |x| \leq \pi \wedge |y| \leq \pi\}$.
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e) Construct a system of functions orthogonal on the two-dimensional torus 
T 2 C R 2 defined by the parametric equations given in Example 4 of Sect. 12.1. 
The inner product of functions / and g on the torus is understood as the surface 
integral j j2 fg da. 

5. a) It is known from algebra (and we have also proved it in the course of discussing
constrained extremal problems) that every symmetric operator $A : E^n \to E^n$
on $n$-dimensional Euclidean space $E^n$ has nonzero eigenvectors. In the
infinite-dimensional case this is generally not so.

Show that the linear operator $f(x)\mapsto xf(x)$ of multiplication by the independent
variable is symmetric in $C_2([a,b],\mathbb{R})$, but has no nonzero eigenvectors.

b) A Sturm¹⁰-Liouville problem that often arises in the equations of mathematical
physics is to find a nonzero solution of an equation $u''(x) + [q(x) + \lambda p(x)]u(x) = 0$
on the interval $[a,b]$ satisfying certain boundary conditions, for
example $u(a) = u(b) = 0$.

Here it is assumed that the functions $p(x)$ and $q(x)$ are known and continuous
on the interval $[a,b]$ in question and that $p(x) > 0$ on $[a,b]$.

We have encountered such a problem in Example 15, where it was necessary
to solve Eq. (18.26) under the condition $X(0) = X(l) = 0$. In this case we had
$q(x) \equiv 0$, $p(x) \equiv 1$, and $[a,b] = [0,l]$. We have verified that a Sturm-Liouville
problem may in general turn out to be solvable only for certain special values of
the parameter $\lambda$, which are therefore called the eigenvalues of the corresponding
Sturm-Liouville problem.

Show that if the functions $f$ and $g$ are solutions of a Sturm-Liouville problem
corresponding to eigenvalues $\lambda_f \ne \lambda_g$, then the equality
$\frac{d}{dx}(g'f - f'g) = (\lambda_f - \lambda_g)pfg$ holds on $[a,b]$ and the functions $f$ and $g$ are
orthogonal on $[a,b]$ with weight $p$.

c) It is known (see Sect. 14.4) that the small oscillations of an inhomogeneous
string fastened at the ends of the closed interval $[a,b]$ are described by the equation
$(pu'_x)'_x = \rho u''_{tt}$, where $u = u(x,t)$ is the function that gives the shape of the string
at each time $t$, $\rho = \rho(x)$ is the linear density, and $p = p(x)$ is the elastic constant at
the point $x\in[a,b]$. The clamping conditions mean that $u(a,t) = u(b,t) = 0$.

Show that if we seek the solution of this equation in the form $X(x)T(t)$, the
question reduces to a system $T'' = \lambda T$, $(pX')' = \lambda\rho X$, in which $\lambda$ is the same
number in both equations.

Thus a Sturm-Liouville problem arises for the function $X(x)$ on the closed
interval $[a,b]$, which is solvable only for particular values of the parameter $\lambda$ (the
eigenvalues). (Assuming that $p(x) > 0$ on $[a,b]$ and that $p\in C^{(1)}[a,b]$ we can
obviously bring the equation $(pX')' = \lambda\rho X$ into a form in which the first derivative is
missing by the change of variable $x \mapsto \int_a^x \frac{d\xi}{p(\xi)}$.)


10 J.Ch.F. Sturm (1803-1855) - French mathematician (and, as it happens, an honorary foreign 
member of the Petersburg Academy of Sciences); his main work was in the solution of boundary- 
value problems for the equations of mathematical physics.

d) Verify that the operator $S(u) = (p(x)u'(x))' - q(x)u(x)$ on the space of functions
in $C^{(2)}[a,b]$ that satisfy the condition $u(a) = u(b) = 0$ is symmetric on this
space. (That is, $\langle Su, v\rangle = \langle u, Sv\rangle$, where $\langle\,,\rangle$ is the standard inner product of
real-valued functions.) Verify also that the eigenfunctions of the operator $S$
corresponding to different eigenvalues are orthogonal.

e) Show that the solutions $X_1$ and $X_2$ of the equation $(pX')' = \lambda\rho X$ corresponding
to different values $\lambda_1$ and $\lambda_2$ of the parameter $\lambda$ and vanishing at the endpoints
of the closed interval $[a,b]$ are orthogonal on $[a,b]$ with weight $\rho(x)$.

6. The Legendre polynomials as eigenfunctions. 

a) Using the expression of the Legendre polynomials $P_n(x)$ given in Example 5
and the equality $(x^2-1)^n = (x-1)^n(x+1)^n$, show that $P_n(1) = 1$.

b) By differentiating the identity $(x^2-1)\frac{d}{dx}(x^2-1)^n = 2nx(x^2-1)^n$, show
that $P_n(x)$ satisfies the equation

$$(x^2-1)\cdot P_n''(x) + 2x\cdot P_n'(x) - n(n+1)P_n(x) = 0.$$

c) Verify that the operator

$$A := (x^2-1)\frac{d^2}{dx^2} + 2x\frac{d}{dx} = \frac{d}{dx}\Bigl[(x^2-1)\frac{d}{dx}\Bigr]$$

is symmetric in the space $C_2^{(2)}[-1,1]\subset\mathcal{R}_2[-1,1]$. Then, starting from the relation
$A(P_n) = n(n+1)P_n$, explain why the Legendre polynomials are orthogonal.

d) Using the completeness of the system $\{1, x, x^2, \ldots\}$ in $C_2^{(2)}[-1,1]$, show that
the dimension of the eigenspace of the operator $A$ corresponding to the eigenvalue
$n(n+1)$ cannot be larger than 1.

e) Prove that the operator $A = \frac{d}{dx}\bigl[(x^2-1)\frac{d}{dx}\bigr]$ cannot have eigenfunctions in the
space $C_2^{(2)}[-1,1]$ except those in the system $\{P_0(x), P_1(x), \ldots\}$ of Legendre
polynomials, nor any eigenvalues different from the numbers $\{n(n+1);\ n = 0, 1, 2, \ldots\}$.

7. Spherical functions. 

a) In solving various problems in $\mathbb{R}^3$ (for example, problems of potential theory
connected with Laplace’s equation $\Delta u = 0$) the solutions are sought in the form of
a series of solutions of a special form. Such solutions are taken to be homogeneous
polynomials $S_n(x,y,z)$ of degree $n$ satisfying the equation $\Delta u = 0$. Such polynomials
are called harmonic polynomials. In spherical coordinates $(r,\varphi,\theta)$ a harmonic
polynomial $S_n(x,y,z)$ obviously has the form $r^nY_n(\theta,\varphi)$. The functions $Y_n(\theta,\varphi)$
that arise in this way, depending only on the coordinates $0\le\theta\le\pi$ and $0\le\varphi\le 2\pi$
on the sphere, are called spherical functions. (They are trigonometric polynomials
in two variables with $2n+1$ free coefficients in $Y_n$, this number coming from the
condition $\Delta S_n = 0$.)

Using Green’s formula, show that for $m \ne n$ the functions $Y_m$ and $Y_n$ are
orthogonal on the unit sphere in $\mathbb{R}^3$ (in the sense of the inner product
$\langle Y_m, Y_n\rangle = \iint Y_m\cdot Y_n\,d\sigma$, where the surface integral extends over the sphere $r = 1$).

b) Starting from the Legendre polynomials, one can also introduce the polynomials
$P_{n,m}(x) = (1-x^2)^{m/2}\frac{d^mP_n}{dx^m}(x)$, $m = 1, 2, \ldots, n$, and consider the functions

$$P_n(\cos\theta),\quad P_{n,m}(\cos\theta)\cos m\varphi,\quad P_{n,m}(\cos\theta)\sin m\varphi. \tag{$*$}$$

It turns out that every spherical function $Y_n(\theta,\varphi)$ with index $n$ is a linear
combination of these functions. Accepting this result and taking account of the
orthogonality of the trigonometric system, show that the functions of the system $(*)$ form
an orthogonal basis in the $(2n+1)$-dimensional space of spherical functions of a
given index $n$.

8. The Hermite polynomials. In the study of the equation of a linear oscillator in
quantum mechanics it becomes necessary to consider functions of class $C^{(2)}(\mathbb{R})$
with the inner product $\langle f,g\rangle = \int_{-\infty}^{+\infty} f\bar g\,dx$ in $C^{(2)}(\mathbb{R})\subset\mathcal{R}_2(\mathbb{R},\mathbb{C})$, and also the
special functions $H_n(x) = (-1)^ne^{x^2}\frac{d^n}{dx^n}\bigl(e^{-x^2}\bigr)$, $n = 0, 1, 2, \ldots$

a) Show that $H_0(x) = 1$, $H_1(x) = 2x$, $H_2(x) = 4x^2 - 2$.

b) Prove that $H_n(x)$ is a polynomial of degree $n$. The system of functions
$\{H_0(x), H_1(x), \ldots\}$ is called the system of Hermite polynomials.

c) Verify that the function $H_n(x)$ satisfies the equation
$H_n''(x) - 2xH_n'(x) + 2nH_n(x) = 0$.

d) The functions $\psi_n(x) = e^{-x^2/2}H_n(x)$ are called the Hermite functions. Show
that $\psi_n''(x) + (2n+1-x^2)\psi_n(x) = 0$, and that $\psi_n(x)\to 0$ as $x\to\infty$.

e) Verify that $\int_{-\infty}^{+\infty}\psi_n\psi_m\,dx = 0$ if $m\ne n$.

f) Show that the Hermite polynomials are orthogonal on $\mathbb{R}$ with weight $e^{-x^2}$.

9. The Chebyshev-Laguerre¹¹ polynomials $\{L_n(x);\ n = 0, 1, 2, \ldots\}$ can be defined
by the formula $L_n(x) := e^x\frac{d^n(x^ne^{-x})}{dx^n}$.

Verify that

a) $L_n(x)$ is a polynomial of degree $n$;

b) the function $L_n(x)$ satisfies the equation

$$xL_n''(x) + (1-x)L_n'(x) + nL_n(x) = 0;$$

c) the system $\{L_n;\ n = 0, 1, 2, \ldots\}$ of Chebyshev-Laguerre polynomials is
orthogonal with weight $e^{-x}$ on the half-line $[0,+\infty[$.

10. The Chebyshev polynomials $\{T_0(x) = 1,\ T_n(x) = 2^{1-n}\cos n(\arccos x);\ n\in\mathbb{N}\}$
for $|x| < 1$ can be defined by the formula


Tn(x) = 


(2/7)! dx" 


(1 



Show that: 


11 E.N. Laguerre (1834-1886) - French mathematician.

a) $T_n(x)$ is a polynomial of degree $n$;

b) $T_n(x)$ satisfies the equation

$$(1-x^2)T_n''(x) - xT_n'(x) + n^2T_n(x) = 0;$$

c) the system $\{T_n;\ n = 0, 1, 2, \ldots\}$ of Chebyshev polynomials is orthogonal with
weight $p(x) = \frac{1}{\sqrt{1-x^2}}$ on the interval $]-1,1[$.

11. a) In probability theory and theory of functions one encounters the following
system of Rademacher¹² functions: $\{\psi_n(x) = \varphi(2^nx);\ n = 0, 1, 2, \ldots\}$, where
$\varphi(t) = \operatorname{sgn}(\sin 2\pi t)$. Verify that this is an orthonormal system on the closed interval
$[0,1]$.

b) The system of Haar¹³ functions $\{\chi_{n,k}(x)\}$, where $n = 0, 1, 2, \ldots$ and
$k = 1, 2, 2^2, \ldots$, is defined by the relations

$$\chi_{n,k}(x) = \begin{cases} 1, & \text{if } \dfrac{2k-2}{2^{n+1}} < x < \dfrac{2k-1}{2^{n+1}},\\[2mm] -1, & \text{if } \dfrac{2k-1}{2^{n+1}} < x < \dfrac{2k}{2^{n+1}},\\[2mm] 0 & \text{at all other points of } [0,1]. \end{cases}$$

Verify that the Haar system is orthogonal on the closed interval $[0,1]$.

12. a) Show that every $n$-dimensional vector space with an inner product is isometrically
isomorphic to the arithmetic Euclidean space $\mathbb{R}^n$ of the same dimension.

b) We recall that a metric space is called separable if it contains a countable
everywhere-dense subset. Prove that if a vector space with inner product is separable
as a metric space with the metric induced by the inner product, then it has a
countable orthonormal basis.

c) Let $X$ be a separable Hilbert space (that is, $X$ is a separable and complete
metric space with the metric induced by the inner product in $X$). Taking an
orthonormal basis $\{e_i;\ i\in\mathbb{N}\}$ in $X$, we construct the mapping $X\ni x\mapsto(c_1, c_2, \ldots)$,
where $c_i = \langle x, e_i\rangle$ are the Fourier coefficients of the expansion of the vector $x$ in the
basis $\{e_i\}$. Show that this mapping is a bijective, linear, and isometric mapping of $X$
onto the space $l_2$ considered in Example 14.

d) Using Fig. 18.1, exhibit the basic idea of the construction of Example 14,
and explain why it comes about precisely because the space in question is
infinite-dimensional.

e) Explain how to construct an analogous example in the space of functions
$C[a,b]\subset\mathcal{R}_2[a,b]$.


12 H.A. Rademacher (1892-1969) - German mathematician (American after 1936). 
13 A. Haar (1885-1933) - Hungarian mathematician.

18.2 Trigonometric Fourier Series

18.2.1 Basic Types of Convergence of Classical Fourier Series 


a. Trigonometric Series and Trigonometric Fourier Series 


A classical trigonometric series is a series of the form¹⁴

$$\frac{a_0}{2} + \sum_{k=1}^{\infty} a_k\cos kx + b_k\sin kx, \tag{18.32}$$

obtained on the basis of the trigonometric system $\{1, \cos kx, \sin kx;\ k\in\mathbb{N}\}$. The
coefficients $\{a_0, a_k, b_k;\ k\in\mathbb{N}\}$ here are real or complex numbers. The partial sums
of the trigonometric series (18.32) are the trigonometric polynomials

$$T_n(x) = \frac{a_0}{2} + \sum_{k=1}^{n} a_k\cos kx + b_k\sin kx \tag{18.33}$$

corresponding to degree $n$.

If the series (18.32) converges pointwise on $\mathbb{R}$, its sum $f(x)$ is obviously a function
of period $2\pi$ on $\mathbb{R}$. It is completely determined by its restriction to any closed
interval of length $2\pi$.

Conversely, given a function of period $2\pi$ on $\mathbb{R}$ (oscillations, a signal, and the
like) that we wish to expand into a sum of certain canonical periodic functions, the
first claimants for such canonical status are the simplest functions of period $2\pi$,
namely $\{1, \cos kx, \sin kx;\ k\in\mathbb{N}\}$, which are simple harmonic oscillations with integer
frequencies.

Suppose we have succeeded in representing a continuous function as the sum

$$f(x) = \frac{a_0}{2} + \sum_{k=1}^{\infty} a_k\cos kx + b_k\sin kx \tag{18.34}$$

of a trigonometric series that converges uniformly to it. Then the coefficients of the
expansion (18.34) can be easily and uniquely found.

Multiplying Eq. (18.34) successively by each function of the system

$$\{1, \cos kx, \sin kx;\ k\in\mathbb{N}\},$$

using the fact that termwise integration is possible in the resulting uniformly
convergent series, and taking account of the relations

$$\int_{-\pi}^{\pi} 1^2\,dx = 2\pi,$$

$$\int_{-\pi}^{\pi}\cos mx\cos nx\,dx = \int_{-\pi}^{\pi}\sin mx\sin nx\,dx = 0\quad\text{for } m\ne n,\ m,n\in\mathbb{N},$$

$$\int_{-\pi}^{\pi}\cos^2 nx\,dx = \int_{-\pi}^{\pi}\sin^2 nx\,dx = \pi,\quad n\in\mathbb{N},$$

we find the coefficients

$$a_k = a_k(f) = \frac{1}{\pi}\int_{-\pi}^{\pi} f(x)\cos kx\,dx,\quad k = 0, 1, \ldots, \tag{18.35}$$

$$b_k = b_k(f) = \frac{1}{\pi}\int_{-\pi}^{\pi} f(x)\sin kx\,dx,\quad k = 1, 2, \ldots \tag{18.36}$$

of the expansion (18.34) of the function $f$ in a trigonometric series.

14 Writing the constant term in the form $a_0/2$, which is convenient for Fourier series, is not
obligatory here.

We have arrived at the same coefficients that we would have had if we had regarded
(18.34) as the expansion of the vector $f\in\mathcal{R}_2[-\pi,\pi]$ in the orthogonal
system $\{1, \cos kx, \sin kx;\ k\in\mathbb{N}\}$. This is not surprising, since the uniform
convergence of the series (18.34) of course implies convergence in the mean on the closed
interval $[-\pi,\pi]$, and then the coefficients of (18.34) must be the Fourier coefficients
of the function $f$ in the given orthogonal system (see Sect. 18.1).


Definition 1 If the integrals (18.35) and (18.36) have meaning for a function $f$,
then the trigonometric series

$$f \sim \frac{a_0(f)}{2} + \sum_{k=1}^{\infty} a_k(f)\cos kx + b_k(f)\sin kx \tag{18.37}$$

assigned to $f$ is called the trigonometric Fourier series of $f$.


Since there will be no Fourier series in this section except trigonometric Fourier 
series, we shall occasionally allow ourselves to drop the word “trigonometric” and 
speak of just “the Fourier series of /”. 

In the main we shall be dealing with functions of class $\mathcal{R}([-\pi,\pi],\mathbb{C})$, or,
slightly more generally, with functions whose squared absolute values are integrable
(possibly in the improper sense) on the open interval $]-\pi,\pi[$. We retain our
previous notation $\mathcal{R}_2[-\pi,\pi]$ to denote the vector space of such functions with the
standard inner product

$$\langle f,g\rangle = \int_{-\pi}^{\pi} f\bar g\,dx. \tag{18.38}$$

Bessel’s inequality

$$\frac{|a_0(f)|^2}{2} + \sum_{k=1}^{\infty}|a_k(f)|^2 + |b_k(f)|^2 \le \frac{1}{\pi}\int_{-\pi}^{\pi}|f|^2(x)\,dx, \tag{18.39}$$

which holds for every function $f\in\mathcal{R}_2([-\pi,\pi],\mathbb{C})$, shows that by no means every
trigonometric series (18.32) can be the Fourier series of some function
$f\in\mathcal{R}_2[-\pi,\pi]$.


Example 1 The trigonometric series

$$\sum_{k=1}^{\infty}\frac{\sin kx}{\sqrt{k}},$$

as we already know (see Example 7 of Sect. 16.2) converges on $\mathbb{R}$, but is not the
Fourier series of any function $f\in\mathcal{R}_2[-\pi,\pi]$, since the series $\sum_{k=1}^{\infty}\bigl(\frac{1}{\sqrt{k}}\bigr)^2$
diverges.

Thus, arbitrary trigonometric series (18.32) will not be studied here, only Fourier
series (18.37) of functions in $\mathcal{R}_2[-\pi,\pi]$, and functions that are absolutely
integrable on $]-\pi,\pi[$.


b. Mean Convergence of a Trigonometric Fourier Series 

Let

$$S_n(x) = \frac{a_0(f)}{2} + \sum_{k=1}^{n} a_k(f)\cos kx + b_k(f)\sin kx \tag{18.40}$$

be the $n$th partial sum of the Fourier series of the function $f\in\mathcal{R}_2[-\pi,\pi]$. The
deviation of $S_n$ from $f$ can be measured both in the natural metric of the space
$\mathcal{R}_2[-\pi,\pi]$ induced by the inner product (18.38), that is, in the sense of the
mean-square deviation

$$\|f - S_n\| = \sqrt{\int_{-\pi}^{\pi}|f - S_n|^2(x)\,dx} \tag{18.41}$$

of $S_n$ from $f$ on the interval $[-\pi,\pi]$, and in the sense of pointwise convergence on
that interval.

The first of these two kinds of convergence was studied for an arbitrary series
in Sect. 18.1. Making the results obtained there specific in the context of a
trigonometric Fourier series involves first of all noting that the trigonometric system
$\{1, \cos kx, \sin kx;\ k\in\mathbb{N}\}$ is complete in $\mathcal{R}_2[-\pi,\pi]$. (This has already been noted in
Sect. 18.1 and will be proved independently in Sect. 18.2.4 of the present section.)

Hence, the fundamental theorem of Sect. 18.1 enables us to say in the present 
case that the following theorem is true. 

Theorem 1 (Mean convergence of a trigonometric Fourier series) The Fourier series
(18.37) of any function $f\in\mathcal{R}_2([-\pi,\pi],\mathbb{C})$ converges to the function in the
mean (18.41), that is,

$$f(x) \underset{\mathcal{R}_2}{=} \frac{a_0(f)}{2} + \sum_{k=1}^{\infty} a_k(f)\cos kx + b_k(f)\sin kx,$$

and Parseval’s equality holds:

$$\frac{1}{\pi}\int_{-\pi}^{\pi}|f|^2(x)\,dx = \frac{|a_0(f)|^2}{2} + \sum_{k=1}^{\infty}|a_k(f)|^2 + |b_k(f)|^2. \tag{18.42}$$
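As a numerical illustration of Parseval's equality (18.42), the following minimal sketch takes the function $f(x) = x$, whose coefficients $a_k = 0$, $b_k = 2(-1)^{k+1}/k$ are those of the series (18.60) quoted later in this section. The function names and the number of terms are illustrative choices, not part of the text.

```python
import math

def fourier_coeffs_of_x(k):
    """Return (a_k, b_k) for f(x) = x on [-pi, pi]: a_k = 0, b_k = 2(-1)^(k+1)/k."""
    return 0.0, 2.0 * (-1) ** (k + 1) / k

def parseval_rhs(n_terms):
    """Partial sum of |a_0|^2/2 + sum_{k>=1} (|a_k|^2 + |b_k|^2); here a_0 = 0."""
    total = 0.0
    for k in range(1, n_terms + 1):
        a_k, b_k = fourier_coeffs_of_x(k)
        total += a_k ** 2 + b_k ** 2
    return total

# Left-hand side of (18.42): (1/pi) * integral of x^2 over [-pi, pi] = 2*pi^2/3.
lhs = 2.0 * math.pi ** 2 / 3.0
rhs = parseval_rhs(100_000)
gap = lhs - rhs  # tail of the series 4/k^2, of order 4/n_terms
print(lhs, rhs, gap)
```

The gap between the two sides is the tail of the series $\sum 4/k^2$, so it is positive and shrinks roughly like $4/N$ for $N$ terms.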

We shall often use the more compact complex notation for trigonometric polynomials
and trigonometric series, based on the Euler formulas $e^{ix} = \cos x + i\sin x$,
$\cos x = \frac12(e^{ix} + e^{-ix})$, $\sin x = \frac{1}{2i}(e^{ix} - e^{-ix})$. Using them, we can write the partial
sum (18.40) of the Fourier series as

$$S_n(x) = \sum_{k=-n}^{n} c_ke^{ikx}, \tag{18.40'}$$

and the series (18.37) itself as

$$f \sim \sum_{-\infty}^{+\infty} c_ke^{ikx}, \tag{18.37'}$$

where

$$c_k = \begin{cases}\frac12(a_k - ib_k), & \text{if } k > 0,\\ \frac12a_0, & \text{if } k = 0,\\ \frac12(a_{-k} + ib_{-k}), & \text{if } k < 0,\end{cases} \tag{18.43}$$

that is,

$$c_k = c_k(f) = \frac{1}{2\pi}\int_{-\pi}^{\pi} f(x)e^{-ikx}\,dx,\quad k\in\mathbb{Z}, \tag{18.44}$$

and hence the numbers $c_k$ are simply the Fourier coefficients of $f$ in the system
$\{e^{ikx};\ k\in\mathbb{Z}\}$.

We call attention to the fact that summation of the Fourier series (18.37') is
understood in the sense of the convergence of the sums (18.40').
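The relation between the real coefficients and the complex ones in (18.43) and (18.44) can be checked numerically. The sketch below is only an illustration under stated assumptions: it uses the sample function $f(x) = x$ (with $a_k = 0$, $b_k = 2(-1)^{k+1}/k$), a plain midpoint rule for the integral, and hypothetical helper names `c_numeric` and `c_from_real`.

```python
import cmath
import math

def c_numeric(f, k, samples=20_000):
    """Midpoint-rule approximation of (1/2pi) * int_{-pi}^{pi} f(x) e^{-ikx} dx, cf. (18.44)."""
    h = 2.0 * math.pi / samples
    total = 0j
    for j in range(samples):
        x = -math.pi + (j + 0.5) * h
        total += f(x) * cmath.exp(-1j * k * x)
    return total * h / (2.0 * math.pi)

def c_from_real(a, b, k):
    """Formula (18.43); a(k) and b(k) return the real-form coefficients a_k and b_k."""
    if k > 0:
        return 0.5 * (a(k) - 1j * b(k))
    if k == 0:
        return 0.5 * a(0)
    return 0.5 * (a(-k) + 1j * b(-k))

# Illustrative choice f(x) = x, for which a_k = 0 and b_k = 2(-1)^(k+1)/k.
a = lambda k: 0.0
b = lambda k: 2.0 * (-1) ** (k + 1) / k

for k in (-3, -1, 0, 2, 5):
    print(k, c_numeric(lambda x: x, k), c_from_real(a, b, k))
```

For this $f$ both computations give $c_k = i(-1)^k/k$ for $k \ne 0$ and $c_0 = 0$, up to the quadrature error.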

In complex notation Theorem 1 means that for every function $f\in\mathcal{R}_2([-\pi,\pi],\mathbb{C})$

$$f(x) \underset{\mathcal{R}_2}{=} \sum_{-\infty}^{\infty} c_k(f)e^{ikx}$$

and

$$\frac{1}{2\pi}\|f\|^2 = \sum_{-\infty}^{\infty}|c_k(f)|^2. \tag{18.45}$$

c. Pointwise Convergence of a Trigonometric Fourier Series

Theorem 1 gives a complete solution to the problem of mean convergence of a
Fourier series (18.37), that is, convergence in the norm of the space $\mathcal{R}_2[-\pi,\pi]$.
The remainder of this section will be mainly devoted to studying the conditions for
and the nature of pointwise convergence of a trigonometric series. We shall consider
only the simplest aspects of this problem. The study of pointwise convergence of a
trigonometric series, as a rule, is such a delicate matter that, despite the traditional
central position occupied by Fourier series after the work of Euler, Fourier, and
Riemann, there is still no intrinsic description of the class of functions that can be
represented by trigonometric series converging to them at every point (the Riemann
problem). Until recently it was not even known whether the Fourier series of a continuous
function must converge to it almost everywhere (it was known that convergence need
not occur at every point). Previously A.N. Kolmogorov¹⁵ had even given an example
of a function $f\in L[-\pi,\pi]$ whose Fourier series diverged everywhere (where
$L[-\pi,\pi]$ is the space of Lebesgue-integrable functions on the interval $[-\pi,\pi]$,
obtainable as the metric completion of the space $\mathcal{R}[-\pi,\pi]$), and D.E. Men’shov¹⁶
constructed a trigonometric series (18.32) with coefficients not all zero that nevertheless
converged to zero almost everywhere (Men’shov’s null-series). The problem
posed by N.N. Luzin¹⁷ (Luzin’s problem) of determining whether the Fourier series
of every function $f\in L_2[-\pi,\pi]$ converges almost everywhere (where $L_2[-\pi,\pi]$
is the metric completion of $\mathcal{R}_2[-\pi,\pi]$) was answered in the affirmative only in
1966 by L. Carleson.¹⁸ It follows in particular from Carleson’s result that the Fourier
series of every function $f\in\mathcal{R}_2[-\pi,\pi]$ (for example a continuous function) must
converge at almost all points of the closed interval $[-\pi,\pi]$.


18.2.2 Investigation of Pointwise Convergence of a Trigonometric 
Fourier Series 

a. Integral Representation of the Partial Sum of a Fourier Series 

15 A.N. Kolmogorov (1903-1987) - outstanding Soviet scholar, who worked in probability
theory, mathematical statistics, theory of functions, functional analysis, topology, logic,
differential equations, and applied mathematics.

16 D.E. Men’shov (1892-1988) - Soviet mathematician, one of the greatest specialists in the theory
of functions of a real variable.

17 N.N. Luzin (1883-1950) - Russian/Soviet mathematician, one of the greatest specialists in the
theory of functions of a real variable, founder of the large Moscow mathematical school
(“Lusitania”).

18 L. Carleson (b. 1928) - outstanding Swedish mathematician whose main works are in various
areas of modern analysis.

Let us now turn our attention to the partial sum (18.40) of the Fourier series (18.37).
Using the complex notation (18.40') for it and the expression (18.44) for the Fourier
coefficients, we make the following transformations:

$$S_n(x) = \sum_{k=-n}^{n}\Bigl(\frac{1}{2\pi}\int_{-\pi}^{\pi} f(t)e^{-ikt}\,dt\Bigr)e^{ikx} = \frac{1}{2\pi}\int_{-\pi}^{\pi} f(t)\Bigl(\sum_{k=-n}^{n}e^{ik(x-t)}\Bigr)dt. \tag{18.46}$$


But

$$D_n(u) := \sum_{k=-n}^{n}e^{iku} = \frac{e^{i(n+1)u} - e^{-inu}}{e^{iu} - 1} = \frac{e^{i(n+\frac12)u} - e^{-i(n+\frac12)u}}{e^{\frac{i}{2}u} - e^{-\frac{i}{2}u}}, \tag{18.47}$$

and, as can be seen from the very definition, $D_n(u) = 2n+1$ if $e^{iu} = 1$.
Hence

$$D_n(u) = \frac{\sin(n+\frac12)u}{\sin\frac12u}, \tag{18.48}$$

where the ratio is regarded as $2n+1$ when the denominator of the fraction becomes
zero.

Continuing the computation (18.46), we now have

$$S_n(x) = \frac{1}{2\pi}\int_{-\pi}^{\pi} f(t)D_n(x-t)\,dt. \tag{18.49}$$

We have thus represented $S_n(x)$ as the convolution of the function $f$ with the
function (18.48), which is called the Dirichlet kernel.

As can be seen from the original definition (18.47) of the function $D_n(u)$, the
Dirichlet kernel is of period $2\pi$ and even, and, in addition,

$$\frac{1}{2\pi}\int_{-\pi}^{\pi}D_n(u)\,du = \frac{1}{\pi}\int_{0}^{\pi}D_n(u)\,du = 1. \tag{18.50}$$
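The properties of the Dirichlet kernel just listed are easy to test numerically. The following minimal sketch (helper names and sample counts are illustrative assumptions) compares the defining sum (18.47) with the closed form (18.48) and approximates the mean value in (18.50) by a midpoint rule.

```python
import cmath
import math

def dirichlet_sum(n, u):
    """D_n(u) computed from the definition (18.47) as the sum of e^{iku}, k = -n..n."""
    return sum(cmath.exp(1j * k * u) for k in range(-n, n + 1)).real

def dirichlet_closed(n, u):
    """D_n(u) from the closed form (18.48), with the value 2n+1 where sin(u/2) = 0."""
    s = math.sin(u / 2.0)
    if abs(s) < 1e-12:
        return float(2 * n + 1)
    return math.sin((n + 0.5) * u) / s

def mean_value(n, samples=20_000):
    """Midpoint-rule approximation of (1/2pi) * int_{-pi}^{pi} D_n(u) du, cf. (18.50)."""
    h = 2.0 * math.pi / samples
    total = sum(dirichlet_closed(n, -math.pi + (j + 0.5) * h) for j in range(samples))
    return total * h / (2.0 * math.pi)

print(dirichlet_sum(5, 0.7), dirichlet_closed(5, 0.7), mean_value(8))
```

Only the term $k = 0$ of the sum (18.47) has a nonzero integral over a period, which is why the mean value equals 1 for every $n$.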


Assuming the function $f$ is of period $2\pi$ on $\mathbb{R}$ or is extended from $[-\pi,\pi]$ to $\mathbb{R}$
so as to have period $2\pi$, and making a change of variable in (18.49), we obtain

$$S_n(x) = \frac{1}{2\pi}\int_{-\pi}^{\pi} f(x-t)D_n(t)\,dt = \frac{1}{2\pi}\int_{-\pi}^{\pi} f(x-t)\frac{\sin(n+\frac12)t}{\sin\frac12t}\,dt. \tag{18.51}$$

In carrying out the change of variable here, we used the fact that the integral of a
periodic function is the same over every interval whose length equals a period.

Taking account of the fact that $D_n(t)$ is an even function, we can rewrite
Eq. (18.51) as

$$S_n(x) = \frac{1}{2\pi}\int_{0}^{\pi}\bigl(f(x-t) + f(x+t)\bigr)D_n(t)\,dt = \frac{1}{2\pi}\int_{0}^{\pi}\bigl(f(x-t) + f(x+t)\bigr)\frac{\sin(n+\frac12)t}{\sin\frac12t}\,dt. \tag{18.52}$$


b. The Riemann-Lebesgue Lemma and the Localization Principle

The representation (18.52) for the partial sum of a trigonometric Fourier series, 
together with an observation of Riemann stated below, forms the basis for studying 
the pointwise convergence of a trigonometric Fourier series. 


Lemma 1 (Riemann-Lebesgue) If a locally integrable function $f : ]\omega_1,\omega_2[\,\to\mathbb{R}$ is
absolutely integrable (perhaps in the improper sense) on an open interval $]\omega_1,\omega_2[$,
then

$$\int_{\omega_1}^{\omega_2} f(x)e^{i\lambda x}\,dx \to 0\quad\text{as } \lambda\to\infty,\ \lambda\in\mathbb{R}. \tag{18.53}$$


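Proof If $]\omega_1,\omega_2[$ is a finite interval and $f(x)\equiv 1$, then Eq. (18.53) can be verified
by direct integration and passage to the limit.

We shall reduce the general case to this simplest one.

Fixing an arbitrary $\varepsilon > 0$, we first choose an interval $[a,b]\subset\,]\omega_1,\omega_2[$ such that
for every $\lambda\in\mathbb{R}$

$$\Bigl|\int_{\omega_1}^{\omega_2} f(x)e^{i\lambda x}\,dx - \int_{a}^{b} f(x)e^{i\lambda x}\,dx\Bigr| < \varepsilon. \tag{18.54}$$

In view of the estimates

$$\Bigl|\int_{\omega_1}^{\omega_2} f(x)e^{i\lambda x}\,dx - \int_{a}^{b} f(x)e^{i\lambda x}\,dx\Bigr| \le \Bigl|\int_{\omega_1}^{a} f(x)e^{i\lambda x}\,dx\Bigr| + \Bigl|\int_{b}^{\omega_2} f(x)e^{i\lambda x}\,dx\Bigr| \le \int_{\omega_1}^{a}|f|(x)\,dx + \int_{b}^{\omega_2}|f|(x)\,dx$$

and the absolute integrability of $f$ on $]\omega_1,\omega_2[$, there does of course exist such a
closed interval $[a,b]$.

Since $f\in\mathcal{R}([a,b],\mathbb{R})$ (more precisely $f|_{[a,b]}\in\mathcal{R}([a,b])$), there exists a lower
Darboux sum $\sum_{j=1}^{n}m_j\Delta x_j$, where $m_j = \inf_{x\in[x_{j-1},x_j]}f(x)$, such that

$$0 \le \int_{a}^{b} f(x)\,dx - \sum_{j=1}^{n}m_j\Delta x_j < \varepsilon.$$

Now introducing the piecewise constant function $g(x) = m_j$ for $x\in[x_{j-1},x_j[$,
$j = 1, \ldots, n$, we find that $g(x)\le f(x)$ on $[a,b]$ and

$$\Bigl|\int_{a}^{b} f(x)e^{i\lambda x}\,dx - \int_{a}^{b} g(x)e^{i\lambda x}\,dx\Bigr| \le \int_{a}^{b}|f(x) - g(x)|\,|e^{i\lambda x}|\,dx = \int_{a}^{b}\bigl(f(x) - g(x)\bigr)\,dx < \varepsilon, \tag{18.55}$$

but

$$\int_{a}^{b} g(x)e^{i\lambda x}\,dx = \sum_{j=1}^{n}\int_{x_{j-1}}^{x_j}m_je^{i\lambda x}\,dx = \frac{1}{i\lambda}\sum_{j=1}^{n}\bigl(m_je^{i\lambda x}\bigr)\Big|_{x_{j-1}}^{x_j} \to 0\quad\text{as } \lambda\to\infty,\ \lambda\in\mathbb{R}. \tag{18.56}$$

Comparing relations (18.53)-(18.56), we find what was asserted. □

Remark 1 Separating the real and imaginary parts in (18.53), we find that

$$\int_{\omega_1}^{\omega_2} f(x)\cos\lambda x\,dx \to 0\quad\text{and}\quad\int_{\omega_1}^{\omega_2} f(x)\sin\lambda x\,dx \to 0 \tag{18.57}$$

as $\lambda\to\infty$, $\lambda\in\mathbb{R}$. If the function $f$ in the preceding integrals had been complex,
then, separating the real and imaginary parts in them, we would have found that
relations (18.57), and consequently (18.53), would actually be valid for
complex-valued functions $f : ]\omega_1,\omega_2[\,\to\mathbb{C}$.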


Remark 2 If it is known that $f\in\mathcal{R}_2[-\pi,\pi]$, then by Bessel’s inequality (18.39)
we can conclude immediately that

$$\int_{-\pi}^{\pi} f(x)\cos nx\,dx \to 0\quad\text{and}\quad\int_{-\pi}^{\pi} f(x)\sin nx\,dx \to 0$$

as $n\to\infty$, $n\in\mathbb{N}$. Theoretically, we could have gotten by with just this discrete
version of the Riemann-Lebesgue lemma for the elementary investigations of the
classical Fourier series that will be carried out here.
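The decay asserted by the Riemann-Lebesgue lemma can be observed numerically. The sketch below is an illustration under stated assumptions: it takes the sample function $f(x) = 1/\sqrt{x}$ on $]0,1[$, which is unbounded but absolutely integrable in the improper sense, and approximates the integral in (18.53) by a midpoint rule; the helper name `oscillatory_integral` and the chosen values of $\lambda$ are not from the text.

```python
import cmath
import math

def oscillatory_integral(f, a, b, lam, samples=200_000):
    """Midpoint-rule approximation of int_a^b f(x) e^{i*lam*x} dx."""
    h = (b - a) / samples
    total = 0j
    for j in range(samples):
        x = a + (j + 0.5) * h  # midpoints never touch the singular endpoint
        total += f(x) * cmath.exp(1j * lam * x)
    return total * h

def f(x):
    """Absolutely integrable on ]0, 1[ in the improper sense, unbounded near 0."""
    return 1.0 / math.sqrt(x)

# Moduli of the integral for increasing lambda; they should decrease toward zero.
values = {lam: abs(oscillatory_integral(f, 0.0, 1.0, lam)) for lam in (1, 10, 100)}
print(values)
```

For this particular $f$ the modulus decays like $\lambda^{-1/2}$, which is slower than the $1/\lambda$ decay seen for a constant function; the lemma itself guarantees only convergence to zero, not a rate.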


Returning now to the integral representation (18.52) of the partial sum of a
Fourier series, we remark that if the function $f$ satisfies the hypotheses of the
Riemann-Lebesgue lemma, then, since $\sin\frac12t \ge \sin\frac12\delta > 0$ for $0 < \delta\le t\le\pi$, we
can use (18.57) to write

$$S_n(x) = \frac{1}{2\pi}\int_{0}^{\delta}\bigl(f(x-t) + f(x+t)\bigr)\frac{\sin(n+\frac12)t}{\sin\frac12t}\,dt + o(1)\quad\text{as } n\to\infty. \tag{18.58}$$

The important conclusion one can deduce from (18.58) is that the convergence
or divergence of a Fourier series at a point is completely determined by the behavior
of the function in an arbitrarily small neighborhood of the point.

We state this principle as the following proposition. 

Theorem 2 (Localization principle) Let $f$ and $g$ be real- or complex-valued locally
integrable functions on $]-\pi,\pi[$ and absolutely integrable on the whole interval
(possibly in the improper sense).

If the functions $f$ and $g$ are equal in any (arbitrarily small) neighborhood of the
point $x_0\in\,]-\pi,\pi[$, then their Fourier series

$$f(x) \sim \sum_{-\infty}^{+\infty}c_k(f)e^{ikx},\qquad g(x) \sim \sum_{-\infty}^{+\infty}c_k(g)e^{ikx}$$

either both converge or both diverge at $x_0$. When they converge, their limits are
equal.¹⁹

Remark 3 As can be seen from the reasoning used in deriving Eqs. (18.52) and
(18.58), the point $x_0$ in the localization principle may also be an endpoint of the
closed interval $[-\pi,\pi]$, but then (and this is essential!) in order for the periodic
extensions of the functions $f$ and $g$ from the closed interval $[-\pi,\pi]$ to $\mathbb{R}$ to be equal
on a neighborhood of $x_0$ it is necessary (and sufficient) that the original functions
be equal on a neighborhood of both endpoints of the closed interval $[-\pi,\pi]$.

c. Sufficient Conditions for a Fourier Series to Converge at a Point 

Definition 2 A function $f : \mathring{U}\to\mathbb{C}$ defined on a deleted neighborhood of a point
$x\in\mathbb{R}$ satisfies the Dini conditions at $x$ if

a) both one-sided limits

$$f(x_-) = \lim_{t\to+0}f(x-t),\qquad f(x_+) = \lim_{t\to+0}f(x+t)$$

exist at $x$;

b) the integral

$$\int_{+0}\frac{\bigl(f(x-t) - f(x_-)\bigr) + \bigl(f(x+t) - f(x_+)\bigr)}{t}\,dt$$

converges absolutely.²⁰

19 Although the limit need not be $f(x_0) = g(x_0)$.

20 What is meant is that the integral $\int_0^{\varepsilon}$ converges absolutely for some value $\varepsilon > 0$.

Example 2 If $f$ is a continuous function in $U(x)$ satisfying the Hölder condition


$$|f(x+t) - f(x)| \le M|t|^{\alpha},\quad 0 < \alpha\le 1,$$

then, since the estimate

$$\Bigl|\frac{f(x+t) - f(x)}{t}\Bigr| \le \frac{M}{|t|^{1-\alpha}}$$

now holds, the function $f$ satisfies the Dini conditions at $x$.

It is also clear that if a continuous function $f$ defined in a deleted neighborhood
$\mathring{U}(x)$ of $x$ has one-sided limits $f(x_-)$ and $f(x_+)$ and satisfies one-sided Hölder
conditions

$$|f(x+t) - f(x_+)| \le Mt^{\alpha},$$
$$|f(x-t) - f(x_-)| \le Mt^{\alpha},$$

where $t > 0$, $0 < \alpha\le 1$, and $M$ is a positive constant, then $f$ will satisfy the Dini
conditions for the same reason as above.

Definition 3 We shall call a real- or complex-valued function $f$ piecewise-continuous
on the closed interval $[a,b]$ if there is a finite set of points
$a = x_0 < x_1 < \cdots < x_n = b$ in this interval such that $f$ is defined on each interval $]x_{j-1},x_j[$,
$j = 1, \ldots, n$, and has one-sided limits on approach to its endpoints.

Definition 4 A function having a piecewise-continuous derivative on a closed interval
is piecewise continuously differentiable on that interval.

Example 3 If a function is piecewise continuously differentiable on a closed interval,
then it satisfies the Hölder conditions with exponent $\alpha = 1$ at every point of the
interval, as follows from Lagrange’s finite-increment (mean-value) theorem. Hence,
by Example 2, such a function satisfies Dini’s conditions at every point of the
interval. At the endpoints of the interval, of course, only the corresponding one-sided
pair of Dini’s conditions needs to be verified.

Example 4 The function $f(x) = \operatorname{sgn}x$ satisfies Dini’s conditions at every point
$x\in\mathbb{R}$, even at zero.

Theorem 3 (Sufficient conditions for convergence of a Fourier series at a point)
Let $f : \mathbb{R}\to\mathbb{C}$ be a function of period $2\pi$ that is absolutely integrable on the closed
interval $[-\pi,\pi]$. If $f$ satisfies the Dini conditions at a point $x\in\mathbb{R}$, then its Fourier
series converges at that point, and

$$\sum_{-\infty}^{+\infty}c_k(f)e^{ikx} = \frac{f(x_-) + f(x_+)}{2}. \tag{18.59}$$

Proof By relations (18.52) and (18.50)

$$S_n(x) - \frac{f(x_-) + f(x_+)}{2} = \frac{1}{\pi}\int_{0}^{\pi}\frac{\bigl(f(x-t) - f(x_-)\bigr) + \bigl(f(x+t) - f(x_+)\bigr)}{2\sin\frac12t}\,\sin\Bigl(n+\frac12\Bigr)t\,dt.$$

Since $2\sin\frac12t \sim t$ as $t\to+0$, by the Dini conditions and the Riemann-Lebesgue
lemma we see that this last integral tends to zero as $n\to\infty$. □


Remark 4 In connection with the theorem just proved and the localization principle, 
we note that changing the value of the function at a point has no influence on the 
Fourier coefficients or the Fourier series or the partial sums of the Fourier series. 
Therefore the convergence and the sum of such a series at a point is determined not 
by the particular value of the function at the point, but by the integral mean of its 
values in an arbitrarily small neighborhood of the point. It is this fact that is reflected 
in Theorem 3. 


Example 5 In Example 6 of Sect. 18.1 we found the Fourier series

$$x \sim \sum_{k=1}^{\infty}2\,\frac{(-1)^{k+1}}{k}\sin kx \tag{18.60}$$

for the function $f(x) = x$ on the closed interval $[-\pi,\pi]$. Extending the function
$f(x)$ periodically from the interval $]-\pi,\pi[$ to the whole real line, we may assume
that the series (18.60) is the Fourier series of this extended function. Then, on the
basis of Theorem 3 we find that

$$\sum_{k=1}^{\infty}2\,\frac{(-1)^{k+1}}{k}\sin kx = \begin{cases}x, & \text{if } |x| < \pi,\\ 0, & \text{if } |x| = \pi.\end{cases}$$

In particular, for $x = \frac{\pi}{2}$ it follows from this relation that

$$\sum_{n=0}^{\infty}\frac{(-1)^n}{2n+1} = \frac{\pi}{4}.$$

Example 6 Let aeR and |a| < 1. Consider the 27r -periodic function fix) defined 
on the closed interval [— tt, tt] by the formula fix) = cosax. 

By formulas (18.35) and (18.36) we find its Fourier coefficients 

1 ( n (— 1)" sin7ra 2a 

a n if) = — I cos ax cos nx dx = ■ — ^ r- , 

7 r J-x Tt a- — n- 

1 f 71 

b„if) = — I cosax sin nx dx = 0. 

J — TT 
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By Theorem 3 the following equality holds at each point x e [—tv, i r]: 


2a sunra / 1 


(- 1 )" 


cos ax = I — r + y — cos nx \ . 

n V 2 a z L — ' or — n z 

\ n= 1 

When x — n , this relation implies 

1 2a ^ 1 

cot7ra = — > —= 

na it L — ' a A — n z 


(18.61) 


n= 1 


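If $|\alpha| \le \alpha_0 < 1$, then $\left|\frac{1}{\alpha^2-n^2}\right| \le \frac{1}{n^2-\alpha_0^2}$, and hence the series on the right-hand side of Eq. (18.61) converges uniformly with respect to $\alpha$ on every closed interval $|\alpha| \le \alpha_0 < 1$. Hence it is legitimate to integrate it termwise, that is,

$$\int_0^x\left(\cot\pi\alpha - \frac{1}{\pi\alpha}\right)d\alpha = \sum_{n=1}^{\infty}\int_0^x\frac{2\alpha\,d\alpha}{\pi(\alpha^2-n^2)},$$

and

$$\frac{1}{\pi}\ln\frac{\sin\pi\alpha}{\pi\alpha}\bigg|_0^x = \frac{1}{\pi}\sum_{n=1}^{\infty}\ln\left|\alpha^2-n^2\right|\,\bigg|_0^x,$$

yielding

$$\ln\frac{\sin\pi x}{\pi x} = \sum_{n=1}^{\infty}\ln\left(1-\frac{x^2}{n^2}\right),$$

and finally,

$$\frac{\sin\pi x}{\pi x} = \prod_{n=1}^{\infty}\left(1-\frac{x^2}{n^2}\right)\quad\text{when } |x| < 1. \tag{18.62}$$

We have thus proved relation (18.62), which we mentioned earlier when deriving the complement formula for Euler's function $\Gamma(x)$ (Sect. 17.3).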
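As a sanity check on relation (18.62), one can compare a truncated product with $\sin\pi x/(\pi x)$. This is a minimal sketch; the helper names and the number of factors are assumptions made for illustration, and the truncated product differs from the limit by a relative error of order $x^2/N$.

```python
import math


def sine_product(x, n_factors):
    """Truncated product prod_{n=1}^{N} (1 - x^2 / n^2) from relation (18.62)."""
    prod = 1.0
    for n in range(1, n_factors + 1):
        prod *= 1.0 - (x * x) / (n * n)
    return prod


def sinc_pi(x):
    """sin(pi x) / (pi x), with the value 1 at x = 0."""
    return 1.0 if x == 0 else math.sin(math.pi * x) / (math.pi * x)


if __name__ == "__main__":
    for x in (0.25, 0.5, 0.9):
        print(x, sine_product(x, 100000), sinc_pi(x))
```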


d. Fejer’s Theorem 


Let us now consider the sequence of functions

$$\sigma_n(x) := \frac{S_0(x) + \cdots + S_n(x)}{n+1}$$

that are the arithmetic means of the corresponding partial sums $S_0(x), \ldots, S_n(x)$ of the trigonometric Fourier series (18.37) of a function $f : \mathbb{R} \to \mathbb{C}$ of period $2\pi$.
From the integral representation (18.51) of the partial sum of the Fourier series we have

$$\sigma_n(x) = \frac{1}{2\pi}\int_{-\pi}^{\pi} f(x-t)\,\mathcal{F}_n(t)\,dt,$$

where

$$\mathcal{F}_n(t) = \frac{1}{n+1}\bigl(D_0(t) + \cdots + D_n(t)\bigr).$$

Recalling the explicit form (18.48) of the Dirichlet kernel and taking account of the relation

$$\sum_{k=0}^{n}\sin\left(k+\tfrac12\right)t = \frac{1}{2\sin\frac12 t}\sum_{k=0}^{n}\bigl(\cos kt - \cos(k+1)t\bigr) = \frac{\sin^2\frac{n+1}{2}t}{\sin\frac12 t},$$

we find

$$\mathcal{F}_n(t) = \frac{\sin^2\frac{n+1}{2}t}{(n+1)\sin^2\frac12 t}.$$

The function $\mathcal{F}_n$ is called the Fejer kernel, more precisely the $n$th Fejer kernel.$^{21}$

Taking account of the original definition (18.47) of the Dirichlet kernel $D_n$, one can conclude that the Fejer kernel is a smooth function of period $2\pi$ whose value equals $(n+1)$ where the denominator of this last fraction equals zero.
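The closed form of the Fejer kernel can be compared numerically with the arithmetic mean of Dirichlet kernels. The sketch below assumes that definition (18.47) is the sum $D_n(t) = \sum_{k=-n}^{n} e^{ikt} = 1 + 2\sum_{k=1}^{n}\cos kt$, which is consistent with the closed form used above; the function names are illustrative only.

```python
import math


def dirichlet_kernel(n, t):
    """D_n(t) = 1 + 2 * sum_{k=1}^{n} cos(kt) (assumed form of definition (18.47))."""
    return 1.0 + 2.0 * sum(math.cos(k * t) for k in range(1, n + 1))


def fejer_kernel_mean(n, t):
    """Arithmetic mean (D_0(t) + ... + D_n(t)) / (n + 1)."""
    return sum(dirichlet_kernel(k, t) for k in range(n + 1)) / (n + 1)


def fejer_kernel_closed(n, t):
    """Closed form sin^2((n+1)t/2) / ((n+1) sin^2(t/2)), with value n + 1 where sin(t/2) = 0."""
    s = math.sin(t / 2.0)
    if abs(s) < 1e-12:
        return float(n + 1)
    return math.sin((n + 1) * t / 2.0) ** 2 / ((n + 1) * s * s)


if __name__ == "__main__":
    for t in (0.0, 0.3, 1.0, 2.5):
        print(t, fejer_kernel_mean(6, t), fejer_kernel_closed(6, t))
```

The two columns agree, and the closed form makes the nonnegativity of the kernel evident, which is the property the Dirichlet kernel lacks.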

The properties of the Fejer and Dirichlet kernels are similar in many ways, but in 
contrast to the Dirichlet kernel, the Fejer kernel is also nonnegative, so that we have 
the following lemma. 


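Lemma 2 The sequence of functions

$$\Delta_n(x) = \begin{cases} \frac{1}{2\pi}\mathcal{F}_n(x), & \text{if } |x| \le \pi,\\ 0, & \text{if } |x| > \pi \end{cases}$$

is an approximate identity on $\mathbb{R}$.

Proof The nonnegativity of $\Delta_n(x)$ is clear.
Equality (18.50) enables us to conclude that

$$\int_{-\infty}^{\infty}\Delta_n(x)\,dx = \int_{-\pi}^{\pi}\Delta_n(x)\,dx = \frac{1}{2\pi}\int_{-\pi}^{\pi}\mathcal{F}_n(x)\,dx = \frac{1}{2\pi(n+1)}\sum_{k=0}^{n}\int_{-\pi}^{\pi}D_k(x)\,dx = 1.$$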

21 L. Fejer (1880-1956) - well-known Hungarian mathematician. 
Finally, for every $\delta > 0$

$$\int_{-\infty}^{-\delta}\Delta_n(x)\,dx = \int_{\delta}^{+\infty}\Delta_n(x)\,dx = \int_{\delta}^{\pi}\Delta_n(x)\,dx \le \frac{1}{2\pi(n+1)}\int_{\delta}^{\pi}\frac{dx}{\sin^2\frac12 x} \to 0$$

as $n \to +\infty$. □

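Theorem 4 (Fejer) Let $f : \mathbb{R} \to \mathbb{C}$ be a function of period $2\pi$ that is absolutely integrable on the closed interval $[-\pi,\pi]$. If

a) $f$ is uniformly continuous on the set $E \subset \mathbb{R}$, then

$$\sigma_n(x) \rightrightarrows f(x)\quad\text{on } E \text{ as } n \to \infty;$$

b) $f \in C(\mathbb{R},\mathbb{C})$, then

$$\sigma_n(x) \rightrightarrows f(x)\quad\text{on } \mathbb{R} \text{ as } n \to \infty;$$

c) $f$ is continuous at the point $x \in \mathbb{R}$, then

$$\sigma_n(x) \to f(x)\quad\text{as } n \to \infty.$$

Proof Statements b) and c) are special cases of a).

Statement a) itself is a special case of the general Proposition 5 of Sect. 17.4 on the convergence of a convolution, since

$$\sigma_n(x) = \frac{1}{2\pi}\int_{-\pi}^{\pi} f(x-t)\,\mathcal{F}_n(t)\,dt = (f * \Delta_n)(x). \qquad □$$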

Corollary 1 (Weierstrass' theorem on approximation by trigonometric polynomials) If a function $f : [-\pi,\pi] \to \mathbb{C}$ is continuous on the closed interval $[-\pi,\pi]$ and $f(-\pi) = f(\pi)$, then this function can be approximated uniformly on the closed interval $[-\pi,\pi]$ with arbitrary precision by trigonometric polynomials.

Proof Extending $f$ as a function of period $2\pi$, we obtain a continuous $2\pi$-periodic function on $\mathbb{R}$, to which the trigonometric polynomials $\sigma_n(x)$ converge uniformly by Fejer's theorem. □

Corollary 2 If $f$ is continuous at $x$, its Fourier series either diverges at $x$ or converges to $f(x)$.

Proof Only the case of convergence requires formal verification. If the sequence $S_n(x)$ has a limit as $n \to \infty$, then the sequence $\sigma_n(x) = \frac{S_0(x)+\cdots+S_n(x)}{n+1}$ has that same limit. But by Fejer's theorem $\sigma_n(x) \to f(x)$ as $n \to \infty$, and hence $S_n(x) \to f(x)$ also whenever the limit of $S_n(x)$ exists as $n \to \infty$. □
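The difference between the partial sums $S_n$ and their arithmetic means $\sigma_n$ can be seen on a concrete function. The sketch below uses the square wave $\operatorname{sgn} x$ on $]-\pi,\pi[$, whose sine coefficients $b_k = 4/(\pi k)$ for odd $k$ (and $0$ for even $k$) are assumed known; the function names and grid are illustrative. Because the Fejer kernel is nonnegative with mean value 1, the means never exceed $\sup|f| = 1$, whereas the partial sums overshoot near the jump.

```python
import math


def square_wave_coeff(k):
    """Sine coefficient b_k of sgn(x) on ]-pi, pi[: 4 / (pi k) for odd k, 0 for even k."""
    return 4.0 / (math.pi * k) if k % 2 == 1 else 0.0


def partial_sum_and_fejer_mean(x, n):
    """Return (S_n(x), sigma_n(x)), where sigma_n is the mean of S_0(x), ..., S_n(x)."""
    s = 0.0      # running partial sum; S_0 = a_0 / 2 = 0 for this odd function
    total = 0.0  # running sum S_0 + S_1 + ... + S_k
    for k in range(1, n + 1):
        s += square_wave_coeff(k) * math.sin(k * x)
        total += s
    return s, total / (n + 1)


if __name__ == "__main__":
    grid = [j * math.pi / 2000 for j in range(1, 2000)]
    values = [partial_sum_and_fejer_mean(x, 51) for x in grid]
    print("max S_51     :", max(v[0] for v in values))
    print("max sigma_51 :", max(v[1] for v in values))
```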




Remark 5 We note that the Fourier series of a continuous function really can diverge 
at some points. 


18.2.3 Smoothness of a Function and the Rate of Decrease 
of the Fourier Coefficients 

a. An Estimate of the Fourier Coefficients of a Smooth Function 

We begin with a simple, yet important and useful lemma. 

Lemma 3 (Differentiation of a Fourier series) If a continuous function $f \in C([-\pi,\pi],\mathbb{C})$ assuming equal values at the endpoints of the closed interval $[-\pi,\pi]$ is piecewise continuously differentiable on $[-\pi,\pi]$, then the Fourier series of its derivative

$$f' \sim \sum_{-\infty}^{\infty} c_k(f')\,e^{ikx}$$

can be obtained by differentiating formally the Fourier series

$$f \sim \sum_{-\infty}^{\infty} c_k(f)\,e^{ikx}$$

of the function itself, that is,

$$c_k(f') = ik\,c_k(f),\quad k \in \mathbb{Z}. \tag{18.63}$$

Proof Starting from the definition of the Fourier coefficients (18.44), we find through integration by parts that

$$c_k(f') = \frac{1}{2\pi}\int_{-\pi}^{\pi} f'(x)e^{-ikx}\,dx = \frac{1}{2\pi}f(x)e^{-ikx}\Big|_{-\pi}^{\pi} + \frac{ik}{2\pi}\int_{-\pi}^{\pi} f(x)e^{-ikx}\,dx = ik\,c_k(f),$$

since $f(\pi)e^{-ik\pi} - f(-\pi)e^{ik\pi} = 0$. □
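Relation (18.63) can be checked numerically on a simple case. The sketch below takes $f(x) = x^2$, which satisfies $f(-\pi) = f(\pi)$ and has $f'(x) = 2x$, and approximates the coefficients by a midpoint rule; the quadrature routine, its name, and the number of nodes are choices made here for illustration.

```python
import cmath
import math


def fourier_coeff(func, k, n_nodes=20000):
    """Midpoint-rule approximation of c_k = (1 / 2pi) * int_{-pi}^{pi} func(x) e^{-ikx} dx."""
    h = 2.0 * math.pi / n_nodes
    total = 0j
    for j in range(n_nodes):
        x = -math.pi + (j + 0.5) * h
        total += func(x) * cmath.exp(-1j * k * x)
    return total * h / (2.0 * math.pi)


if __name__ == "__main__":
    for k in (1, 2, 5):
        lhs = fourier_coeff(lambda x: 2.0 * x, k)            # c_k(f')
        rhs = 1j * k * fourier_coeff(lambda x: x * x, k)     # ik c_k(f)
        print(k, lhs, rhs)
```

For this $f$ the exact values are $c_k(f) = 2(-1)^k/k^2$ for $k \ne 0$, so both sides equal $2i(-1)^k/k$.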

Proposition 1 (Connection between smoothness of a function and the rate of decrease of its Fourier coefficients) Let $f \in C^{(m-1)}([-\pi,\pi],\mathbb{C})$ and $f^{(j)}(-\pi) = f^{(j)}(\pi)$, $j = 0, 1, \ldots, m-1$. If the function $f$ has a piecewise-continuous derivative $f^{(m)}$ of order $m$ on the closed interval $[-\pi,\pi]$, then

$$c_k\bigl(f^{(m)}\bigr) = (ik)^m c_k(f),\quad k \in \mathbb{Z}, \tag{18.64}$$

and

$$|c_k(f)| = \frac{\gamma_k}{|k|^m}\quad\text{as } k \to \infty,\ k \in \mathbb{Z}; \tag{18.65}$$

moreover, $\sum_{-\infty}^{\infty}\gamma_k^2 < \infty$.

Proof Relation (18.64) follows from an $m$-fold application of Eq. (18.63):

$$c_k\bigl(f^{(m)}\bigr) = (ik)\,c_k\bigl(f^{(m-1)}\bigr) = \cdots = (ik)^m c_k(f).$$

Setting $\gamma_k = \bigl|c_k\bigl(f^{(m)}\bigr)\bigr|$ and using Bessel's inequality

$$\sum_{-\infty}^{\infty}\bigl|c_k\bigl(f^{(m)}\bigr)\bigr|^2 \le \frac{1}{2\pi}\int_{-\pi}^{\pi}\bigl|f^{(m)}\bigr|^2(x)\,dx,$$

we obtain (18.65) from (18.64). □


Remark 6 In the proposition just proved, as in Lemma 3, instead of assuming the conditions $f^{(j)}(-\pi) = f^{(j)}(\pi)$, we could have assumed that $f$ is a function of period $2\pi$ on the entire line.

Remark 7 If a trigonometric Fourier series were written in the form (18.37), rather than in the complex form (18.37'), it would be necessary to replace the simple relations (18.64) by noticeably more cumbersome equalities, whose meaning, however, would be the same: under these hypotheses a Fourier series can be differentiated termwise whichever form it is written in, (18.37) or (18.37'). As for the estimates of the Fourier coefficients $a_k(f)$ and $b_k(f)$ of (18.37), since $a_k(f) = c_k(f) + c_{-k}(f)$ and $b_k(f) = i\bigl(c_k(f) - c_{-k}(f)\bigr)$ (see formulas (18.43)), it follows from (18.65) that if a function $f$ satisfies the hypotheses of the proposition, then

$$|a_k(f)| \le \frac{\alpha_k}{k^m},\quad |b_k(f)| \le \frac{\beta_k}{k^m},\quad k \in \mathbb{N}, \tag{18.64'}$$

where $\sum_{k=1}^{\infty}\alpha_k^2 < \infty$ and $\sum_{k=1}^{\infty}\beta_k^2 < \infty$, and we can assume that $\alpha_k = \beta_k = \gamma_k + \gamma_{-k}$.


b. Smoothness of a Function and the Rate of Convergence of Its Fourier Series 

Theorem 5 If the function $f : [-\pi,\pi] \to \mathbb{C}$ is such that

a) $f \in C^{(m-1)}[-\pi,\pi]$, $m \in \mathbb{N}$,

b) $f^{(j)}(-\pi) = f^{(j)}(\pi)$, $j = 0, 1, \ldots, m-1$,

c) $f$ has a piecewise continuous $m$th derivative $f^{(m)}$ on $[-\pi,\pi]$, $m \ge 1$,

then the Fourier series of $f$ converges absolutely and uniformly on $[-\pi,\pi]$ to $f$, and the deviation of the $n$th partial sum $S_n(x)$ of the Fourier series from $f(x)$ has the following estimate on the entire interval:

$$|f(x) - S_n(x)| \le \frac{\varepsilon_n}{n^{m-1/2}},$$

where $\{\varepsilon_n\}$ is a sequence of positive numbers tending to zero.


Proof We write the partial sum (18.40) of the Fourier series in the compact notation (18.40'):

$$S_n(x) = \sum_{-n}^{n} c_k(f)\,e^{ikx}.$$

According to the assumptions on the function $f$ and Proposition 1 we have $|c_k(f)| = \gamma_k/|k|^m$. Since $0 \le \gamma_k/|k|^m \le \frac12\bigl(\gamma_k^2 + 1/k^{2m}\bigr)$ and $m \ge 1$, we have $\sum\gamma_k/|k|^m < \infty$. Hence the sequence $S_n(x)$ converges uniformly on $[-\pi,\pi]$ (by the Weierstrass M-test for series and the Cauchy criterion for sequences).

By Theorem 3 the limit $S(x)$ of $S_n(x)$ equals $f(x)$, since the function $f$ satisfies the Dini conditions at each point of the closed interval $[-\pi,\pi]$ (see Example 3). And, since $f(-\pi) = f(\pi)$, the function $f$ can be extended to $\mathbb{R}$ as a periodic function with the Dini conditions holding at each point $x \in \mathbb{R}$.

Now, using relation (18.65), we can proceed to obtain an estimate:

$$|f(x) - S_n(x)| = |S(x) - S_n(x)| = \Bigl|\sum_{\pm k=n+1}^{\infty} c_k(f)\,e^{ikx}\Bigr| \le \sum_{\pm k=n+1}^{\infty}|c_k(f)| = \sum_{\pm k=n+1}^{\infty}\frac{\gamma_k}{|k|^m} \le \Bigl(\sum_{\pm k=n+1}^{\infty}\gamma_k^2\Bigr)^{1/2}\Bigl(\sum_{\pm k=n+1}^{\infty}\frac{1}{k^{2m}}\Bigr)^{1/2}.$$

The first factor on the right-hand side of the Cauchy-Bunyakovskii inequality here tends to zero as $n \to \infty$, since $\sum_{-\infty}^{\infty}\gamma_k^2 < \infty$.
Next (see Fig. 18.2)

$$\sum_{k=n+1}^{\infty}\frac{1}{k^{2m}} \le \int_n^{\infty}\frac{dx}{x^{2m}} = \frac{1}{2m-1}\cdot\frac{1}{n^{2m-1}}.$$

We thus obtain the assertion of Theorem 5. □
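The rate in Theorem 5 can be illustrated with $f(x) = |x|$ on $[-\pi,\pi]$, which satisfies the hypotheses with $m = 1$. The sketch below assumes the standard expansion $|x| = \frac{\pi}{2} - \frac{4}{\pi}\sum_{k\ \mathrm{odd}}\frac{\cos kx}{k^2}$ and estimates the uniform deviation on a grid; the names and grid size are illustrative. According to the theorem, the deviation multiplied by $n^{1/2}$ should tend to zero.

```python
import math


def partial_sum_abs(x, n):
    """S_n(x) for f(x) = |x|: pi/2 - (4/pi) * sum over odd k <= n of cos(kx) / k^2."""
    s = math.pi / 2.0
    for k in range(1, n + 1, 2):
        s -= 4.0 / (math.pi * k * k) * math.cos(k * x)
    return s


def sup_error(n, grid=2001):
    """Maximum of | |x| - S_n(x) | over an equally spaced grid on [-pi, pi]."""
    xs = [-math.pi + 2.0 * math.pi * j / (grid - 1) for j in range(grid)]
    return max(abs(abs(x) - partial_sum_abs(x, n)) for x in xs)


if __name__ == "__main__":
    for n in (10, 100, 400):
        err = sup_error(n)
        # The last column plays the role of epsilon_n for m = 1.
        print(n, err, err * math.sqrt(n))
```

For this particular function the deviation actually decays like $1/n$, faster than the general bound $\varepsilon_n/\sqrt{n}$ guarantees.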


In connection with these results we now make a number of useful remarks. 
Fig. 18.2

Remark 8 One can now easily obtain again the Weierstrass approximation theorem stated in Corollary 1 from Theorem 5 (and Theorem 3, of which essential use was made in the proof of Theorem 5), independently of Fejer's theorem.

Proof It suffices to prove this result for real-valued functions. Using the uniform continuity of $f$ on $[-\pi,\pi]$, we approximate $f$ on this closed interval uniformly within $\varepsilon/2$ by a piecewise-linear continuous function $\varphi(x)$ assuming the same values as $f$ at the endpoints, that is, $\varphi(-\pi) = \varphi(\pi) = f(\pi)$ (Fig. 18.3). By Theorem 5 the Fourier series of $\varphi$ converges to $\varphi$ uniformly on the closed interval $[-\pi,\pi]$. Taking a partial sum of this series that differs from $\varphi(x)$ by less than $\varepsilon/2$, we obtain a trigonometric polynomial that approximates the original function $f$ within $\varepsilon$ on the whole interval $[-\pi,\pi]$. □

Remark 9 Let us assume that we have succeeded in representing a function $f$ having a jump singularity as the sum $f = \varphi + \psi$ of a certain smooth function $\psi$ and a simple function $\varphi$ having the same singularity as $f$ (Fig. 18.4a, c, b). Then the Fourier series of $f$ is the sum of the Fourier series of $\psi$, which converges rapidly and uniformly by Theorem 5, and the Fourier series of the function $\varphi$. The latter can be regarded as known, if we take the standard function $\varphi$ (shown in the figure as $\varphi(x) = -\pi - x$ for $-\pi < x < 0$ and $\varphi(x) = \pi - x$ for $0 < x < \pi$).

This observation can be used both in theoretical and computational problems connected with series (it is Krylov's$^{22}$ method of separating singularities and improving the convergence of series) and in the theory of trigonometric Fourier series itself (see for example the Gibbs$^{23}$ phenomenon, described below in Problem 11).

Fig. 18.4

22 A.N. Krylov (1863-1945) - Russian/Soviet specialist in mechanics and mathematics, who made a large contribution to computational mathematics, especially in methods of computing the elements of ships.

Remark 10 (Integration of a Fourier series) By Theorem 5 we can state and prove the following complement to Lemma 3 on differentiation of a Fourier series.

Proposition 2 If the function $f : [-\pi,\pi] \to \mathbb{C}$ is piecewise continuous, then after integration the correspondence $f(x) \sim \sum_{-\infty}^{\infty} c_k(f)\,e^{ikx}$ becomes the equality

$$\int_0^x f(t)\,dt = c_0(f)\,x + {\sum_{-\infty}^{\infty}}'\,\frac{c_k(f)}{ik}\bigl(e^{ikx} - 1\bigr),$$

where the prime indicates that the term with index $k = 0$ is omitted from the sum; the summation is the limit of the symmetric partial sums ${\sum_{-n}^{n}}'$, and the series converges uniformly on the closed interval $[-\pi,\pi]$.

Proof Consider the auxiliary function

$$F(x) = \int_0^x f(t)\,dt - c_0(f)\,x$$

on the interval $[-\pi,\pi]$. Obviously $F \in C[-\pi,\pi]$. Also $F(-\pi) = F(\pi)$, since

$$F(\pi) - F(-\pi) = \int_{-\pi}^{\pi} f(t)\,dt - 2\pi c_0(f) = 0,$$

as follows from the definition of $c_0(f)$. Since the derivative $F'(x) = f(x) - c_0(f)$ of the function $F$ is piecewise continuous, its Fourier series $\sum_{-\infty}^{\infty} c_k(F)\,e^{ikx}$ converges uniformly to $F$ on the interval $[-\pi,\pi]$ by Theorem 5. By Lemma 3 we have $c_k(F) = \frac{c_k(F')}{ik}$ for $k \ne 0$. But $c_k(F') = c_k(f)$ if $k \ne 0$. Now writing the equality $F(x) = \sum_{-\infty}^{\infty} c_k(F)\,e^{ikx}$ in terms of the function $f$ and noting that $F(0) = 0$, we obtain the assertion of the proposition. □


23 J.W. Gibbs (1839-1903) - American physicist and mathematician, one of the founders of ther- 
modynamics and statistical mechanics. 
18.2.4 Completeness of the Trigonometric System

a. The Completeness Theorem 

In conclusion we return once again from pointwise convergence of the Fourier series to its mean convergence (18.41). More precisely, using the facts we have accumulated on the nature of pointwise convergence of the Fourier series, we now give a proof of the completeness of the trigonometric system $\{1, \cos kx, \sin kx;\ k \in \mathbb{N}\}$ in $\mathcal{R}_2([-\pi,\pi],\mathbb{R})$ independent of the proof already given in the problems. In doing so, as in Sect. 18.2.1, we take $\mathcal{R}_2([-\pi,\pi],\mathbb{R})$ or $\mathcal{R}_2([-\pi,\pi],\mathbb{C})$ to mean the vector space of real- or complex-valued functions that are locally integrable on $]-\pi,\pi[$ and whose squared absolute values are integrable on $]-\pi,\pi[$ (possibly in the improper sense). This vector space is assumed to be endowed with the standard inner product (18.38) generating the norm in terms of which convergence is mean convergence (18.41).

The theorem we are about to prove asserts simply that the system of trigonometric functions is complete in $\mathcal{R}_2([-\pi,\pi],\mathbb{C})$. But we shall state the theorem in such a way that the statement itself will contain the key to the proof. It is based on the obvious fact that the property of completeness is transitive: if $A$ approximates $B$ and $B$ approximates $C$, then $A$ approximates $C$.

Theorem 6 (Completeness of the trigonometric system) Every function $f \in \mathcal{R}_2[-\pi,\pi]$ can be approximated arbitrarily closely in mean

a) by functions of compact support in $]-\pi,\pi[$ that are Riemann integrable over the closed interval $[-\pi,\pi]$;

b) by piecewise-constant functions of compact support on the closed interval $[-\pi,\pi]$;

c) by piecewise-linear continuous functions of compact support on the closed interval $[-\pi,\pi]$;

d) by trigonometric polynomials.

Proof Since it obviously suffices to prove the theorem for real-valued functions, we 
confine ourselves to this case. 

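a) It follows from the definition of the improper integral that

$$\int_{-\pi}^{\pi} f^2(x)\,dx = \lim_{\delta\to+0}\int_{-\pi+\delta}^{\pi-\delta} f^2(x)\,dx.$$

Hence, for every $\varepsilon > 0$ there exists $\delta > 0$ such that the function

$$f_\delta(x) = \begin{cases} f(x), & \text{if } |x| < \pi - \delta,\\ 0, & \text{if } \pi - \delta \le |x| \le \pi, \end{cases}$$

will differ in mean from $f$ on $[-\pi,\pi]$ by less than $\varepsilon$, since

$$\int_{-\pi}^{\pi}(f - f_\delta)^2(x)\,dx = \int_{-\pi}^{-\pi+\delta} f^2(x)\,dx + \int_{\pi-\delta}^{\pi} f^2(x)\,dx.$$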


b) It suffices to verify that every function of the form $f_\delta$ can be approximated in $\mathcal{R}_2([-\pi,\pi],\mathbb{R})$ by piecewise-constant functions of compact support in $[-\pi,\pi]$. But the function $f_\delta$ is Riemann-integrable on $[-\pi+\delta,\pi-\delta]$. Hence it is bounded there by a constant $M$ and moreover there exists a partition $-\pi+\delta = x_0 < x_1 < \cdots < x_n = \pi-\delta$ of this closed interval such that the corresponding lower Darboux sum $\sum_{i=1}^{n} m_i\Delta x_i$ of the function $f_\delta$ differs from the integral of $f_\delta$ over $[-\pi+\delta,\pi-\delta]$ by less than $\varepsilon > 0$.

Now setting

$$g(x) = \begin{cases} m_i, & \text{if } x \in\, ]x_{i-1},x_i[,\ i = 1,\ldots,n,\\ 0, & \text{at all other points of } [-\pi,\pi], \end{cases}$$

we obtain

$$\int_{-\pi}^{\pi}(f_\delta - g)^2(x)\,dx \le \int_{-\pi}^{\pi}|f_\delta - g|\,|f_\delta + g|(x)\,dx \le 2M\int_{-\pi+\delta}^{\pi-\delta}(f_\delta - g)(x)\,dx \le 2M\varepsilon,$$

and hence $f_\delta$ really can be approximated arbitrarily closely in the mean on $[-\pi,\pi]$ by piecewise-constant functions on the interval that vanish in a neighborhood of the endpoints of the interval.

c) It now suffices to learn how to approximate the functions in b) in mean. Let $g$ be such a function. All of its points of discontinuity $x_1, \ldots, x_n$ lie in the open interval $]-\pi,\pi[$. There are only finitely many of them, so that for every $\varepsilon > 0$ one can choose $\delta > 0$ so small that the $\delta$-neighborhoods of the points $x_1, \ldots, x_n$ are disjoint and contained strictly inside the interval $]-\pi,\pi[$, and $2\delta nM < \varepsilon$, where $M = \sup_{|x|\le\pi}|g(x)|$. Now replacing the function $g$ on $[x_i-\delta,x_i+\delta]$, $i = 1, \ldots, n$, by the linear interpolation between the values $g(x_i-\delta)$ and $g(x_i+\delta)$ that $g$ assumes at the endpoints of this interval, we obtain a piecewise linear continuous function $g_\delta$ that is of compact support in $[-\pi,\pi]$. By construction $|g_\delta(x)| \le M$ on $[-\pi,\pi]$, so that

$$\int_{-\pi}^{\pi}(g - g_\delta)^2(x)\,dx \le 2M\int_{-\pi}^{\pi}|g - g_\delta|(x)\,dx = 2M\sum_{i=1}^{n}\int_{x_i-\delta}^{x_i+\delta}|g - g_\delta|(x)\,dx \le 2M\cdot(2M\cdot 2\delta)\cdot n < 4M\varepsilon,$$

and the possibility of the approximation is now proved.
d) It remains only to show that one can approximate any function of class c) in mean on $[-\pi,\pi]$ by a trigonometric polynomial. But for every $\varepsilon > 0$ and every function of type $g_\delta$, Theorem 5 enables us to find a trigonometric polynomial $T_n$ that approximates $g_\delta$ uniformly within $\varepsilon$ on the closed interval $[-\pi,\pi]$. Hence $\int_{-\pi}^{\pi}(g_\delta - T_n)^2\,dx < 2\pi\varepsilon^2$, and the possibility of an arbitrarily precise approximation in mean by trigonometric polynomials on $[-\pi,\pi]$ for any function of class c) is now established.

By the triangle inequality in $\mathcal{R}_2[-\pi,\pi]$ we now conclude that all of Theorem 6 on the completeness of these classes in $\mathcal{R}_2[-\pi,\pi]$ is also proved. □


b. The Inner Product and Parseval’s Equality 


Now that the completeness of the trigonometric system in $\mathcal{R}_2([-\pi,\pi],\mathbb{C})$ has been proved, we can use Theorem 1 to assert that

$$f = \frac{a_0(f)}{2} + \sum_{k=1}^{\infty} a_k(f)\cos kx + b_k(f)\sin kx \tag{18.66}$$

for every function $f \in \mathcal{R}_2([-\pi,\pi],\mathbb{C})$, or, in complex notation,

$$f = \sum_{-\infty}^{\infty} c_k(f)\,e^{ikx}, \tag{18.67}$$

where the convergence is understood as convergence in the norm of $\mathcal{R}_2[-\pi,\pi]$, that is, as mean convergence, and the limiting passage in (18.67) is the limit of sums of the form $S_n(x) = \sum_{-n}^{n} c_k(f)\,e^{ikx}$ as $n \to \infty$.
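If we rewrite Eqs. (18.66) and (18.67) as

$$\frac{1}{\sqrt{\pi}}f = \frac{a_0(f)}{\sqrt2}\cdot\frac{1}{\sqrt{2\pi}} + \sum_{k=1}^{\infty} a_k(f)\frac{\cos kx}{\sqrt{\pi}} + b_k(f)\frac{\sin kx}{\sqrt{\pi}}, \tag{18.66'}$$

$$\frac{1}{\sqrt{2\pi}}f = \sum_{-\infty}^{\infty} c_k(f)\frac{e^{ikx}}{\sqrt{2\pi}}, \tag{18.67'}$$

then the right-hand sides contain series in the orthonormal systems

$$\left\{\frac{1}{\sqrt{2\pi}},\ \frac{1}{\sqrt{\pi}}\cos kx,\ \frac{1}{\sqrt{\pi}}\sin kx;\ k \in \mathbb{N}\right\}$$

and $\left\{\frac{1}{\sqrt{2\pi}}e^{ikx};\ k \in \mathbb{Z}\right\}$. Hence by the general rule for computing the inner product of vectors from their coordinates in an orthonormal basis, we can assert that the equality

$$\frac{1}{\pi}\langle f,g\rangle = \frac{a_0(f)\bar a_0(g)}{2} + \sum_{k=1}^{\infty} a_k(f)\bar a_k(g) + b_k(f)\bar b_k(g) \tag{18.68}$$

holds for functions $f$ and $g$ in $\mathcal{R}_2([-\pi,\pi],\mathbb{C})$, or, in other notation,

$$\frac{1}{2\pi}\langle f,g\rangle = \sum_{-\infty}^{\infty} c_k(f)\bar c_k(g), \tag{18.69}$$

where, as always,

$$\langle f,g\rangle = \int_{-\pi}^{\pi} f(x)\bar g(x)\,dx.$$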

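In particular, if $f = g$, we obtain the classical Parseval equality from (18.68) and (18.69) in two equivalent forms:

$$\frac{1}{\pi}\|f\|^2 = \frac{|a_0(f)|^2}{2} + \sum_{k=1}^{\infty}|a_k(f)|^2 + |b_k(f)|^2, \tag{18.70}$$

$$\frac{1}{2\pi}\|f\|^2 = \sum_{-\infty}^{\infty}|c_k(f)|^2. \tag{18.71}$$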

We have already noted that from the geometric point of view Parseval’s equality 
can be regarded as an infinite-dimensional version of the Pythagorean theorem. 
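Parseval's equality in the real form can be checked on $f(x) = x$, whose coefficients are $a_k = 0$ and $b_k = 2(-1)^{k+1}/k$ by series (18.60). The following sketch (function names chosen here for illustration) compares a partial sum of $\sum |b_k|^2$ with $\frac{1}{\pi}\int_{-\pi}^{\pi}x^2\,dx = \frac{2\pi^2}{3}$.

```python
import math


def parseval_rhs_for_x(n_terms):
    """Partial sum of sum_{k>=1} |b_k|^2 with b_k = 2 (-1)^(k+1) / k, a_k = 0, for f(x) = x."""
    return sum((2.0 / k) ** 2 for k in range(1, n_terms + 1))


def parseval_lhs_for_x():
    """(1 / pi) * int_{-pi}^{pi} x^2 dx, evaluated in closed form."""
    return 2.0 * math.pi ** 2 / 3.0


if __name__ == "__main__":
    print(parseval_rhs_for_x(100000), parseval_lhs_for_x())
```

Every partial sum stays below the left-hand side (Bessel's inequality), and equality in the limit is equivalent to $\sum_{k=1}^{\infty} 1/k^2 = \pi^2/6$.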
Parseval’s relation provides the basis for the following useful proposition. 

Proposition 3 (Uniqueness of Fourier series) Let $f$ and $g$ be two functions in $\mathcal{R}_2[-\pi,\pi]$. Then

a) if the trigonometric series

$$\frac{a_0}{2} + \sum_{k=1}^{\infty} a_k\cos kx + b_k\sin kx\quad\left(= \sum_{-\infty}^{\infty} c_k e^{ikx}\right)$$

converges in mean to $f$ on the interval $[-\pi,\pi]$, it is the Fourier series of $f$;

b) if the functions $f$ and $g$ have the same Fourier series, they are equal almost everywhere on $[-\pi,\pi]$, that is, $f = g$ in $\mathcal{R}_2[-\pi,\pi]$.

Proof Assertion a) is actually a special case of the general fact that the expansion of a vector in an orthogonal system is unique. The inner product, as we know (see Lemma 1b)), shows immediately that the coefficients of such an expansion are the Fourier coefficients and no others.

Assertion b) can be obtained from Parseval's equality taking account of the completeness of the trigonometric system in $\mathcal{R}_2([-\pi,\pi],\mathbb{C})$, which was just proved.

Since the difference $(f - g)$ has a zero Fourier series, it follows from Parseval's equality that $\|f - g\|_{\mathcal{R}_2} = 0$. Hence the functions $f$ and $g$ are equal at all points of continuity, that is, almost everywhere. □




Remark 11 When studying Taylor series $\sum_{n=0}^{\infty}\frac{f^{(n)}(a)}{n!}(x-a)^n$ we noted previously that different functions of class $C^{(\infty)}(\mathbb{R},\mathbb{R})$ can have the same Taylor series (at some points $a \in \mathbb{R}$). This contrast with the uniqueness theorem just proved for the Fourier series should not be taken too seriously, since every uniqueness theorem is a relative one in the sense that it involves a particular space and a particular type of convergence.

For example, in the space of analytic functions (that is, functions that can be represented as power series $\sum_{n=0}^{\infty} c_n(z-z_0)^n$ converging to them pointwise), two different functions have distinct Taylor series about every point.

If, in turn, in studying trigonometric series we abandon the space $\mathcal{R}_2[-\pi,\pi]$ and study pointwise convergence of a trigonometric series, then, as already noted (p. 524) one can construct a trigonometric series not all of whose coefficients are zero, which nevertheless converges to zero almost everywhere. According to Proposition 3 such a null-series of course does not converge to zero in the mean-square sense.

In conclusion, we illustrate the use of the properties of trigonometric Fourier se- 
ries obtained here by studying the following derivation, due to Hurwitz, 24 of the 
classical isoperimetric inequality in the two-dimensional case. In order to avoid 
cumbersome expressions and accidental technical difficulties, we shall use complex 
notation. 

Example 7 Between the volume $V$ of a domain in the Euclidean space $E^n$, $n \ge 2$, and the $(n-1)$-dimensional surface area $F$ of the hypersurface that bounds it, the following relation holds:

$$n^n v_n V^{n-1} \le F^n, \tag{18.72}$$

called the isoperimetric inequality. Here $v_n$ is the volume of the $n$-dimensional unit ball in $E^n$. Equality in the isoperimetric inequality (18.72) holds only for the ball.

The name “isoperimetric” comes from the classical problem of finding the closed plane curve of a given length $L$ that encloses the largest area $S$. In this case inequality (18.72) means that

$$4\pi S \le L^2. \tag{18.73}$$

It is this inequality that we shall now prove, assuming that the curve in question is smooth and is defined parametrically as $x = \varphi(s)$, $y = \psi(s)$, where $s$ is arc length along the curve and $\varphi$ and $\psi$ belong to $C^{(1)}[0,L]$. The condition that the curve be closed means that $\varphi(0) = \varphi(L)$ and $\psi(0) = \psi(L)$.

We now pass from the parameter $s$ to the parameter $t = 2\pi\frac{s}{L} - \pi$, which ranges from $-\pi$ to $\pi$, and we shall assume that our curve is defined parametrically as

$$x = x(t),\quad y = y(t),\quad -\pi \le t \le \pi, \tag{18.74}$$

24 A. Hurwitz (1859-1919) - German mathematician, a student of F. Klein. 
with

$$x(-\pi) = x(\pi),\quad y(-\pi) = y(\pi). \tag{18.75}$$

We write (18.74) as a single complex-valued function

$$z = z(t),\quad -\pi \le t \le \pi, \tag{18.74'}$$

where $z(t) = x(t) + iy(t)$ and by (18.75) $z(-\pi) = z(\pi)$.
We remark that

$$|z'(t)|^2 = \bigl(x'(t)\bigr)^2 + \bigl(y'(t)\bigr)^2 = \left(\frac{ds}{dt}\right)^2,$$

and hence under our choice of parameter

$$|z'(t)|^2 = \frac{L^2}{4\pi^2}. \tag{18.76}$$

Next, taking into account the relations $\bar z z' = (x - iy)(x' + iy') = (xx' + yy') + i(xy' - x'y)$, and using Eqs. (18.75), we write the formula for the area of the region bounded by the closed curve (18.74):

$$S = \frac12\int_{-\pi}^{\pi}(xy' - yx')(t)\,dt = \frac{1}{2i}\int_{-\pi}^{\pi} z'(t)\bar z(t)\,dt. \tag{18.77}$$

We now write the Fourier series expansion of the function (18.74'):

$$z(t) = \sum_{-\infty}^{\infty} c_k e^{ikt}.$$

Then

$$z'(t) \sim \sum_{-\infty}^{\infty} ikc_k e^{ikt}.$$

Equalities (18.76) and (18.77) mean in particular that

$$\frac{1}{2\pi}\|z'\|^2 = \frac{1}{2\pi}\int_{-\pi}^{\pi}|z'(t)|^2\,dt = \frac{L^2}{4\pi^2},$$

and

$$\frac{1}{2\pi}\langle z',z\rangle = \frac{1}{2\pi}\int_{-\pi}^{\pi} z'(t)\bar z(t)\,dt = \frac{i}{\pi}S.$$

In terms of Fourier coefficients, as follows from Eqs. (18.69) and (18.71), these relations assume the form

$$L^2 = 4\pi^2\sum_{-\infty}^{\infty}|kc_k|^2,$$

$$S = \pi\sum_{-\infty}^{\infty} k\,c_k\bar c_k.$$

Thus,

$$L^2 - 4\pi S = 4\pi^2\sum_{-\infty}^{\infty}(k^2 - k)|c_k|^2.$$

The right-hand side of this equality is obviously nonnegative and vanishes only if $c_k = 0$ for all $k \in \mathbb{Z}$ except $k = 0$ and $k = 1$.

Thus, inequality (18.73) is proved, and at the same time we have found the equation

$$z(t) = c_0 + c_1e^{it},\quad -\pi \le t \le \pi,$$

of the curve for which it becomes equality. This is the complex form of the parametric equation of a circle with center at $c_0$ in the complex plane and radius $|c_1|$.
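Inequality (18.73) can also be observed directly, without Fourier coefficients, by measuring a closed polygon inscribed in an ellipse. This sketch is not Hurwitz's argument but a brute-force check under assumptions made here: the curve is $x = a\cos t$, $y = b\sin t$, the length is the sum of chord lengths, and the area comes from the shoelace formula. Since a polygon is itself a closed curve, the deficit $L^2 - 4\pi S$ should be nonnegative and nearly zero only for the circle.

```python
import math


def polygon_on_ellipse(a, b, n_vertices):
    """Vertices (a cos t_j, b sin t_j) at equally spaced parameter values t_j."""
    return [(a * math.cos(2.0 * math.pi * j / n_vertices),
             b * math.sin(2.0 * math.pi * j / n_vertices))
            for j in range(n_vertices)]


def length_and_area(points):
    """Perimeter (sum of chord lengths) and enclosed area (shoelace formula) of a closed polygon."""
    n = len(points)
    length = 0.0
    twice_area = 0.0
    for j in range(n):
        x0, y0 = points[j]
        x1, y1 = points[(j + 1) % n]
        length += math.hypot(x1 - x0, y1 - y0)
        twice_area += x0 * y1 - x1 * y0
    return length, abs(twice_area) / 2.0


def isoperimetric_deficit(a, b, n_vertices=2000):
    """L^2 - 4 pi S for the polygon inscribed in the ellipse with semi-axes a and b."""
    length, area = length_and_area(polygon_on_ellipse(a, b, n_vertices))
    return length * length - 4.0 * math.pi * area


if __name__ == "__main__":
    print("circle  :", isoperimetric_deficit(1.0, 1.0))
    print("ellipse :", isoperimetric_deficit(2.0, 1.0))
```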


18.2.5 Problems and Exercises 


1. a) Show that

$$\sum_{n=1}^{\infty}\frac{\sin nx}{n} = \frac{\pi - x}{2}\quad\text{for } 0 < x < 2\pi,$$

and find the sum of the series at all other points of $\mathbb{R}$.

Using the preceding expansion and the rules for operating with trigonometric Fourier series, now show that the following equalities are true.

b) $\sum_{k=1}^{\infty}\frac{\sin 2kx}{2k} = \frac{\pi}{4} - \frac{x}{2}$ for $0 < x < \pi$.

c) $\sum_{k=1}^{\infty}\frac{\sin(2k-1)x}{2k-1} = \frac{\pi}{4}$ for $0 < x < \pi$.

d) $\sum_{n=1}^{\infty}\frac{(-1)^{n-1}}{n}\sin nx = \frac{x}{2}$ for $|x| < \pi$.

e) $x^2 = \frac{\pi^2}{3} + 4\sum_{n=1}^{\infty}\frac{(-1)^n}{n^2}\cos nx$ for $|x| < \pi$.

f) $x = \frac{\pi}{2} - \frac{4}{\pi}\sum_{k=1}^{\infty}\frac{\cos(2k-1)x}{(2k-1)^2}$ for $0 \le x \le \pi$.

g) $\frac{3x^2 - 6\pi x + 2\pi^2}{12} = \sum_{n=1}^{\infty}\frac{\cos nx}{n^2}$ for $0 \le x \le \pi$.

h) Sketch the graph of the sums of the trigonometric series here over the entire real line $\mathbb{R}$. Using the results obtained, find the sums of the following numerical series:

$$\sum_{n=0}^{\infty}\frac{(-1)^n}{2n+1},\qquad \sum_{n=1}^{\infty}\frac{1}{n^2},\qquad \sum_{n=1}^{\infty}\frac{(-1)^n}{n^2}.$$

2. Show that:

a) if $f : [-\pi,\pi] \to \mathbb{C}$ is an odd (resp. even) function, then its Fourier coefficients have the following property: $a_k(f) = 0$ (resp. $b_k(f) = 0$) for $k = 0, 1, 2, \ldots$;

b) if $f : \mathbb{R} \to \mathbb{C}$ has period $2\pi/m$ then its Fourier coefficients $c_k(f)$ can be nonzero only when $k$ is a multiple of $m$;

c) if $f : [-\pi,\pi] \to \mathbb{R}$ is real-valued, then $c_k(f) = \bar c_{-k}(f)$ for all $k \in \mathbb{N}$;

d) $|a_k(f)| \le 2\sup_{|x|<\pi}|f(x)|$, $|b_k(f)| \le 2\sup_{|x|<\pi}|f(x)|$, $|c_k(f)| \le \sup_{|x|<\pi}|f(x)|$.

3. a) Show that each of the systems $\{\cos kx;\ k = 0, 1, \ldots\}$ and $\{\sin kx;\ k \in \mathbb{N}\}$ is complete in the space $\mathcal{R}_2[a,a+\pi]$ for any value of $a \in \mathbb{R}$.

b) Expand the function $f(x) = x$ in the interval $[0,\pi]$ with respect to each of these two systems.

c) Draw the graphs of the sums of the series just found over the entire real line.

d) Exhibit the trigonometric Fourier series of the function $f(x) = |x|$ on the closed interval $[-\pi,\pi]$ and determine whether it converges uniformly to this function on the entire closed interval $[-\pi,\pi]$.

4. The Fourier series $\sum_{-\infty}^{\infty} c_k(f)\,e^{ikx}$ of a function $f$ can be regarded as a special case of a power series $\sum_{-\infty}^{\infty} c_k z^k$ $\bigl(= \sum_{-\infty}^{-1} c_k z^k + \sum_{0}^{\infty} c_k z^k\bigr)$, in which $z$ is restricted to the unit circle in the complex plane, that is, $z = e^{ix}$.

Show that if the Fourier coefficients Ck( f) of the function / : [— jt, tt] -* C 
vanish so rapidly that lim A .^_ 00 | Ck(f)\ l ^ k — c_ > 1 and limi^ +00 \ck(f)\ l ^ k — 
c+ < 1 , then 

a) the function / can be regarded as the image of the unit circle under a function 
represented in the annulus c_ l < |z| < C+ 1 by the series Yl™oo c kZ k \ 

b) for z = x + iy and In d- < y < In d- the series Ck(f)& ,kx converges 

absolutely (and, in particular, its sum is independent of the order of summation of 
the terms); 

c) in any strip of the complex plane defined by the conditions a < Im z < b, 
where In < a < b < In the series Y^oo c k(f) e ' kx converges absolutely and 
uniformly; 

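d) using the expansion $e^z = 1 + \frac{z}{1!} + \frac{z^2}{2!} + \cdots$ and Euler's formula $e^{ix} = \cos x + i\sin x$, show that

$$1 + \frac{\cos x}{1!} + \cdots + \frac{\cos nx}{n!} + \cdots = e^{\cos x}\cos(\sin x),$$

$$\frac{\sin x}{1!} + \cdots + \frac{\sin nx}{n!} + \cdots = e^{\cos x}\sin(\sin x);$$

e) using the expansions $\cos z = 1 - \frac{z^2}{2!} + \frac{z^4}{4!} - \cdots$ and $\sin z = z - \frac{z^3}{3!} + \frac{z^5}{5!} - \cdots$, verify that

$$\sum_{n=0}^{\infty}(-1)^n\frac{\cos(2n+1)x}{(2n+1)!} = \sin(\cos x)\cosh(\sin x),$$

$$\sum_{n=0}^{\infty}(-1)^n\frac{\sin(2n+1)x}{(2n+1)!} = \cos(\cos x)\sinh(\sin x),$$

$$\sum_{n=0}^{\infty}(-1)^n\frac{\cos 2nx}{(2n)!} = \cos(\cos x)\cosh(\sin x),$$

$$\sum_{n=0}^{\infty}(-1)^n\frac{\sin 2nx}{(2n)!} = -\sin(\cos x)\sinh(\sin x).$$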


5. Verify that

a) the systems $\{1, \cos k\frac{2\pi}{T}x, \sin k\frac{2\pi}{T}x;\ k \in \mathbb{N}\}$ and $\{e^{ik\frac{2\pi}{T}x};\ k \in \mathbb{Z}\}$ are orthogonal in the space $\mathcal{R}_2([a,a+T],\mathbb{C})$ for every $a \in \mathbb{R}$;

b) the Fourier coefficients $a_k(f)$, $b_k(f)$, and $c_k(f)$ of a $T$-periodic function $f$ in these systems are the same whether the Fourier expansion is done on the interval $\bigl[-\frac{T}{2},\frac{T}{2}\bigr]$ or any other closed interval of the form $[a,a+T]$;

c) if $c_k(f)$ and $c_k(g)$ are the Fourier coefficients of $T$-periodic functions $f$ and $g$, then

$$\frac{1}{T}\int_a^{a+T} f(x)\bar g(x)\,dx = \sum_{-\infty}^{\infty} c_k(f)\bar c_k(g);$$

d) the Fourier coefficients $c_k(h)$ of the “convolution” normalized by the factor $\frac{1}{T}$,

$$h(x) = \frac{1}{T}\int_0^T f(x-t)g(t)\,dt,$$

of $T$-periodic smooth functions $f$ and $g$ and the coefficients $c_k(f)$ and $c_k(g)$ of the functions themselves are related by $c_k(h) = c_k(f)c_k(g)$, $k \in \mathbb{Z}$.

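6. Prove that if $\alpha$ is incommensurable with $\pi$, then

a) $\lim_{N\to\infty}\frac{1}{N}\sum_{n=1}^{N} e^{ik(x+n\alpha)} = \frac{1}{2\pi}\int_{-\pi}^{\pi} e^{ikt}\,dt$;

b) for every continuous $2\pi$-periodic function $f : \mathbb{R} \to \mathbb{C}$

$$\lim_{N\to\infty}\frac{1}{N}\sum_{n=1}^{N} f(x+n\alpha) = \frac{1}{2\pi}\int_{-\pi}^{\pi} f(t)\,dt.$$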


7. Prove the following propositions.

a) If the function $f : \mathbb{R} \to \mathbb{C}$ is absolutely integrable on $\mathbb{R}$, then

$$\left|\int_{-\infty}^{\infty} f(x)e^{i\lambda x}\,dx\right| \le \int_{-\infty}^{\infty}\left|f\left(x+\frac{\pi}{\lambda}\right) - f(x)\right|dx.$$

b) If the functions $f : \mathbb{R} \to \mathbb{C}$ and $g : \mathbb{R} \to \mathbb{C}$ are absolutely integrable on $\mathbb{R}$ and $g$ is bounded in absolute value on $\mathbb{R}$, then

$$\int_{-\infty}^{\infty} f(x+t)g(t)e^{i\lambda t}\,dt =: \varphi_\lambda(x) \rightrightarrows 0\quad\text{on } \mathbb{R} \text{ as } \lambda \to \infty.$$

c) If $f : \mathbb{R} \to \mathbb{C}$ is a $2\pi$-periodic function that is absolutely integrable over a period, then the remainder $S_n(x) - f(x)$ of its trigonometric Fourier series can be represented as

$$S_n(x) - f(x) = \frac{1}{2\pi}\int_0^{\pi}(\Delta^2 f)(x,t)\,D_n(t)\,dt,$$

where $D_n$ is the $n$th Dirichlet kernel, and $(\Delta^2 f)(x,t) = f(x+t) - 2f(x) + f(x-t)$.

d) For every $\delta \in\, ]0,\pi[$ the formula for the remainder just obtained can be brought into the form

$$S_n(x) - f(x) = \frac{1}{\pi}\int_0^{\delta}\frac{\sin nt}{t}(\Delta^2 f)(x,t)\,dt + o(1),$$

where $o(1)$ tends to zero as $n \to \infty$, and uniformly on each closed interval $[a,b]$ on which $f$ is bounded.

e) If the function $f : [-\pi,\pi] \to \mathbb{C}$ satisfies the Holder condition $|f(x_1) - f(x_2)| \le M|x_1 - x_2|^{\alpha}$ on $[-\pi,\pi]$ (where $M$ and $\alpha$ are positive numbers) and in addition $f(-\pi) = f(\pi)$, then the Fourier series of $f$ converges to it uniformly on the entire interval.

8. a) Prove that if $f : \mathbb{R} \to \mathbb{R}$ is a $2\pi$-periodic function having a piecewise continuous derivative of order $m$ ($m \in \mathbb{N}$), then $f$ can be represented as

$$f(x) = \frac{a_0}{2} + \frac{1}{\pi}\int_{-\pi}^{\pi} B_m(t-x)\,f^{(m)}(t)\,dt,$$

where $B_m(u) = \sum_{k=1}^{\infty}\frac{\cos\left(ku + \frac{m\pi}{2}\right)}{k^m}$, $m \in \mathbb{N}$.

b) Using the Fourier series expansion obtained in Problem 1 for the function $\frac{\pi - x}{2}$ on the interval $[0,2\pi]$, prove that $B_1(u)$ is a polynomial of degree 1 and $B_m(u)$ is a polynomial of degree $m$ on the interval $[0,2\pi]$. These polynomials are called the Bernoulli polynomials.

c) Verify that $\int_{-\pi}^{\pi} B_m(u)\,du = 0$ for every $m \in \mathbb{N}$.

9. a) Let $x_m = \frac{2\pi m}{2n+1}$, $m = 0, 1, \ldots, 2n$. Verify that

$$\frac{2}{2n+1}\sum_{m=0}^{2n}\cos kx_m\cos lx_m = \delta_{kl},$$

$$\frac{2}{2n+1}\sum_{m=0}^{2n}\sin kx_m\sin lx_m = \delta_{kl},$$

$$\sum_{m=0}^{2n}\sin kx_m\cos lx_m = 0,$$

where $k$ and $l$ are nonnegative integers, $\delta_{kl} = 0$ for $k \ne l$, and $\delta_{kl} = 1$ for $k = l$.

b) Let $f : \mathbb{R} \to \mathbb{R}$ be a $2\pi$-periodic function that is absolutely integrable over a period. Let us partition the closed interval $[0,2\pi]$ into $2n+1$ equal parts by the points $x_m = \frac{2\pi m}{2n+1}$, $m = 0, 1, \ldots, 2n$. Let us compute the integrals

$$a_k(f) = \frac{1}{\pi}\int_0^{2\pi} f(x)\cos kx\,dx,\qquad b_k(f) = \frac{1}{\pi}\int_0^{2\pi} f(x)\sin kx\,dx$$

approximately using the rectangular method corresponding to this partition of the interval $[0,2\pi]$. We obtain the quantities

$$\tilde a_k(f) = \frac{2}{2n+1}\sum_{m=0}^{2n} f(x_m)\cos kx_m,\qquad \tilde b_k(f) = \frac{2}{2n+1}\sum_{m=0}^{2n} f(x_m)\sin kx_m,$$

which we place in the $n$th partial sum $S_n(f,x)$ of the Fourier series of $f$ instead of the respective coefficients $a_k(f)$ and $b_k(f)$.

Prove that when this is done the result is a trigonometric polynomial $\tilde S_n(f,x)$ of order $n$ that interpolates the function $f$ at the nodes $x_m$, $m = 0, 1, \ldots, 2n$, that is, at these points $f(x_m) = \tilde S_n(f,x_m)$.

10. Suppose the function $f : [a,b] \to \mathbb{R}$ is continuous and piecewise differentiable, and suppose that the square of its derivative $f'$ is integrable over the interval $]a,b[$. Using Parseval's equality, prove the following:

a) if $[a,b] = [0,\pi]$, then either of the conditions $f(0) = f(\pi) = 0$ or $\int_0^{\pi} f(x)\,dx = 0$ implies Steklov's inequality

$$\int_0^{\pi} f^2(x)\,dx \le \int_0^{\pi}(f')^2(x)\,dx,$$

in which equality is possible only for $f(x) = a\cos x$;

b) if $[a,b] = [-\pi,\pi]$ and the conditions $f(-\pi) = f(\pi)$ and $\int_{-\pi}^{\pi} f(x)\,dx = 0$ both hold, then Wirtinger's inequality holds:

$$\int_{-\pi}^{\pi} f^2(x)\,dx \le \int_{-\pi}^{\pi}(f')^2(x)\,dx,$$

where equality is possible only if $f(x) = a\cos x + b\sin x$.

11. The Gibbs phenomenon is the behavioral property of the partial sums of a
trigonometric Fourier series described below, first observed by Wilbraham (1848)
and later (1898) rediscovered by Gibbs (Mathematical Encyclopedia, Vol. 1,
Moscow, 1977).

a) Show that

$$\operatorname{sgn} x = \frac{4}{\pi}\sum_{k=1}^{\infty} \frac{\sin(2k-1)x}{2k-1} \quad \text{for } |x| < \pi.$$

b) Verify that the function $S_n(x) = \frac{4}{\pi}\sum_{k=1}^{n} \frac{\sin(2k-1)x}{2k-1}$ has a maximum at
$x = \frac{\pi}{2n}$ and that as $n \to \infty$

$$S_n\Bigl(\frac{\pi}{2n}\Bigr) = \frac{2}{\pi}\sum_{k=1}^{n} \frac{\sin(2k-1)\frac{\pi}{2n}}{(2k-1)\frac{\pi}{2n}}\cdot\frac{\pi}{n} \to \frac{2}{\pi}\int_0^{\pi}\frac{\sin x}{x}\,dx \approx 1.179.$$

Thus the oscillation of $S_n(x)$ near $x = 0$ as $n \to \infty$ exceeds the jump of the
function $\operatorname{sgn} x$ itself at that point by approximately 18% (the jump of $S_n(x)$ "due to
inertia").
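
The limit in part b) is easy to observe numerically. The sketch below is only an illustration under stated assumptions: the function names are hypothetical, and the integral is approximated by a midpoint rule rather than evaluated exactly. It computes $S_n(\frac{\pi}{2n})$ directly and compares it with $\frac{2}{\pi}\int_0^{\pi}\frac{\sin x}{x}\,dx$.

```python
import math

def square_wave_partial_sum(n, x):
    """S_n(x) = (4/pi) * sum_{k=1}^{n} sin((2k-1)x) / (2k-1)."""
    return 4 / math.pi * sum(
        math.sin((2 * k - 1) * x) / (2 * k - 1) for k in range(1, n + 1)
    )

def gibbs_limit(steps=20000):
    """Midpoint-rule approximation of (2/pi) * integral_0^pi sin(x)/x dx."""
    h = math.pi / steps
    total = 0.0
    for j in range(steps):
        x = (j + 0.5) * h  # midpoints never hit x = 0
        total += math.sin(x) / x
    return 2 / math.pi * total * h

if __name__ == "__main__":
    for n in (10, 100, 1000):
        print(n, square_wave_partial_sum(n, math.pi / (2 * n)))
    print("limit:", gibbs_limit())
```

The values at $x = \frac{\pi}{2n}$ stay near 1.179 instead of approaching 1, which is the overshoot the problem describes.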

c) Describe the limit of the graphs of the functions S n (x) in problem b). 

Now suppose that $S_n(f, x)$ is the $n$th partial sum of the trigonometric Fourier
series of a function $f$ and suppose that $S_n(f, x) \to f(x)$ in a deleted neighborhood
$0 < |x - \xi| < \delta$ of the point $\xi$ as $n \to \infty$ and that $f$ has one-sided limits $f(\xi_-)$ and
$f(\xi_+)$ at $\xi$. For definiteness we shall assume that $f(\xi_-) \le f(\xi_+)$.

We say that Gibbs' phenomenon occurs for the sums $S_n(f, x)$ at the point $\xi$ if
$\varliminf_{n\to\infty} S_n(f, x) < f(\xi_-) \le f(\xi_+) < \varlimsup_{n\to\infty} S_n(f, x)$.

d) Using Remark 9 show that Gibbs' phenomenon occurs at the point $\xi$ for
every function of the form $\varphi(x) + c\,\operatorname{sgn}(x - \xi)$, where $c \ne 0$, $|\xi| < \pi$, and
$\varphi \in C^{(1)}[-\pi, \pi]$.

12. Multiple trigonometric Fourier series.

a) Verify that the system of functions $\frac{1}{(2\pi)^{n/2}}e^{ikx}$, where $k = (k_1, \ldots, k_n)$,
$x = (x_1, \ldots, x_n)$, $kx = k_1x_1 + \cdots + k_nx_n$, and $k_1, \ldots, k_n \in \mathbb{Z}$, is orthonormal on any
$n$-dimensional cube $I = \{x \in \mathbb{R}^n \mid a_j \le x_j \le a_j + 2\pi,\ j = 1, 2, \ldots, n\}$.

b) To a function $f$ that is integrable over $I$ we assign the sum
$f \sim \sum_{-\infty}^{\infty} c_k(f)e^{ikx}$, which is called the Fourier series of $f$ in the system
$\bigl\{\frac{1}{(2\pi)^{n/2}}e^{ikx}\bigr\}$ if $c_k(f) = \frac{1}{(2\pi)^n}\int_I f(x)e^{-ikx}\,dx$. The numbers $c_k(f)$ are called the
Fourier coefficients of $f$ in the system $\bigl\{\frac{1}{(2\pi)^{n/2}}e^{ikx}\bigr\}$.

In the multidimensional case the Fourier series is often summed via the partial
sums

$$S_N(x) = \sum_{|k| \le N} c_k(f)e^{ikx},$$

where $|k| \le N$ means that $N = (N_1, \ldots, N_n)$ and $|k_j| \le N_j$, $j = 1, \ldots, n$.

Show that for every function $f(x) = f(x_1, \ldots, x_n)$ that is $2\pi$-periodic in each
variable

$$S_N(x) = \frac{1}{\pi^n}\int_I \prod_{j=1}^{n} D_{N_j}(t_j - x_j)\,f(t)\,dt = \frac{1}{\pi^n}\int_{-\pi}^{\pi}\cdots\int_{-\pi}^{\pi} f(x - t)\prod_{j=1}^{n} D_{N_j}(t_j)\,dt_1\cdots dt_n,$$

where $D_{N_j}(u)$ is the $N_j$th one-dimensional Dirichlet kernel.
c) Show that the Fejér sum

$$\sigma_N(x) := \frac{1}{N+1}\sum_{k=0}^{N} S_k(x) = \frac{1}{(N_1+1)\cdots(N_n+1)}\sum_{k_1=0}^{N_1}\cdots\sum_{k_n=0}^{N_n} S_{k_1\ldots k_n}(x)$$

of a function $f(x) = f(x_1, \ldots, x_n)$ that is $2\pi$-periodic in each variable can be
represented as

$$\sigma_N(x) = \frac{1}{\pi^n}\int_I f(x - t)\,\Phi_N(t)\,dt,$$

where $\Phi_N(u) = \prod_{j=1}^{n} \mathcal{F}_{N_j}(u_j)$ and $\mathcal{F}_{N_j}$ is the $N_j$th one-dimensional Fejér kernel.

d) Now extend Fejér's theorem to the $n$-dimensional case.

e) Show that if a function $f$ that is $2\pi$-periodic in each variable is absolutely
integrable over a period $I$, possibly in the improper sense, then
$\int_I |f(x + u) - f(x)|\,dx \to 0$ as $u \to 0$ and $\int_I |f - \sigma_N|(x)\,dx \to 0$ as $N \to \infty$.

f) Prove that two functions $f$ and $g$ that are absolutely integrable over the cube
$I$ can have equal Fourier series (that is, $c_k(f) = c_k(g)$ for every multi-index $k$) only
if $f(x) = g(x)$ almost everywhere on $I$. This is a strengthening of Proposition 3 on
uniqueness of Fourier series.

g) Verify that the original orthonormal system $\bigl\{\frac{1}{(2\pi)^{n/2}}e^{ikx}\bigr\}$ is complete in
$\mathcal{R}_2(I)$, so that the Fourier series of every function $f \in \mathcal{R}_2(I)$ converges to $f$ in
the mean on $I$.

h) Let $f$ be a function in $C^{(\infty)}(\mathbb{R}^n)$ of period $2\pi$ in each variable. Verify that
$c_k(f^{(\alpha)}) = i^{|\alpha|}k^{\alpha}c_k(f)$, where $\alpha = (\alpha_1, \ldots, \alpha_n)$, $k = (k_1, \ldots, k_n)$,
$|\alpha| = |\alpha_1| + \cdots + |\alpha_n|$, $k^{\alpha} = k_1^{\alpha_1}\cdots k_n^{\alpha_n}$, and the $\alpha_j$ are nonnegative integers.

i) Let $f$ be a function of class $C^{(mn)}(\mathbb{R}^n)$ of period $2\pi$ in each variable. Show
that if the estimate

$$\frac{1}{(2\pi)^n}\int_I |f^{(\alpha)}|^2(x)\,dx \le M^2$$

holds for each multi-index $\alpha = (\alpha_1, \ldots, \alpha_n)$ such that $\alpha_j$ is 0 or $m$ (for every
$j = 1, \ldots, n$), then

$$|f(x) - S_N(x)| \le \frac{CM}{\underline{N}^{\,m-\frac{1}{2}}},$$

where $\underline{N} = \min\{N_1, \ldots, N_n\}$ and $C$ is a constant depending on $m$ but not on $N$ or
$x \in I$.

j) Notice that if a sequence of continuous functions converges in mean on the
interval $I$ to a function $f$ and simultaneously converges uniformly to $\varphi$, then
$f(x) = \varphi(x)$ on $I$.

Using this observation, prove that if a function $f : \mathbb{R}^n \to \mathbb{C}$ of period $2\pi$ in
each variable belongs to $C^{(1)}(\mathbb{R}^n, \mathbb{C})$, then the trigonometric Fourier series of $f$
converges to $f$ uniformly on the entire space $\mathbb{R}^n$.

13. Fourier series of generalized functions. Every $2\pi$-periodic function $f : \mathbb{R} \to \mathbb{C}$
can be regarded as a function $f(s)$ of a point on the unit circle $\Gamma$ (the point is fixed
by the value of the arc-length parameter $s$, $0 \le s \le 2\pi$).

Preserving the notation of Sect. 17.4, we consider the space $\mathcal{D}(\Gamma)$ on $\Gamma$ consisting
of functions in $C^{(\infty)}(\Gamma)$ and the space $\mathcal{D}'(\Gamma)$ of generalized functions, that is,
continuous linear functionals on $\mathcal{D}(\Gamma)$. The value of the functional $F \in \mathcal{D}'(\Gamma)$ on
the function $\varphi \in \mathcal{D}(\Gamma)$ will be denoted $F(\varphi)$, so as to avoid the symbol $\langle F, \varphi\rangle$ used
in the present chapter to denote the Hermitian inner product (18.38).

Each function $f$ that is integrable on $\Gamma$ can be regarded as an element of $\mathcal{D}'(\Gamma)$
(a regular generalized function) acting on the function $\varphi \in \mathcal{D}(\Gamma)$ according to the
formula

$$f(\varphi) = \int_0^{2\pi} f(s)\varphi(s)\,ds.$$

Convergence of a sequence $\{F_n\}$ of generalized functions in $\mathcal{D}'(\Gamma)$ to a generalized
function $F \in \mathcal{D}'(\Gamma)$, as usual, means that for every function $\varphi \in \mathcal{D}(\Gamma)$

$$\lim_{n\to\infty} F_n(\varphi) = F(\varphi).$$

a) Using the fact that by Theorem 5 the relation $\varphi(s) = \sum_{-\infty}^{\infty} c_k(\varphi)e^{iks}$ holds
for every function $\varphi \in C^{(\infty)}(\Gamma)$ and, in particular, $\varphi(0) = \sum_{-\infty}^{\infty} c_k(\varphi)$, show that
in the sense of convergence in the space of generalized functions $\mathcal{D}'(\Gamma)$ we have

$$\sum_{k=-n}^{n} \frac{1}{2\pi}e^{iks} \to \delta \quad \text{as } n \to \infty.$$

Here $\delta$ is the element of $\mathcal{D}'(\Gamma)$ whose effect on the function $\varphi \in \mathcal{D}(\Gamma)$ is defined
by $\delta(\varphi) = \varphi(0)$.

b) If $f \in \mathcal{R}(\Gamma)$, the Fourier coefficients of the function $f$ in the system $\{e^{iks}\}$,
defined in the standard manner, can be written as

$$c_k(f) = \frac{1}{2\pi}\int_0^{2\pi} f(s)e^{-iks}\,ds = \frac{1}{2\pi}f(e^{-iks}).$$

By analogy we now define the Fourier coefficients $c_k(F)$ of any generalized
function $F \in \mathcal{D}'(\Gamma)$ by the formula $c_k(F) = \frac{1}{2\pi}F(e^{-iks})$, which makes sense
because $e^{-iks} \in \mathcal{D}(\Gamma)$.

Thus to every generalized function $F \in \mathcal{D}'(\Gamma)$ we assign the Fourier series

$$F \sim \sum_{-\infty}^{\infty} c_k(F)e^{iks}.$$

Show that $\delta \sim \sum_{-\infty}^{\infty} \frac{1}{2\pi}e^{iks}$.

c) Prove the following fact, which is remarkable for its simplicity and the freedom
of action that it provides: the Fourier series of every generalized function
$F \in \mathcal{D}'(\Gamma)$ converges to $F$ (in the sense of convergence in the space $\mathcal{D}'(\Gamma)$).

d) Show that the Fourier series of a function $F \in \mathcal{D}'(\Gamma)$ (like the function $F$ itself,
and like every convergent series of generalized functions) can be differentiated
termwise any number of times.

e) Starting from the equality $\delta = \sum_{-\infty}^{\infty} \frac{1}{2\pi}e^{iks}$, find the Fourier series of $\delta'$.

f) Let us now return from the circle $\Gamma$ to the line $\mathbb{R}$ and study the functions $e^{iks}$
as regular generalized functions in $\mathcal{D}'(\mathbb{R})$ (that is, as continuous linear functionals
on the space $\mathcal{D}(\mathbb{R})$ of functions in the class $C_0^{(\infty)}(\mathbb{R})$ of infinitely differentiable
functions of compact support in $\mathbb{R}$).

Every locally integrable function $f$ can be regarded as an element of $\mathcal{D}'(\mathbb{R})$
(a regular generalized function in $\mathcal{D}'(\mathbb{R})$) whose effect on the function
$\varphi \in C_0^{(\infty)}(\mathbb{R}, \mathbb{C})$ is given by the rule $f(\varphi) = \int_{\mathbb{R}} f(x)\varphi(x)\,dx$. Convergence in $\mathcal{D}'(\mathbb{R})$
is defined in the standard way:

$$\Bigl(\lim_{n\to\infty} F_n = F\Bigr) := \forall \varphi \in \mathcal{D}(\mathbb{R})\ \Bigl(\lim_{n\to\infty} F_n(\varphi) = F(\varphi)\Bigr).$$

Show that the equality

$$\frac{1}{2\pi}\sum_{-\infty}^{\infty} e^{ikx} = \sum_{-\infty}^{\infty} \delta(x - 2\pi k)$$

holds in the sense of convergence in $\mathcal{D}'(\mathbb{R})$. In both sides of this equality a limiting
passage is assumed as $n \to \infty$ over symmetric partial sums $\sum_{-n}^{n}$, and $\delta(x - x_0)$,
as always, denotes the $\delta$-function of $\mathcal{D}'(\mathbb{R})$ shifted to the point $x_0$, that is,
$\delta(x - x_0)(\varphi) = \varphi(x_0)$.


18.3 The Fourier Transform 

18.3.1 Representation of a Function by Means of a Fourier 
Integral 

a. The Spectrum and Harmonic Analysis of a Function 

Let $f(t)$ be a $T$-periodic function, for example a periodic signal with frequency $\frac{1}{T}$
as a function of time. We shall assume that the function $f$ is absolutely integrable
over a period. Expanding $f$ in a Fourier series (when $f$ is sufficiently regular, as we
know, the Fourier series converges to $f$) and transforming that series,

$$f(t) = \frac{a_0(f)}{2} + \sum_{k=1}^{\infty} a_k(f)\cos k\omega_0 t + b_k(f)\sin k\omega_0 t = \sum_{-\infty}^{\infty} c_k(f)e^{ik\omega_0 t} = c_0 + 2\sum_{k=1}^{\infty} |c_k|\cos(k\omega_0 t + \arg c_k), \tag{18.78}$$

we obtain a representation of $f$ as a sum of a constant term $\frac{a_0}{2} = c_0$ (the mean
value of $f$ over a period) and sinusoidal components with frequencies $\nu_0 = \frac{1}{T}$ (the
fundamental frequency), $2\nu_0$ (the second harmonic frequency), and so on. In general
the $k$th harmonic component $2|c_k|\cos\bigl(k\frac{2\pi}{T}t + \arg c_k\bigr)$ of the signal has frequency
$k\nu_0 = \frac{k}{T}$, cyclic frequency $k\omega_0 = 2\pi k\nu_0 = \frac{2\pi}{T}k$, amplitude $2|c_k| = \sqrt{a_k^2 + b_k^2}$, and
phase $\arg c_k = -\arctan\frac{b_k}{a_k}$.

The expansion of a periodic function (signal) into a sum of simple harmonic
oscillations is called the harmonic analysis of $f$. The numbers $\{c_k(f);\ k \in \mathbb{Z}\}$ or
$\{a_0(f), a_k(f), b_k(f);\ k \in \mathbb{N}\}$ are called the spectrum of the function (signal) $f$.
A periodic function thus has a discrete spectrum.

Let us now set out (on a heuristic level) what happens to the expansion (18.78) 
when the period T of the signal increases without bound. 

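Simplifying the notation by writing $l = \frac{T}{2}$ and $\alpha_k = k\frac{\pi}{l}$, we rewrite the expansion

$$f(t) = \sum_{-\infty}^{\infty} c_k e^{ik\frac{\pi}{l}t}$$

as follows:

$$f(t) = \sum_{-\infty}^{\infty} \Bigl(c_k\frac{l}{\pi}\Bigr)e^{i\alpha_k t}\,\frac{\pi}{l}, \tag{18.79}$$

where

$$c_k = \frac{1}{2l}\int_{-l}^{l} f(t)e^{-i\alpha_k t}\,dt$$

and hence

$$c_k\frac{l}{\pi} = \frac{1}{2\pi}\int_{-l}^{l} f(t)e^{-i\alpha_k t}\,dt.$$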


Assuming that in the limit as $l \to +\infty$ we arrive at an arbitrary function $f$ that
is absolutely integrable over $\mathbb{R}$, we introduce the auxiliary function

$$c(\alpha) = \frac{1}{2\pi}\int_{-\infty}^{\infty} f(t)e^{-i\alpha t}\,dt, \tag{18.80}$$

whose values at points $\alpha = \alpha_k$ differ only slightly from the quantities $c_k\frac{l}{\pi}$ in formula
(18.79). In that case

$$f(t) \approx \sum_{-\infty}^{\infty} c(\alpha_k)e^{i\alpha_k t}\,\frac{\pi}{l}, \tag{18.81}$$

where $\alpha_k = k\frac{\pi}{l}$ and $\alpha_{k+1} - \alpha_k = \frac{\pi}{l}$. This last sum resembles a Riemann sum,
and as the partition is refined, which occurs as $l \to \infty$, we obtain

$$f(t) = \int_{-\infty}^{\infty} c(\alpha)e^{i\alpha t}\,d\alpha. \tag{18.82}$$

Thus, following Fourier, we have arrived at the expansion of the function $f$ into
a continuous linear combination of harmonics of variable frequency and phase.

The integral (18.82) will be called the Fourier integral below. It is the continuous
equivalent of a Fourier series. The function $c(\alpha)$ in it is the analog of the Fourier
coefficient, and will be called the Fourier transform of the function $f$ (defined on the
entire line $\mathbb{R}$). Formula (18.80) for the Fourier transform is completely equivalent to
the formula for the Fourier coefficients. It is natural to regard the function $c(\alpha)$ as
the spectrum of the function (signal) $f$. In contrast to the case of a periodic signal $f$
considered above and the discrete spectrum (of Fourier coefficients) corresponding
to it, the spectrum $c(\alpha)$ of an arbitrary signal may be nonzero on whole intervals
and even on the entire line (continuous spectrum).


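Example 1 Let us find the function having the following spectrum of compact support:

$$c(\alpha) = \begin{cases} h, & \text{if } |\alpha| \le a, \\ 0, & \text{if } |\alpha| > a. \end{cases} \tag{18.83}$$

Proof By formula (18.82) we find, for $t \ne 0$,

$$f(t) = \int_{-a}^{a} he^{i\alpha t}\,d\alpha = h\,\frac{e^{iat} - e^{-iat}}{it} = 2h\,\frac{\sin at}{t}, \tag{18.84}$$

and when $t = 0$, we obtain $f(0) = 2ha$, which equals the limit of $2h\frac{\sin at}{t}$ as
$t \to 0$. □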
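
The closed form in Example 1 can be compared with a direct numerical evaluation of the integral (18.82) for the box spectrum (18.83). This is a minimal sketch under stated assumptions: the midpoint rule and the function names are illustrative choices, and the parameter values in the usage lines are arbitrary.

```python
import cmath
import math

def fourier_integral_of_box(t, a, h, steps=20000):
    """Midpoint-rule value of integral_{-a}^{a} h * exp(i*alpha*t) d(alpha)."""
    width = 2 * a / steps
    total = 0j
    for j in range(steps):
        alpha = -a + (j + 0.5) * width
        total += h * cmath.exp(1j * alpha * t)
    return total * width

def closed_form(t, a, h):
    """2h*sin(a*t)/t for t != 0, and the limiting value 2*h*a at t = 0."""
    return 2 * h * a if t == 0 else 2 * h * math.sin(a * t) / t

if __name__ == "__main__":
    a, h = 2.0, 1.5  # arbitrary sample parameters
    for t in (0.0, 0.5, 3.0):
        print(t, fourier_integral_of_box(t, a, h).real, closed_form(t, a, h))
```

The imaginary part of the numerical value is zero up to rounding, as it should be for an even spectrum.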


The representation of a function in the form (18.82) is called its Fourier integral 
representation. We shall discuss below the conditions under which such a represen- 
tation is possible. Right now, we consider another example. 

Example 2 Let $P$ be a device having the following properties: it is a linear signal
transform, that is, $P\bigl(\sum_j a_jf_j\bigr) = \sum_j a_jP(f_j)$, and it preserves the periodicity of
a signal, that is, $P(e^{i\omega t}) = p(\omega)e^{i\omega t}$, where the coefficient $p(\omega)$ depends on the
frequency $\omega$ of the periodic signal $e^{i\omega t}$.

We use the compact complex notation, although of course everything could be
rewritten in terms of $\cos\omega t$ and $\sin\omega t$.

The function $p(\omega) =: R(\omega)e^{i\varphi(\omega)}$ is called the spectral characteristic of the device
$P$. Its absolute value $R(\omega)$ is usually called the frequency characteristic and its
argument $\varphi(\omega)$ the phase characteristic of the device $P$. A signal $e^{i\omega t}$, after passing
through the device, emerges transformed into the signal $R(\omega)e^{i(\omega t + \varphi(\omega))}$, its amplitude
changed as a result of the factor $R(\omega)$ and its phase shifted due to the presence
of the term $\varphi(\omega)$.

Let us assume that we know the spectral characteristic $p(\omega)$ of the device $P$ and
the signal $f(t)$ that enters the device; we ask how to find the signal $x(t) = P(f)(t)$
that emerges from the device.

Representing the signal $f(t)$ as the Fourier integral (18.82) and using the linearity
of the device and the integral, we find

$$x(t) = P(f)(t) = \int_{-\infty}^{\infty} c(\omega)p(\omega)e^{i\omega t}\,d\omega.$$

In particular, if

$$p(\omega) = \begin{cases} 1 & \text{for } |\omega| \le \Omega, \\ 0 & \text{for } |\omega| > \Omega, \end{cases} \tag{18.85}$$

then

$$x(t) = \int_{-\Omega}^{\Omega} c(\omega)e^{i\omega t}\,d\omega$$

and, as one can see from the spectral characteristics of the device,

$$P(e^{i\omega t}) = \begin{cases} e^{i\omega t} & \text{for } |\omega| \le \Omega, \\ 0 & \text{for } |\omega| > \Omega. \end{cases}$$

A device $P$ with the spectral characteristic (18.85) transmits (filters) frequencies
not greater than $\Omega$ without distortion and truncates all of the signal involved with
higher frequencies (larger than $\Omega$). For that reason, such a device is called an ideal
low-frequency filter (with upper frequency limit $\Omega$) in radio technology.
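
The action of the ideal low-frequency filter can be illustrated on a signal with finitely many harmonics. The sketch below is an assumption-laden simplification: the Fourier integral is replaced by a finite sum, the spectrum is stored as a dictionary mapping each frequency $\omega$ to its coefficient, and the names `ideal_low_pass` and `evaluate` are hypothetical.

```python
import cmath
import math

def ideal_low_pass(spectrum, cutoff):
    """Apply p(omega) = 1 for |omega| <= cutoff and p(omega) = 0 otherwise."""
    return {omega: c for omega, c in spectrum.items() if abs(omega) <= cutoff}

def evaluate(spectrum, t):
    """Value at time t of the signal sum_omega c(omega) * exp(i*omega*t)."""
    return sum(c * cmath.exp(1j * omega * t) for omega, c in spectrum.items())

# Sample input (an assumption): f(t) = 4*cos(t) + cos(5*t) in complex form.
signal = {1.0: 2.0, -1.0: 2.0, 5.0: 0.5, -5.0: 0.5}
filtered = ideal_low_pass(signal, cutoff=3.0)

if __name__ == "__main__":
    t = 0.7
    print("input :", evaluate(signal, t).real)
    print("output:", evaluate(filtered, t).real, "expected", 4 * math.cos(t))
```

With cutoff $\Omega = 3$ the harmonic of frequency 5 is removed and the harmonic of frequency 1 passes unchanged, which is exactly the behavior prescribed by (18.85).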

Let us now turn to the mathematical side of the matter and to a more careful study 
of the concepts that arise. 


b. Definition of the Fourier Transform and the Fourier Integral 

In accordance with formulas (18.80) and (18.82) we make the following definition. 
Definition 1 The function

$$\mathcal{F}[f](\xi) := \frac{1}{2\pi}\int_{-\infty}^{\infty} f(x)e^{-i\xi x}\,dx \tag{18.86}$$

is the Fourier transform of the function $f : \mathbb{R} \to \mathbb{C}$.

The integral here is understood in the sense of the principal value

$$\int_{-\infty}^{\infty} f(x)e^{-i\xi x}\,dx := \lim_{A\to+\infty}\int_{-A}^{A} f(x)e^{-i\xi x}\,dx,$$

and we assume that it exists.

If $f : \mathbb{R} \to \mathbb{C}$ is absolutely integrable on $\mathbb{R}$, then, since $|f(x)e^{-ix\xi}| = |f(x)|$ for
$x, \xi \in \mathbb{R}$, the Fourier transform (18.86) is defined, and the integral (18.86) converges
absolutely and uniformly with respect to $\xi$ on the entire line $\mathbb{R}$.


Definition 2 If $c(\xi) = \mathcal{F}[f](\xi)$ is the Fourier transform of $f : \mathbb{R} \to \mathbb{C}$, then the
integral assigned to $f$,

$$f(x) \sim \int_{-\infty}^{\infty} c(\xi)e^{ix\xi}\,d\xi, \tag{18.87}$$

understood as a principal value, is called the Fourier integral of $f$.


The Fourier coefficients and the Fourier series of a periodic function are thus the 
discrete analog of the Fourier transform and the Fourier integral respectively. 

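Definition 3 The following integrals, understood as principal values,

$$\mathcal{F}_c[f](\xi) := \frac{1}{\pi}\int_{-\infty}^{\infty} f(x)\cos\xi x\,dx, \tag{18.88}$$

$$\mathcal{F}_s[f](\xi) := \frac{1}{\pi}\int_{-\infty}^{\infty} f(x)\sin\xi x\,dx, \tag{18.89}$$

are called respectively the Fourier cosine transform and the Fourier sine transform
of the function $f$.

Setting $c(\xi) = \mathcal{F}[f](\xi)$, $a(\xi) = \mathcal{F}_c[f](\xi)$, and $b(\xi) = \mathcal{F}_s[f](\xi)$, we obtain the
relation that is already partly familiar to us from Fourier series

$$c(\xi) = \frac{1}{2}\bigl(a(\xi) - ib(\xi)\bigr). \tag{18.90}$$

As can be seen from relations (18.88) and (18.89),

$$a(-\xi) = a(\xi), \qquad b(-\xi) = -b(\xi). \tag{18.91}$$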

Formulas (18.90) and (18.91) show that Fourier transforms are completely determined
on the entire real line $\mathbb{R}$ if they are known for nonnegative values of the
argument.

From the physical point of view this is a completely natural fact: the spectrum
of a signal needs to be known for frequencies $\omega \ge 0$; the negative frequencies $\alpha$ in
(18.80) and (18.82) result from the form in which they are written. Indeed,

$$\int_{-A}^{A} c(\xi)e^{ix\xi}\,d\xi = \Bigl(\int_{-A}^{0} + \int_0^{A}\Bigr)c(\xi)e^{ix\xi}\,d\xi = \int_0^{A}\bigl(c(\xi)e^{ix\xi} + c(-\xi)e^{-ix\xi}\bigr)\,d\xi = \int_0^{A}\bigl(a(\xi)\cos x\xi + b(\xi)\sin x\xi\bigr)\,d\xi,$$

and hence the Fourier integral (18.87) can be represented as

$$\int_0^{\infty}\bigl(a(\xi)\cos x\xi + b(\xi)\sin x\xi\bigr)\,d\xi, \tag{18.87'}$$

which is in complete agreement with the classical form of a Fourier series. If the
function $f$ is real-valued, it follows from formulas (18.90) and (18.91) that

$$c(-\xi) = \overline{c(\xi)}, \tag{18.92}$$

since in this case $a(\xi)$ and $b(\xi)$ are real-valued functions on $\mathbb{R}$, as one can see
from their definitions (18.88) and (18.89). On the other hand, under the assumption
$\overline{f(x)} = f(x)$, Eq. (18.92) can be obtained immediately from the definition (18.86)
of the Fourier transform, if we take into account that the conjugation sign can be
moved under the integral sign. This last observation allows us to conclude that

$$\mathcal{F}[\bar f\,](-\xi) = \overline{\mathcal{F}[f](\xi)} \tag{18.93}$$

for every function $f : \mathbb{R} \to \mathbb{C}$.

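It is also useful to note that if $f$ is a real-valued even function, that is,
$\overline{f(x)} = f(x) = f(-x)$, then

$$\overline{\mathcal{F}_c[f](\xi)} = \mathcal{F}_c[f](\xi), \qquad \mathcal{F}_s[f](\xi) \equiv 0, \tag{18.94}$$

and if $f$ is a real-valued odd function, that is, $\overline{f(x)} = f(x) = -f(-x)$, then

$$\mathcal{F}_c[f](\xi) \equiv 0, \qquad \overline{\mathcal{F}_s[f](\xi)} = \mathcal{F}_s[f](\xi), \tag{18.95}$$

$$\overline{\mathcal{F}[f](\xi)} = -\mathcal{F}[f](\xi) = \mathcal{F}[f](-\xi);$$

and if $f$ is a purely imaginary function, that is, $\overline{f(x)} = -f(x)$, then

$$\mathcal{F}[f](-\xi) = -\overline{\mathcal{F}[f](\xi)}. \tag{18.96}$$

We remark that if $f$ is a real-valued function, its Fourier integral (18.87') can
also be written as

$$\int_0^{\infty}\sqrt{a^2(\xi) + b^2(\xi)}\,\cos\bigl(x\xi + \varphi(\xi)\bigr)\,d\xi = 2\int_0^{\infty}|c(\xi)|\cos\bigl(x\xi + \varphi(\xi)\bigr)\,d\xi,$$

where $\varphi(\xi) = -\arctan\frac{b(\xi)}{a(\xi)} = \arg c(\xi)$.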


Example 3 Let us find the Fourier transform of $f(t) = \frac{\sin at}{t}$ (assuming $f(0) = a \in \mathbb{R}$).

$$\mathcal{F}[f](\alpha) = \lim_{A\to+\infty}\frac{1}{2\pi}\int_{-A}^{A}\frac{\sin at}{t}e^{-i\alpha t}\,dt = \lim_{A\to+\infty}\frac{1}{2\pi}\int_{-A}^{A}\frac{\sin at\cos\alpha t}{t}\,dt = \frac{2}{2\pi}\int_0^{+\infty}\frac{\sin at\cos\alpha t}{t}\,dt =$$

$$= \frac{1}{2\pi}\int_0^{+\infty}\Bigl(\frac{\sin(a+\alpha)t}{t} + \frac{\sin(a-\alpha)t}{t}\Bigr)\,dt = \frac{1}{2\pi}\bigl(\operatorname{sgn}(a+\alpha) + \operatorname{sgn}(a-\alpha)\bigr)\int_0^{\infty}\frac{\sin u}{u}\,du = \begin{cases} \frac{1}{2}\operatorname{sgn} a, & \text{if } |\alpha| < |a|, \\ 0, & \text{if } |\alpha| > |a|, \end{cases}$$

since we know the value of the Dirichlet integral

$$\int_0^{\infty}\frac{\sin u}{u}\,du = \frac{\pi}{2}. \tag{18.97}$$

Hence if we assume $a > 0$ and take the function $f(t) = 2h\frac{\sin at}{t}$ of Eq. (18.84),
we find, as we should have expected, that the Fourier transform is the spectrum of
this function exhibited in relations (18.83).

The function $f$ in Example 3 is not absolutely integrable on $\mathbb{R}$, and its Fourier
transform has discontinuities. That the Fourier transform of an absolutely integrable
function has no discontinuities is attested by the following lemma.


Lemma 1 If the function $f : \mathbb{R} \to \mathbb{C}$ is locally integrable and absolutely integrable
on $\mathbb{R}$, then

a) its Fourier transform $\mathcal{F}[f](\xi)$ is defined for every value $\xi \in \mathbb{R}$;

b) $\mathcal{F}[f] \in C(\mathbb{R}, \mathbb{C})$;

c) $\sup_{\xi}|\mathcal{F}[f](\xi)| \le \frac{1}{2\pi}\int_{-\infty}^{\infty}|f(x)|\,dx$;

d) $\mathcal{F}[f](\xi) \to 0$ as $\xi \to \infty$.

Proof We have already noted that $|f(x)e^{-ix\xi}| \le |f(x)|$, from which it follows that
the integral (18.86) converges absolutely and uniformly with respect to $\xi \in \mathbb{R}$. This
fact simultaneously proves parts a) and c).

Part d) follows from the Riemann-Lebesgue lemma (see Sect. 18.2).

For a fixed finite $A > 0$, the estimate

$$\Bigl|\int_{-A}^{A} f(x)\bigl(e^{-ix(\xi+h)} - e^{-ix\xi}\bigr)\,dx\Bigr| \le \sup_{|x|\le A}\bigl|e^{-ixh} - 1\bigr|\int_{-A}^{A}|f(x)|\,dx$$

establishes that the integral

$$\frac{1}{2\pi}\int_{-A}^{A} f(x)e^{-ix\xi}\,dx$$

is continuous with respect to $\xi$; and the uniform convergence of this integral as
$A \to +\infty$ enables us to conclude that $\mathcal{F}[f] \in C(\mathbb{R}, \mathbb{C})$. □


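Example 4 Let us find the Fourier transform of the function $f(t) = e^{-t^2/2}$:

$$\mathcal{F}[f](\alpha) = \frac{1}{2\pi}\int_{-\infty}^{+\infty} e^{-t^2/2}e^{-i\alpha t}\,dt = \frac{1}{2\pi}\int_{-\infty}^{+\infty} e^{-t^2/2}\cos\alpha t\,dt.$$

Differentiating this last integral with respect to the parameter $\alpha$ and then integrating
by parts, we find that

$$\frac{d\mathcal{F}[f]}{d\alpha}(\alpha) + \alpha\mathcal{F}[f](\alpha) = 0,$$

or

$$\frac{d}{d\alpha}\ln\mathcal{F}[f](\alpha) = -\alpha.$$

It follows that $\mathcal{F}[f](\alpha) = ce^{-\alpha^2/2}$, where $c = \mathcal{F}[f](0)$ is a constant which, using the
Euler-Poisson integral (see Example 17 of Sect. 17.2), we find from the relation

$$\int_{-\infty}^{+\infty} e^{-t^2/2}\,dt = \sqrt{2\pi}.$$

Thus we have found that $\mathcal{F}[f](\alpha) = \frac{1}{\sqrt{2\pi}}e^{-\alpha^2/2}$, and simultaneously shown that
$\mathcal{F}_c[f](\alpha) = \sqrt{\frac{2}{\pi}}\,e^{-\alpha^2/2}$ and $\mathcal{F}_s[f](\alpha) = 0$.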
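
The result of Example 4 can be confirmed by direct numerical integration with the normalization of Definition 1, that is, with the factor $\frac{1}{2\pi}$ in front of the integral, for which the closed form is $\frac{1}{\sqrt{2\pi}}e^{-\alpha^2/2}$. The following is a sketch under stated assumptions: the line is truncated to $[-12, 12]$, a midpoint rule is used, and the function names are illustrative.

```python
import math

def gaussian_transform(alpha, half_width=12.0, steps=24000):
    """Midpoint-rule value of (1/(2*pi)) * integral exp(-t^2/2) * cos(alpha*t) dt.

    The integral over the line is truncated to [-half_width, half_width];
    the discarded tails are of order exp(-half_width**2 / 2).
    """
    h = 2 * half_width / steps
    total = 0.0
    for j in range(steps):
        t = -half_width + (j + 0.5) * h
        total += math.exp(-t * t / 2) * math.cos(alpha * t)
    return total * h / (2 * math.pi)

def closed_form(alpha):
    """(1/sqrt(2*pi)) * exp(-alpha^2/2), the value found in Example 4."""
    return math.exp(-alpha * alpha / 2) / math.sqrt(2 * math.pi)

if __name__ == "__main__":
    for alpha in (0.0, 1.0, 2.5):
        print(alpha, gaussian_transform(alpha), closed_form(alpha))
```

Only the cosine part is integrated, because the sine part vanishes for an even function, in agreement with $\mathcal{F}_s[f] = 0$.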


c. Normalization of the Fourier Transform 

We obtained the Fourier transform (18.80) and the Fourier integral (18.82) as the
natural continuous analogs of the Fourier coefficients $c_k = \frac{1}{2\pi}\int_{-\pi}^{\pi} f(x)e^{-ikx}\,dx$
and the Fourier series $\sum_{-\infty}^{\infty} c_ke^{ikx}$ of a periodic function $f$ in the trigonometric
system $\{e^{ikx};\ k \in \mathbb{Z}\}$. This system is not orthonormal, and only the ease of writing
a trigonometric Fourier series in it has caused it to be used traditionally instead of
the more natural orthonormal system $\bigl\{\frac{1}{\sqrt{2\pi}}e^{ikx};\ k \in \mathbb{Z}\bigr\}$. In this normalized system
the Fourier series has the form $\sum_{-\infty}^{\infty} \hat c_k\frac{1}{\sqrt{2\pi}}e^{ikx}$, and the Fourier coefficients are
defined by the formulas $\hat c_k = \frac{1}{\sqrt{2\pi}}\int_{-\pi}^{\pi} f(x)e^{-ikx}\,dx$.

The continuous analogs of such natural Fourier coefficients and such a Fourier
series would be the Fourier transform

$$\hat f(\xi) := \frac{1}{\sqrt{2\pi}}\int_{-\infty}^{\infty} f(x)e^{-ix\xi}\,dx \tag{18.98}$$

and the Fourier integral

$$f(x) = \frac{1}{\sqrt{2\pi}}\int_{-\infty}^{\infty} \hat f(\xi)e^{ix\xi}\,d\xi, \tag{18.99}$$

which differ from those considered above only in the normalizing coefficient.

In the symmetric formulas (18.98) and (18.99) the Fourier “coefficient” and the 
Fourier “series” practically coalesce, and so in the future we shall essentially be 
interested only in the properties of the integral transform (18.98), calling it the nor- 
malized Fourier transform or, where no confusion can arise, simply the Fourier 
transform of the function /. 

In general the name integral operator or integral transform is customarily given
to an operator $A$ that acts on a function $f$ according to a rule

$$A(f)(y) = \int_X K(x, y)f(x)\,dx,$$

where $K(x, y)$ is a given function called the kernel of the integral operator, and
$X \subset \mathbb{R}^n$ is the set over which the integration extends and on which the integrands
are assumed to be defined. Since $y$ is a free parameter in some set $Y$, it follows that
$A(f)$ is a function on $Y$.

In mathematics there are many important integral transforms, and among them
the Fourier transform occupies one of the key positions. The reasons for this
situation go very deep and involve the remarkable properties of the transformation
(18.98), which we shall to some extent describe and illustrate in action in the
remaining part of this section.

Thus, we shall study the normalized Fourier transform (18.98). 

Along with the notation $\hat f$ for the normalized Fourier transform, we introduce
the notation

$$\tilde f(\xi) := \frac{1}{\sqrt{2\pi}}\int_{-\infty}^{\infty} f(x)e^{i\xi x}\,dx, \tag{18.100}$$

that is, $\tilde f(\xi) = \hat f(-\xi)$.

Formulas (18.98) and (18.99) say that

$$\tilde{\hat f} = \hat{\tilde f} = f, \tag{18.101}$$

that is, the integral transforms (18.98) and (18.99) are mutually inverse to each other.
Hence if (18.98) is the Fourier transform, then it is natural to call the integral
operator (18.100) the inverse Fourier transform.

We shall discuss in detail below certain remarkable properties of the Fourier
transform and justify them. For example

$$\widehat{f^{(n)}}(\xi) = (i\xi)^n\hat f(\xi),$$

$$\widehat{f * g} = \sqrt{2\pi}\,\hat f\cdot\hat g,$$

$$\|\hat f\| = \|f\|.$$

That is, the Fourier transform maps the operator of differentiation into the operator
of multiplication by the independent variable; the Fourier transform of the
convolution of functions amounts to multiplying the transforms; the Fourier transform
preserves the norm (Parseval's equality), and is therefore an isometry of the
corresponding function space.

But we shall begin with the inversion formula (18.101). 

For another convenient normalization of the Fourier transform see Problem 10 
below. 


d. Sufficient Conditions for a Function to be Representable as a Fourier 
Integral 

We shall now prove a theorem that is completely analogous in both form and content
to the theorem on convergence of a trigonometric Fourier series at a point.
To preserve the familiar appearance of our earlier formulas and transformations to
the maximum extent, we shall use the nonnormalized Fourier transform $c(\xi)$ in the
present part of this subsection, together with its rather cumbersome but sometimes
convenient notation $\mathcal{F}[f](\xi)$. Afterwards, when studying the integral Fourier transform
as such, we shall as a rule work with the normalized Fourier transform $\hat f$ of
the function $f$.

Theorem 1 (Convergence of the Fourier integral at a point) Let $f : \mathbb{R} \to \mathbb{C}$ be an
absolutely integrable function that is piecewise continuous on each finite closed
interval of the real axis $\mathbb{R}$.

If the function $f$ satisfies the Dini conditions at a point $x \in \mathbb{R}$, then its Fourier
integral (18.82), (18.87), (18.87'), (18.99) converges at that point to the value
$\frac{1}{2}\bigl(f(x_-) + f(x_+)\bigr)$, equal to half the sum of the left and right-hand limits of the
function at that point.

Proof By Lemma 1 the Fourier transform $c(\xi) = \mathcal{F}[f](\xi)$ of the function $f$ is continuous
on $\mathbb{R}$ and hence integrable on every interval $[-A, A]$. Just as we transformed
the partial sum of the Fourier series, we now carry out the following transformations
of the partial Fourier integral:

$$S_A(x) = \int_{-A}^{A} c(\xi)e^{ix\xi}\,d\xi = \int_{-A}^{A}\Bigl(\frac{1}{2\pi}\int_{-\infty}^{\infty} f(t)e^{-it\xi}\,dt\Bigr)e^{ix\xi}\,d\xi = \frac{1}{2\pi}\int_{-\infty}^{\infty} f(t)\Bigl(\int_{-A}^{A} e^{i(x-t)\xi}\,d\xi\Bigr)\,dt =$$

$$= \frac{1}{2\pi}\int_{-\infty}^{\infty} f(t)\,\frac{e^{i(x-t)A} - e^{-i(x-t)A}}{i(x-t)}\,dt = \frac{1}{\pi}\int_{-\infty}^{\infty} f(t)\,\frac{\sin(x-t)A}{x-t}\,dt =$$

$$= \frac{1}{\pi}\int_{-\infty}^{\infty} f(x+u)\,\frac{\sin Au}{u}\,du = \frac{1}{\pi}\int_0^{\infty}\bigl(f(x-u) + f(x+u)\bigr)\frac{\sin Au}{u}\,du.$$

The change in the order of integration at the second equality from the beginning
of the computation is legal. In fact, in view of the piecewise continuity of $f$, for
every finite $B > 0$ we have the equality

$$\int_{-A}^{A}\Bigl(\frac{1}{2\pi}\int_{-B}^{B} f(t)e^{-it\xi}\,dt\Bigr)e^{ix\xi}\,d\xi = \frac{1}{2\pi}\int_{-B}^{B} f(t)\Bigl(\int_{-A}^{A} e^{i(x-t)\xi}\,d\xi\Bigr)\,dt,$$

from which as $B \to +\infty$, taking account of the uniform convergence of the integral
$\int_{-B}^{B} f(t)e^{-it\xi}\,dt$ with respect to $\xi$, we obtain the equality we need.

We now use the value of the Dirichlet integral (18.97) and complete our transformation:

$$S_A(x) - \frac{f(x_-) + f(x_+)}{2} = \frac{1}{\pi}\int_0^{+\infty}\frac{\bigl(f(x-u) - f(x_-)\bigr) + \bigl(f(x+u) - f(x_+)\bigr)}{u}\sin Au\,du.$$

The resulting integral tends to zero as $A \to \infty$. We shall explain this and thereby
finish the proof of the theorem.

We represent this integral as the sum of the integrals over the interval $]0, 1]$ and
over the interval $[1, +\infty[$. The first of these two integrals tends to zero as $A \to +\infty$
in view of the Dini conditions and the Riemann-Lebesgue lemma. The second integral
is the sum of four integrals corresponding to the four terms $f(x-u)$, $f(x+u)$,
$f(x_-)$ and $f(x_+)$. The Riemann-Lebesgue lemma applies to the first two of these
four integrals, and the last two can be brought into the following form, up to a constant
factor:

$$\int_1^{+\infty}\frac{\sin Au}{u}\,du = \int_A^{+\infty}\frac{\sin v}{v}\,dv.$$

But as $A \to +\infty$ this last integral tends to zero, since the Dirichlet integral
(18.97) converges. □


Remark 1 In the proof of Theorem 1 we have actually studied the convergence
of the integral as a principal value. But if we compare the notations (18.87) and
(18.87'), it becomes obvious that it is precisely this interpretation of the integral that
corresponds to convergence of the integral (18.87').

From this theorem we obtain in particular

Corollary 1 Let $f : \mathbb{R} \to \mathbb{C}$ be a continuous absolutely integrable function.

If the function $f$ is differentiable at each point $x \in \mathbb{R}$ or has finite one-sided
derivatives or satisfies a Hölder condition, then it is represented by its Fourier integral.

Hence for functions of these classes both equalities (18.80) and (18.82) or (18.98)
and (18.99) hold, and we have thus proved the inversion formula for the Fourier
transform for such functions.

Let us consider several examples. 

Example 5 Assume that the signal $v(t) = P(f)(t)$ emerging from the device $P$
considered in Example 2 is known, and we wish to find the input signal $f(t)$ entering
the device $P$.

In Example 2 we have shown that $f$ and $v$ are connected by the relation

$$v(t) = \int_{-\infty}^{\infty} c(\omega)p(\omega)e^{i\omega t}\,d\omega,$$

where $c(\omega) = \mathcal{F}[f](\omega)$ is the spectrum of the signal $f$ (the nonnormalized Fourier
transform of the function $f$) and $p$ is the spectral characteristic of the device $P$.
Assuming all these functions are sufficiently regular, from the theorem just proved
we conclude that then

$$c(\omega)p(\omega) = \mathcal{F}[v](\omega).$$

From this we find $c(\omega) = \mathcal{F}[f](\omega)$. Knowing $c(\omega)$, we find the signal $f$ using
the Fourier integral (18.87).

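Example 6 Let $a > 0$ and

$$f(x) = \begin{cases} e^{-ax} & \text{for } x > 0, \\ 0 & \text{for } x \le 0. \end{cases}$$

Then

$$\mathcal{F}[f](\xi) = \frac{1}{2\pi}\int_0^{+\infty} e^{-ax}e^{-i\xi x}\,dx = \frac{1}{2\pi}\,\frac{1}{a + i\xi}.$$

In discussing the definition of the Fourier transform, we have already noted a
number of its obvious properties in Part b of the present subsection. We note further
that if $f_-(x) := f(-x)$, then $\mathcal{F}[f_-](\xi) = \mathcal{F}[f](-\xi)$. This is an elementary change
of variable in the integral.

We now take the function $e^{-a|x|} = f(x) + f(-x) =: \varphi(x)$.

Then

$$\mathcal{F}[\varphi](\xi) = \mathcal{F}[f](\xi) + \mathcal{F}[f](-\xi) = \frac{1}{\pi}\,\frac{a}{a^2 + \xi^2}.$$

If we now take the function $\psi(x) = f(x) - f(-x)$, which is an odd extension
of the function $e^{-ax}$, $x > 0$, to the entire real line, then

$$\mathcal{F}[\psi](\xi) = \mathcal{F}[f](\xi) - \mathcal{F}[f](-\xi) = -\frac{i}{\pi}\,\frac{\xi}{a^2 + \xi^2}.$$

Using Theorem 1, or more precisely the corollary to it, we find that

$$\frac{1}{2\pi}\int_{-\infty}^{\infty}\frac{e^{ix\xi}}{a + i\xi}\,d\xi = \begin{cases} e^{-ax}, & \text{if } x > 0, \\ \frac{1}{2}, & \text{if } x = 0, \\ 0, & \text{if } x < 0; \end{cases}$$

$$\frac{a}{\pi}\int_{-\infty}^{\infty}\frac{e^{ix\xi}}{a^2 + \xi^2}\,d\xi = e^{-a|x|};$$

$$-\frac{i}{\pi}\int_{-\infty}^{\infty}\frac{\xi e^{ix\xi}}{a^2 + \xi^2}\,d\xi = \begin{cases} e^{-ax}, & \text{if } x > 0, \\ 0, & \text{if } x = 0, \\ -e^{ax}, & \text{if } x < 0. \end{cases}$$

All the integrals here are understood in the sense of the principal value, although
the second one, in view of its absolute convergence, can also be understood in the
sense of an ordinary improper integral.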
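
The half-sum value at the jump predicted by Theorem 1 can be observed numerically for the function of Example 6. The sketch below makes several assumptions: it takes $a = 1$ by default, uses the spectrum $c(\xi) = \frac{1}{2\pi}\frac{1}{a + i\xi}$, and approximates the partial Fourier integral $S_A(x) = \int_{-A}^{A} c(\xi)e^{ix\xi}\,d\xi$ by a midpoint rule; the function name is illustrative.

```python
import cmath
import math

def partial_fourier_integral(x, A, a=1.0, steps=40000):
    """Midpoint-rule value of S_A(x) = integral_{-A}^{A} c(xi) * exp(i*x*xi) d(xi),

    where c(xi) = 1 / (2*pi*(a + i*xi)) is the nonnormalized Fourier transform
    of f(x) = exp(-a*x) for x > 0 and f(x) = 0 for x <= 0.
    """
    h = 2 * A / steps
    total = 0j
    for j in range(steps):
        xi = -A + (j + 0.5) * h
        total += cmath.exp(1j * x * xi) / (2 * math.pi * (a + 1j * xi))
    return (total * h).real  # the imaginary parts cancel by symmetry

if __name__ == "__main__":
    A = 200.0
    print("x =  0:", partial_fourier_integral(0.0, A), "half-sum 0.5")
    print("x =  1:", partial_fourier_integral(1.0, A), "f(1) =", math.exp(-1))
    print("x = -1:", partial_fourier_integral(-1.0, A), "f(-1) = 0")
```

At $x = 0$ the exact value is $S_A(0) = \frac{1}{\pi}\arctan\frac{A}{a}$, which tends to $\frac{1}{2}$; the convergence at the other points is of order $\frac{1}{A}$, so a fairly large $A$ is needed.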

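Separating the real and imaginary parts in these last two integrals, we find the
Laplace integrals we have encountered earlier

$$\int_0^{+\infty}\frac{\cos x\xi}{a^2 + \xi^2}\,d\xi = \frac{\pi}{2a}e^{-a|x|}, \qquad \int_0^{+\infty}\frac{\xi\sin x\xi}{a^2 + \xi^2}\,d\xi = \frac{\pi}{2}e^{-a|x|}\operatorname{sgn} x.$$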

Example 7 On the basis of Example 4 it is easy to find (by an elementary change of
variable) that if

$$f(x) = e^{-a^2x^2}, \quad \text{then} \quad \hat f(\xi) = \frac{1}{\sqrt{2}\,a}\,e^{-\frac{\xi^2}{4a^2}}.$$

It is very instructive to trace the simultaneous evolution of the graphs of the functions
$f$ and $\hat f$ as the parameter $a$ varies from $1/\sqrt{2}$ to 0. The more "concentrated"
one of the functions is, the more "smeared" the other is. This circumstance is closely
connected with the Heisenberg uncertainty principle in quantum mechanics. (In this
connection see Problems 6 and 7.)

Remark 2 In completing the discussion of the question of the possibility of representing
a function by a Fourier integral, we note that, as Examples 1 and 3 show, the
conditions on $f$ stated in Theorem 1 and its corollary are sufficient but not necessary
for such a representation to be possible.

18.3.2 The Connection of the Differential and Asymptotic
Properties of a Function and Its Fourier Transform

a. Smoothness of a Function and the Rate of Decrease of Its Fourier 
Transform 

It follows from the Riemann-Lebesgue lemma that the Fourier transform of any 
absolutely integrable function on R tends to zero at infinity. This has already been 
noted in Lemma 1 proved above. We now show that, like the Fourier coefficients, 
the smoother the function, the faster its Fourier transform tends to zero. The dual 
fact is that the faster a function tends to zero, the smoother its Fourier transform. 
We begin with the following auxiliary proposition. 

Lemma 2 Let $f : \mathbb{R} \to \mathbb{C}$ be a continuous function having a locally piecewise continuous
derivative $f'$ on $\mathbb{R}$. Given this,

a) if the function $f'$ is integrable on $\mathbb{R}$, then $f(x)$ has a limit both as $x \to -\infty$
and as $x \to +\infty$;

b) if the functions $f$ and $f'$ are integrable on $\mathbb{R}$, then $f(x) \to 0$ as $x \to \infty$.

Proof Under these restrictions on the functions $f$ and $f'$ the Newton-Leibniz formula
holds

$$f(x) = f(0) + \int_0^{x} f'(t)\,dt.$$

In conditions a) the right-hand side of this equality has a limit both as $x \to +\infty$
and as $x \to -\infty$.

If a function $f$ having a limit at infinity is integrable on $\mathbb{R}$, then both of these
limits must obviously be zero. □

We now prove 

Proposition 1 (Connection between the smoothness of a function and the rate of
decrease of its Fourier transform) If $f \in C^{(k)}(\mathbb{R}, \mathbb{C})$ $(k = 0, 1, \ldots)$ and all
the functions $f, f', \ldots, f^{(k)}$ are absolutely integrable on $\mathbb{R}$, then

a) for every $n \in \{0, 1, \ldots, k\}$

$$\widehat{f^{(n)}}(\xi) = (i\xi)^n \hat f(\xi), \tag{18.102}$$

b) $\hat f(\xi) = o\bigl(\frac{1}{\xi^k}\bigr)$ as $\xi \to \infty$.

Proof If $k = 0$, then a) holds trivially and b) follows from the Riemann-Lebesgue
lemma.

Let $k > 0$. By Lemma 2 the functions $f, f', \ldots, f^{(k-1)}$ tend to zero as $x \to \infty$.
Taking this into account, we integrate by parts,

$$\widehat{f^{(k)}}(\xi) := \frac{1}{\sqrt{2\pi}} \int_{-\infty}^{\infty} f^{(k)}(x) e^{-i\xi x}\,dx =$$
$$= \frac{1}{\sqrt{2\pi}} \Bigl( f^{(k-1)}(x) e^{-i\xi x} \Big|_{x=-\infty}^{+\infty} + (i\xi) \int_{-\infty}^{\infty} f^{(k-1)}(x) e^{-i\xi x}\,dx \Bigr) =$$
$$= \cdots = \frac{(i\xi)^k}{\sqrt{2\pi}} \int_{-\infty}^{\infty} f(x) e^{-i\xi x}\,dx = (i\xi)^k \hat f(\xi).$$

Thus Eq. (18.102) is established. This is a very important relation, and we shall
return to it.

We have shown that $\hat f(\xi) = (i\xi)^{-k}\,\widehat{f^{(k)}}(\xi)$, but by the Riemann-Lebesgue
lemma $\widehat{f^{(k)}}(\xi) \to 0$ as $\xi \to \infty$ and hence b) is also proved. □
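
Relation (18.102) with $n = 1$ can be checked numerically. The following is only a
sketch: the helper name `fourier`, the truncation length `L`, the number of nodes
`n`, and the test function $f(x) = e^{-x^2/2}$ are illustrative assumptions that do
not come from the text, and the integral is replaced by a midpoint rule on a
truncated interval.

```python
import cmath
import math


def fourier(f, xi, L=12.0, n=4000):
    """Midpoint-rule approximation of (1/sqrt(2*pi)) * integral of
    f(x) * exp(-i*xi*x) over [-L, L]."""
    h = 2.0 * L / n
    total = 0j
    for j in range(n):
        x = -L + (j + 0.5) * h
        total += f(x) * cmath.exp(-1j * xi * x)
    return total * h / math.sqrt(2.0 * math.pi)


def f(x):
    # Illustrative rapidly decreasing function (an assumption, not from the text).
    return math.exp(-x * x / 2.0)


def df(x):
    # Its derivative f'(x).
    return -x * math.exp(-x * x / 2.0)


xi = 1.7
lhs = fourier(df, xi)             # transform of f'
rhs = 1j * xi * fourier(f, xi)    # (i*xi) times the transform of f
print(abs(lhs - rhs))             # difference of the two sides of (18.102)
```

For a smooth, rapidly decreasing integrand the midpoint rule is very accurate, so
the two sides should agree up to rounding error.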


b. The Rate of Decrease of a Function and the Smoothness of Its Fourier 
Transform 

In view of the nearly complete identity of the direct and inverse Fourier transforms 
the following proposition, dual to Proposition 1, holds. 

Proposition 2 (The connection between the rate of decrease of a function and the
smoothness of its Fourier transform) If a locally integrable function
$f : \mathbb{R} \to \mathbb{C}$ is such that the function $x^k f(x)$ is absolutely integrable on
$\mathbb{R}$, then

a) the Fourier transform of $f$ belongs to $C^{(k)}(\mathbb{R}, \mathbb{C})$;

b) the following equality holds:

$$\hat f^{(k)}(\xi) = (-i)^k\,\widehat{x^k f(x)}(\xi). \tag{18.103}$$

Proof For $k = 0$ relation (18.103) holds trivially, and the continuity of $\hat f(\xi)$
has already been proved in Lemma 1. If $k > 0$, then for $n < k$ we have the estimate
$|x^n f(x)| \le |x^k f(x)|$ at infinity, from which it follows that $x^n f(x)$ is
absolutely integrable. But $|x^n f(x) e^{-i\xi x}| = |x^n f(x)|$, which enables us to
invoke the uniform convergence of these integrals with respect to the parameter
$\xi$ and successively differentiate them under the integral sign:

$$\hat f(\xi) = \frac{1}{\sqrt{2\pi}} \int_{-\infty}^{\infty} f(x) e^{-i\xi x}\,dx,$$
$$\hat f'(\xi) = \frac{-i}{\sqrt{2\pi}} \int_{-\infty}^{\infty} x f(x) e^{-i\xi x}\,dx,$$
$$\cdots$$
$$\hat f^{(k)}(\xi) = \frac{(-i)^k}{\sqrt{2\pi}} \int_{-\infty}^{\infty} x^k f(x) e^{-i\xi x}\,dx.$$

By Lemma 1 this last integral is continuous on the entire real line. Hence indeed
$\hat f \in C^{(k)}(\mathbb{R}, \mathbb{C})$. □


c. The Space of Rapidly Decreasing Functions 

Definition 4 We denote the set of functions $f \in C^{(\infty)}(\mathbb{R}, \mathbb{C})$ satisfying the
condition

$$\sup_{x \in \mathbb{R}} \bigl| x^{\beta} f^{(\alpha)}(x) \bigr| < \infty$$

for all nonnegative integers $\alpha$ and $\beta$ by $S(\mathbb{R}, \mathbb{C})$ or more briefly by $S$. Such
functions are called rapidly decreasing functions (as $x \to \infty$).

The set of rapidly decreasing functions obviously forms a vector space under the
standard operations of addition of functions and multiplication of a function by a
complex number.

Example 8 The function $e^{-x^2}$ or, for example, all functions of compact support in
$C_0^{(\infty)}(\mathbb{R}, \mathbb{C})$ belong to $S$.

Lemma 3 The restriction of the Fourier transform to S is a vector-space automor- 
phism of S. 

Proof We first show that $(f \in S) \Rightarrow (\hat f \in S)$.

To do this we first remark that by Proposition 2a we have $\hat f \in C^{(\infty)}(\mathbb{R}, \mathbb{C})$.

We then remark that the operation of multiplication by $x^{\alpha}$ $(\alpha \ge 0)$ and the
operation $D$ of differentiation do not lead outside the class of rapidly decreasing
functions. Hence, for any nonnegative integers $\alpha$ and $\beta$ the relation $f \in S$
implies that the function $D^{\beta}(x^{\alpha} f(x))$ belongs to the space $S$. Its Fourier
transform tends to zero at infinity by the Riemann-Lebesgue lemma. But by formulas
(18.102) and (18.103)

$$\widehat{D^{\beta}(x^{\alpha} f(x))}(\xi) = i^{\alpha+\beta}\,\xi^{\beta}\,\hat f^{(\alpha)}(\xi),$$

and we have shown that $\xi^{\beta} \hat f^{(\alpha)}(\xi) \to 0$ as $\xi \to \infty$, that is, $\hat f \in S$.

We now show that $\hat S = S$, that is, that the Fourier transform maps $S$ onto the
whole space $S$.

We recall that the direct and inverse Fourier transforms are connected by the
simple relation $\tilde f(\xi) = \hat f(-\xi)$. Reversing the sign of the argument of the
function obviously is an operation that maps the set $S$ into itself. Hence the
inverse Fourier transform also maps $S$ into itself.

Finally, if $f$ is an arbitrary function in $S$, then by what has been proved
$\varphi = \tilde f \in S$ and by the inversion formula (18.101) we find that $f = \hat\varphi$.

The linearity of the Fourier transform is obvious, so that Lemma 3 is now
completely proved. □


18.3.3 The Main Structural Properties of the Fourier Transform 


a. Definitions, Notation, Examples 

We have made a rather detailed study above of the Fourier transform of a function
$f : \mathbb{R} \to \mathbb{C}$ defined on the real line. In particular, we have clarified the connection
that exists between the regularity properties of a function and the corresponding 
properties of its Fourier transform. Now that this question has been theoretically an- 
swered, we shall study the Fourier transform only of sufficiently regular functions 
so as to exhibit the fundamental technical properties of the Fourier transform in con- 
centrated form and without technical complications. In compensation we shall con- 
sider not only one-dimensional but also the multi-dimensional Fourier transform and 
derive its basic properties practically independently of what was discussed above. 

Those wishing to confine themselves to the one-dimensional case may assume 
that n = 1 below. 


Definition 5 Suppose $f : \mathbb{R}^n \to \mathbb{C}$ is a locally integrable function on $\mathbb{R}^n$. The
function

$$\hat f(\xi) := \frac{1}{(2\pi)^{n/2}} \int_{\mathbb{R}^n} f(x) e^{-i(\xi, x)}\,dx \tag{18.104}$$

is called the Fourier transform of the function $f$.

Here we mean that $x = (x_1, \ldots, x_n)$, $\xi = (\xi_1, \ldots, \xi_n)$,
$(\xi, x) = \xi_1 x_1 + \cdots + \xi_n x_n$, and the integral is regarded as convergent in the
following sense of principal value:

$$\int_{\mathbb{R}^n} \varphi(x_1, \ldots, x_n)\,dx_1 \cdots dx_n :=
\lim_{A \to +\infty} \int_{-A}^{A} \cdots \int_{-A}^{A} \varphi(x_1, \ldots, x_n)\,dx_1 \cdots dx_n.$$

In this case the multidimensional Fourier transform (18.104) can be regarded as $n$
one-dimensional Fourier transforms carried out with respect to each of the
variables $x_1, \ldots, x_n$.

Then, when the function $f$ is absolutely integrable, the question of the sense in
which the integral (18.104) is to be understood does not arise at all.

Let $\alpha = (\alpha_1, \ldots, \alpha_n)$ and $\beta = (\beta_1, \ldots, \beta_n)$ be multi-indices consisting of
nonnegative integers $\alpha_j, \beta_j$, $j = 1, \ldots, n$, and suppose, as always, that $D^{\alpha}$
denotes the differentiation operator
$\frac{\partial^{|\alpha|}}{\partial x_1^{\alpha_1} \cdots \partial x_n^{\alpha_n}}$ of order
$|\alpha| := \alpha_1 + \cdots + \alpha_n$ and $x^{\beta} := x_1^{\beta_1} \cdots x_n^{\beta_n}$.

Definition 6 We denote the set of functions $f \in C^{(\infty)}(\mathbb{R}^n, \mathbb{C})$ satisfying the
condition

$$\sup_{x \in \mathbb{R}^n} \bigl| x^{\beta} D^{\alpha} f(x) \bigr| < \infty$$

for all nonnegative multi-indices $\alpha$ and $\beta$ by the symbol $S(\mathbb{R}^n, \mathbb{C})$, or by $S$
where no confusion can arise. Such functions are said to be rapidly decreasing (as
$x \to \infty$).

The set $S$ with the algebraic operations of addition of functions and
multiplication of a function by a complex number is obviously a vector space.

Example 9 The function $e^{-|x|^2}$, where $|x|^2 = x_1^2 + \cdots + x_n^2$, and all the
functions in $C_0^{(\infty)}(\mathbb{R}^n, \mathbb{C})$ of compact support belong to $S$.

If $f \in S$, then the integral in relation (18.104) obviously converges absolutely
and uniformly with respect to $\xi$ on the entire space $\mathbb{R}^n$. Moreover, if $f \in S$,
then by standard rules this integral can be differentiated as many times as desired
with respect to any of the variables $\xi_1, \ldots, \xi_n$. Thus if $f \in S$, then
$\hat f \in C^{(\infty)}(\mathbb{R}^n, \mathbb{C})$.


Example 10 Let us find the Fourier transform of the function $\exp(-|x|^2/2)$. When
integrating rapidly decreasing functions one can obviously use Fubini’s theorem and
if necessary change the order of improper integrations without difficulty.

In the present case, using Fubini’s theorem and Example 4, we find

$$\frac{1}{(2\pi)^{n/2}} \int_{\mathbb{R}^n} e^{-|x|^2/2} \cdot e^{-i(\xi, x)}\,dx =
\prod_{j=1}^{n} \frac{1}{\sqrt{2\pi}} \int_{-\infty}^{\infty} e^{-x_j^2/2} e^{-i\xi_j x_j}\,dx_j =
\prod_{j=1}^{n} e^{-\xi_j^2/2} = e^{-|\xi|^2/2}.$$
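
The result of Example 10 can be checked numerically for $n = 2$. The sketch below
approximates the integral (18.104) by a midpoint rule on a square; the function
name `transform_2d`, the truncation length `L`, the grid size `n`, and the sample
point `xi` are illustrative assumptions, not part of the text.

```python
import cmath
import math

L, n = 8.0, 160
h = 2.0 * L / n
nodes = [-L + (j + 0.5) * h for j in range(n)]   # midpoint nodes on [-L, L]


def transform_2d(xi1, xi2):
    """Midpoint-rule approximation of (18.104) for n = 2 and
    f(x) = exp(-|x|^2 / 2), integrating over the square [-L, L]^2."""
    total = 0j
    for x1 in nodes:
        for x2 in nodes:
            total += math.exp(-(x1 * x1 + x2 * x2) / 2.0) * cmath.exp(
                -1j * (xi1 * x1 + xi2 * x2)
            )
    return total * h * h / (2.0 * math.pi)


xi = (0.5, -1.2)
value = transform_2d(*xi)
expected = math.exp(-(xi[0] ** 2 + xi[1] ** 2) / 2.0)   # exp(-|xi|^2 / 2)
print(abs(value - expected))
```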


We now state and prove the basic structural properties of the Fourier transform, 
assuming, so as to avoid technical complications, that the Fourier transform is being 
applied to functions of class S. This is approximately the same as learning to operate 
(compute) with rational numbers rather than the entire space R all at once. The 
process of completion is of the same type. On this account, see Problem 5. 


b. Linearity 

The linearity of the Fourier transform is obvious; it follows from the linearity of the 
integral. 


c. The Relation Between Differentiation and the Fourier Transform 


The following formulas hold:

$$\widehat{(D^{\alpha} f)}(\xi) = (i)^{|\alpha|}\,\xi^{\alpha}\,\hat f(\xi), \tag{18.105}$$

$$\widehat{(x^{\alpha} f(x))}(\xi) = (i)^{|\alpha|}\,D^{\alpha} \hat f(\xi). \tag{18.106}$$

Proof The first of these can be obtained, like formula (18.102), via integration by
parts (of course, with a preliminary use of Fubini’s theorem in the case of a space
$\mathbb{R}^n$ of dimension $n > 1$).

Formula (18.106) generalizes relation (18.103) and is obtained by direct
differentiation of (18.104) with respect to the parameters $\xi_1, \ldots, \xi_n$. □

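Remark 3 In view of the obvious estimate

$$|\hat f(\xi)| \le \frac{1}{(2\pi)^{n/2}} \int_{\mathbb{R}^n} |f(x)|\,dx < +\infty,$$

it follows from (18.105) that $\hat f(\xi) \to 0$ as $\xi \to \infty$ for every function $f \in S$,
since $D^{\alpha} f \in S$.

Next, the simultaneous use of formulas (18.105) and (18.106) enables us to write
that

$$\widehat{D^{\beta}(x^{\alpha} f(x))}(\xi) = (i)^{|\alpha|+|\beta|}\,\xi^{\beta} D^{\alpha} \hat f(\xi),$$

from which it follows that if $f \in S$, then for any nonnegative multi-indices $\alpha$
and $\beta$ we have $\xi^{\beta} D^{\alpha} \hat f(\xi) \to 0$ when $\xi \to \infty$ in $\mathbb{R}^n$. Thus we have shown that

$$(f \in S) \Rightarrow (\hat f \in S).$$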


d. The Inversion Formula 

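Definition 7 The operator defined (together with its notation) by the equality

$$\tilde f(\xi) := \frac{1}{(2\pi)^{n/2}} \int_{\mathbb{R}^n} f(x) e^{i(\xi, x)}\,dx, \tag{18.107}$$

is called the inverse Fourier transform.

The following Fourier inversion formula holds:

$$\tilde{\hat f} = \hat{\tilde f} = f, \tag{18.108}$$

or in the form of the Fourier integral:

$$f(x) = \frac{1}{(2\pi)^{n/2}} \int_{\mathbb{R}^n} \hat f(\xi) e^{i(x, \xi)}\,d\xi. \tag{18.109}$$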

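Using Fubini’s theorem one can immediately obtain formula (18.108) from the
corresponding formula (18.101) for the one-dimensional Fourier transform, but, as
promised, we shall give a brief independent proof of the formula.

Proof We first show that

$$\int_{\mathbb{R}^n} g(\xi) \hat f(\xi) e^{i(x, \xi)}\,d\xi = \int_{\mathbb{R}^n} \hat g(y) f(x + y)\,dy \tag{18.110}$$

for any functions $f, g \in S(\mathbb{R}^n, \mathbb{C})$. Both integrals are defined, since $f, g \in S$
and so by Remark 3 we also have $\hat f, \hat g \in S$.

Let us transform the integral on the left-hand side of the equality to be proved:

$$\int_{\mathbb{R}^n} g(\xi) \hat f(\xi) e^{i(x, \xi)}\,d\xi =
\int_{\mathbb{R}^n} g(\xi) \Bigl( \frac{1}{(2\pi)^{n/2}} \int_{\mathbb{R}^n} f(y) e^{-i(\xi, y)}\,dy \Bigr) e^{i(x, \xi)}\,d\xi =$$
$$= \int_{\mathbb{R}^n} \Bigl( \frac{1}{(2\pi)^{n/2}} \int_{\mathbb{R}^n} g(\xi) e^{-i(\xi, y - x)}\,d\xi \Bigr) f(y)\,dy =
\int_{\mathbb{R}^n} \hat g(y - x) f(y)\,dy = \int_{\mathbb{R}^n} \hat g(y) f(x + y)\,dy.$$

There is no doubt as to the legitimacy of the reversal in the order of integration,
since $f$ and $g$ are rapidly decreasing functions. Thus (18.110) is now verified.

We now remark that for every $\varepsilon > 0$

$$\frac{1}{(2\pi)^{n/2}} \int_{\mathbb{R}^n} g(\varepsilon\xi) e^{-i(y, \xi)}\,d\xi =
\frac{1}{(2\pi)^{n/2} \varepsilon^n} \int_{\mathbb{R}^n} g(u) e^{-i(y, u/\varepsilon)}\,du = \varepsilon^{-n}\,\hat g(y/\varepsilon),$$

so that, by Eq. (18.110)

$$\int_{\mathbb{R}^n} g(\varepsilon\xi) \hat f(\xi) e^{i(x, \xi)}\,d\xi =
\int_{\mathbb{R}^n} \varepsilon^{-n}\,\hat g(y/\varepsilon) f(x + y)\,dy = \int_{\mathbb{R}^n} \hat g(u) f(x + \varepsilon u)\,du.$$

Taking account of the absolute and uniform convergence with respect to $\varepsilon$ of the
extreme integrals in the last chain of equalities, we find, as $\varepsilon \to 0$,

$$g(0) \int_{\mathbb{R}^n} \hat f(\xi) e^{i(x, \xi)}\,d\xi = f(x) \int_{\mathbb{R}^n} \hat g(u)\,du.$$

Here we set $g(x) = e^{-|x|^2/2}$. In Example 10 we saw that $\hat g(u) = e^{-|u|^2/2}$.
Recalling the Euler-Poisson integral $\int_{-\infty}^{\infty} e^{-x^2}\,dx = \sqrt{\pi}$ and using Fubini’s
theorem, we conclude that $\int_{\mathbb{R}^n} e^{-|u|^2/2}\,du = (2\pi)^{n/2}$, and as a result, we
obtain Eq. (18.109). □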


Remark 4 In contrast to the single equality (18.109), which means that
$\tilde{\hat f} = f$, relations (18.108) also contain the equality $\hat{\tilde f} = f$. But this
relation follows immediately from the one proved, since $\tilde f(\xi) = \hat f(-\xi)$ and
$\widehat{f(-x)} = \widetilde{f(x)}$.

Remark 5 We have already seen (see Remark 3) that if $f \in S$, then $\hat f \in S$, and
hence $\tilde f \in S$ also, that is, $\hat S \subset S$ and $\tilde S \subset S$. We now conclude from the
relations $\tilde{\hat f} = \hat{\tilde f} = f$ that $\hat S = \tilde S = S$.


e. Parseval’s Equality 


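This is the name given to the relation

$$\langle f, g \rangle = \langle \hat f, \hat g \rangle, \tag{18.111}$$

which in expanded form means that

$$\int_{\mathbb{R}^n} f(x) \bar g(x)\,dx = \int_{\mathbb{R}^n} \hat f(\xi)\,\overline{\hat g}(\xi)\,d\xi. \tag{18.111'}$$

It follows in particular from (18.111) that

$$\|f\|^2 = \langle f, f \rangle = \langle \hat f, \hat f \rangle = \|\hat f\|^2. \tag{18.112}$$

From the geometric point of view, Eq. (18.111) means that the Fourier transform
preserves the inner product between functions (vectors of the space $S$), and hence
is an isometry of $S$.

The name “Parseval’s equality” is also sometimes given to the relation

$$\int_{\mathbb{R}^n} \hat f(\xi) g(\xi)\,d\xi = \int_{\mathbb{R}^n} f(x) \hat g(x)\,dx, \tag{18.113}$$

which is obtained from (18.110) by setting $x = 0$. The main Parseval equality
(18.111) is obtained from (18.113) by replacing $g$ with $\overline{\hat g}$ and using the fact
that $\widehat{(\overline{\hat g})} = \bar g$, since $\overline{\hat g} = \tilde{\bar g}$ and $\hat{\tilde g} = \tilde{\hat g} = g$.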
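
Equality (18.112) lends itself to a quick numerical check in the one-dimensional
case. The following is a sketch under stated assumptions: the test function
$f(x) = (1 + x)e^{-x^2/2}$, the helper names `grid` and `fhat`, and the grid
parameters are chosen only for illustration, and both integrals are approximated
by the midpoint rule on $[-10, 10]$.

```python
import cmath
import math


def grid(L, n):
    """Midpoint nodes of a uniform partition of [-L, L] into n cells."""
    h = 2.0 * L / n
    return [-L + (j + 0.5) * h for j in range(n)], h


def f(x):
    # Illustrative function of class S (an assumption, not from the text).
    return (1.0 + x) * math.exp(-x * x / 2.0)


xs, h = grid(10.0, 400)
fx = [f(x) for x in xs]


def fhat(xi):
    """Midpoint-rule approximation of the Fourier transform of f at xi."""
    s = sum(v * cmath.exp(-1j * xi * x) for v, x in zip(fx, xs))
    return s * h / math.sqrt(2.0 * math.pi)


norm_f_sq = h * sum(v * v for v in fx)                    # ||f||^2
norm_fhat_sq = h * sum(abs(fhat(xi)) ** 2 for xi in xs)   # ||f^||^2, same grid in xi
print(norm_f_sq, norm_fhat_sq)
```

For this particular $f$ both quantities can also be computed by hand: they equal
$\frac{3}{2}\sqrt{\pi}$.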

f. The Fourier Transform and Convolution 

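The following important relations hold

$$\widehat{(f * g)} = (2\pi)^{n/2}\,\hat f \cdot \hat g, \tag{18.114}$$

$$\widehat{(f \cdot g)} = (2\pi)^{-n/2}\,\hat f * \hat g \tag{18.115}$$

(sometimes called Borel’s formulas), which connect the operations of convolution
and multiplication of functions through the Fourier transform.

Let us prove these formulas:

Proof

$$\widehat{(f * g)}(\xi) = \frac{1}{(2\pi)^{n/2}} \int_{\mathbb{R}^n} \Bigl( \int_{\mathbb{R}^n} f(x - y) g(y)\,dy \Bigr) e^{-i(\xi, x)}\,dx =$$
$$= \frac{1}{(2\pi)^{n/2}} \int_{\mathbb{R}^n} g(y) e^{-i(\xi, y)} \Bigl( \int_{\mathbb{R}^n} f(x - y) e^{-i(\xi, x - y)}\,dx \Bigr) dy =$$
$$= \frac{1}{(2\pi)^{n/2}} \int_{\mathbb{R}^n} g(y) e^{-i(\xi, y)} \Bigl( \int_{\mathbb{R}^n} f(u) e^{-i(\xi, u)}\,du \Bigr) dy =$$
$$= \int_{\mathbb{R}^n} g(y) e^{-i(\xi, y)} \hat f(\xi)\,dy = (2\pi)^{n/2}\,\hat f(\xi) \hat g(\xi).$$

The legitimacy of the change in order of integration is not in doubt, given that
$f, g \in S$.

Formula (18.115) can be obtained by a similar computation if we use the inversion
formula (18.109). However, Eq. (18.115) can be derived from relation (18.114)
already proved if we recall that $\hat{\tilde f} = \tilde{\hat f} = f$, $\overline{\hat f} = \tilde{\bar f}$,
$\overline{\tilde f} = \hat{\bar f}$, and that $\overline{u \cdot v} = \bar u \cdot \bar v$, $\overline{u * v} = \bar u * \bar v$. □

Remark 6 If we set $\tilde f$ and $\tilde g$ in place of $f$ and $g$ in formulas (18.114) and
(18.115) and apply the inverse Fourier transform to both sides of the resulting
equalities, we arrive at the relations

$$\widetilde{f \cdot g} = (2\pi)^{-n/2}\,(\tilde f * \tilde g), \tag{18.114'}$$

$$\widetilde{f * g} = (2\pi)^{n/2}\,(\tilde f \cdot \tilde g). \tag{18.115'}$$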
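
Formula (18.114) can be illustrated numerically for $n = 1$, where the factor is
$\sqrt{2\pi}$. The sketch below is not a proof: the two Gaussians `f` and `g`, the
helper `fourier`, and the grid parameters are illustrative assumptions, and both
the convolution and the Fourier integrals are approximated by the midpoint rule.

```python
import cmath
import math

L, n = 10.0, 400
h = 2.0 * L / n
xs = [-L + (j + 0.5) * h for j in range(n)]   # midpoint nodes on [-L, L]


def f(x):
    return math.exp(-x * x / 2.0)


def g(x):
    return math.exp(-x * x)


def fourier(values, xi):
    """Midpoint-rule Fourier transform of a function sampled at the nodes xs."""
    s = sum(v * cmath.exp(-1j * xi * x) for v, x in zip(values, xs))
    return s * h / math.sqrt(2.0 * math.pi)


fx = [f(x) for x in xs]
gx = [g(x) for x in xs]
# (f * g)(x) = integral of f(x - y) g(y) dy, again by the midpoint rule.
conv = [h * sum(f(x - y) * gy for y, gy in zip(xs, gx)) for x in xs]

xi = 0.8
lhs = fourier(conv, xi)                                             # transform of f * g
rhs = math.sqrt(2.0 * math.pi) * fourier(fx, xi) * fourier(gx, xi)  # sqrt(2*pi) f^ g^
print(abs(lhs - rhs))
```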


18.3.4 Examples of Applications 

Let us now illustrate the Fourier transform (and some of the machinery of Fourier 
series) in action. 


a. The Wave Equation 

The successful use of the Fourier transform in the equations of mathematical physics 
is bound up (in its mathematical aspect) primarily with the fact that the Fourier 
transform replaces the operation of differentiation with the algebraic operation of 
multiplication. 

For example, suppose we are seeking a function $u : \mathbb{R} \to \mathbb{R}$ satisfying the
equation

$$a_0 u^{(n)}(x) + a_1 u^{(n-1)}(x) + \cdots + a_n u(x) = f(x),$$

where $a_0, \ldots, a_n$ are constant coefficients and $f$ is a known function. Applying
the Fourier transform to both sides of this equation (assuming that the functions
$u$ and $f$ are sufficiently regular), by relation (18.105) we obtain the algebraic
equation

$$\bigl( a_0 (i\xi)^n + a_1 (i\xi)^{n-1} + \cdots + a_n \bigr)\,\hat u(\xi) = \hat f(\xi)$$

for $\hat u$. After finding $\hat u(\xi) = \frac{\hat f(\xi)}{P(i\xi)}$ from the equation, where
$P(i\xi)$ denotes the polynomial in parentheses, we obtain $u(x)$ by applying the
inverse Fourier transform.

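We now apply this idea to the search for a function $u = u(x, t)$ satisfying the
one-dimensional wave equation

$$\frac{\partial^2 u}{\partial t^2} = a^2 \frac{\partial^2 u}{\partial x^2} \quad (a > 0)$$

and the initial conditions

$$u(x, 0) = f(x), \quad \frac{\partial u}{\partial t}(x, 0) = g(x)$$

in $\mathbb{R} \times \mathbb{R}$.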

Here and in the next example we shall not take the time to justify the intermediate 
computations because, as a rule, it is easier simply to find the required function and 
verify directly that it solves the problem posed than to justify and overcome all the 
technical difficulties that arise along the way. As it happens, generalized functions, 
which have already been mentioned, play an essential role in the theoretical struggle 
with these difficulties. 

Thus, regarding $t$ as a parameter, we carry out a Fourier transform on $x$ on both
sides of the equation. Then, assuming on the one hand that differentiation with
respect to the parameter under the integral sign is permitted and using formula
(18.105) on the other hand, we obtain

$$\hat u''_{tt}(\xi, t) = -a^2 \xi^2\,\hat u(\xi, t),$$

from which we find

$$\hat u(\xi, t) = A(\xi) \cos a\xi t + B(\xi) \sin a\xi t.$$

By the initial conditions, we have

$$\hat u(\xi, 0) = \hat f(\xi) = A(\xi),$$
$$\hat u'_t(\xi, 0) = \widehat{(u'_t)}(\xi, 0) = \hat g(\xi) = a\xi B(\xi).$$

Thus,

$$\hat u(\xi, t) = \hat f(\xi) \cos a\xi t + \frac{\hat g(\xi)}{a\xi} \sin a\xi t =
\frac{1}{2} \hat f(\xi) \bigl( e^{ia\xi t} + e^{-ia\xi t} \bigr) + \frac{1}{2} \frac{\hat g(\xi)}{ia\xi} \bigl( e^{ia\xi t} - e^{-ia\xi t} \bigr).$$

Multiplying this equality by $\frac{1}{\sqrt{2\pi}} e^{ix\xi}$ and integrating with respect to
$\xi$ - in short, taking the inverse Fourier transform - and using formula (18.105)
we obtain immediately

$$u(x, t) = \frac{1}{2} \bigl( f(x - at) + f(x + at) \bigr) + \frac{1}{2} \int_0^t \bigl( g(x - a\tau) + g(x + a\tau) \bigr)\,d\tau.$$
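
In keeping with the remark that it is easier to verify the answer directly, here
is a sketch that evaluates the formula just obtained and checks the wave equation
by central differences at one sample point. The initial data `f` and `g`, the
value of `a`, the step `d`, and the sample point are illustrative assumptions;
the integral over $\tau$ is approximated by the midpoint rule.

```python
import math

a = 2.0   # wave speed (illustrative value)


def f(x):
    # Initial displacement (an assumption, not from the text).
    return math.exp(-x * x)


def g(x):
    # Initial velocity (an assumption, not from the text).
    return x * math.exp(-x * x)


def u(x, t, n=400):
    """The solution formula above; the tau-integral uses the midpoint rule."""
    h = t / n
    integral = h * sum(
        g(x - a * (j + 0.5) * h) + g(x + a * (j + 0.5) * h) for j in range(n)
    )
    return 0.5 * (f(x - a * t) + f(x + a * t)) + 0.5 * integral


# Central-difference check of u_tt = a^2 u_xx at one sample point.
x0, t0, d = 0.3, 0.4, 1e-3
u_tt = (u(x0, t0 + d) - 2.0 * u(x0, t0) + u(x0, t0 - d)) / d**2
u_xx = (u(x0 + d, t0) - 2.0 * u(x0, t0) + u(x0 - d, t0)) / d**2
print(u_tt - a * a * u_xx)   # small compared with u_tt itself
```

The residual is limited by the finite-difference step, so it is small but not at
rounding-error level.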


b. The Heat Equation 

Another element of the machinery of Fourier transforms (specifically, formulas
(18.114') and (18.115')) which remained in the background in the preceding
example, manifests itself quite clearly when we seek a function $u = u(x, t)$,
$x \in \mathbb{R}^n$, $t \ge 0$, that satisfies the heat equation

$$\frac{\partial u}{\partial t} = a^2 \Delta u \quad (a > 0)$$

and the initial condition $u(x, 0) = f(x)$ on all of $\mathbb{R}^n$.

Here, as always, $\Delta = \frac{\partial^2}{\partial x_1^2} + \cdots + \frac{\partial^2}{\partial x_n^2}$.

Carrying out a Fourier transform with respect to the variable $x \in \mathbb{R}^n$ (assuming
that this is possible to do) we find by (18.105) the ordinary equation

$$\frac{\partial \hat u}{\partial t}(\xi, t) = a^2 (i)^2 (\xi_1^2 + \cdots + \xi_n^2)\,\hat u(\xi, t),$$

from which it follows that

$$\hat u(\xi, t) = c(\xi) e^{-a^2 |\xi|^2 t},$$

where $|\xi|^2 = \xi_1^2 + \cdots + \xi_n^2$. Taking into account the relation
$\hat u(\xi, 0) = \hat f(\xi)$, we find

$$\hat u(\xi, t) = \hat f(\xi) \cdot e^{-a^2 |\xi|^2 t}.$$

Now applying the inverse Fourier transform, taking account of (18.114'), we
obtain

$$u(x, t) = (2\pi)^{-n/2} \int_{\mathbb{R}^n} f(y) E_0(y - x, t)\,dy,$$

where $E_0(x, t)$ is the function whose Fourier transform with respect to $x$ is
$e^{-a^2 |\xi|^2 t}$. The inverse Fourier transform with respect to $\xi$ of the function
$e^{-a^2 |\xi|^2 t}$ is essentially already known to us from Example 10. Making an obvious
change of variable, we find

$$E_0(x, t) = \frac{1}{(2\pi)^{n/2}} \Bigl( \frac{\sqrt{\pi}}{a\sqrt{t}} \Bigr)^n e^{-\frac{|x|^2}{4a^2 t}}.$$

Setting $E(x, t) = (2\pi)^{-n/2} E_0(x, t)$, we find the fundamental solution

$$E(x, t) = (2a\sqrt{\pi t})^{-n} e^{-\frac{|x|^2}{4a^2 t}} \quad (t > 0),$$

of the heat equation, which was already familiar to us (see Example 15 of
Sect. 17.4), and the formula

$$u(x, t) = (f * E)(x, t)$$

for the solution satisfying the initial condition $u(x, 0) = f(x)$.
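
The formula $u = f * E$ can be tried out numerically for $n = 1$. The sketch below
assumes the Gaussian initial condition $f(x) = e^{-x^2/2}$, for which the
convolution has the closed form $(1 + 2a^2 t)^{-1/2} \exp\bigl(-x^2 / (2(1 + 2a^2 t))\bigr)$;
the function names, the value of `a`, and the quadrature parameters are
illustrative assumptions, not taken from the text.

```python
import math

a = 1.0   # coefficient in u_t = a^2 u_xx (illustrative value)


def E(x, t):
    """Fundamental solution of the heat equation for n = 1."""
    return math.exp(-x * x / (4.0 * a * a * t)) / (2.0 * a * math.sqrt(math.pi * t))


def f(x):
    # Initial temperature (an assumption, not from the text).
    return math.exp(-x * x / 2.0)


def u(x, t, L=12.0, n=2400):
    """(f * E)(x, t), with the y-integral approximated by the midpoint rule on [-L, L]."""
    h = 2.0 * L / n
    return h * sum(
        f(y) * E(x - y, t) for y in (-L + (j + 0.5) * h for j in range(n))
    )


def u_exact(x, t):
    """Closed form of f * E for this particular Gaussian initial condition."""
    s = 1.0 + 2.0 * a * a * t
    return math.exp(-x * x / (2.0 * s)) / math.sqrt(s)


print(u(0.7, 0.5), u_exact(0.7, 0.5))
```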


c. The Poisson Summation Formula 

This is the name given to the following relation

$$\sqrt{2\pi} \sum_{n=-\infty}^{\infty} \varphi(2\pi n) = \sum_{n=-\infty}^{\infty} \hat\varphi(n) \tag{18.116}$$

between a function $\varphi : \mathbb{R} \to \mathbb{C}$ (assume $\varphi \in S$) and its Fourier transform
$\hat\varphi$. Formula (18.116) is obtained by setting $x = 0$ in the equality

$$\sqrt{2\pi} \sum_{n=-\infty}^{\infty} \varphi(x + 2\pi n) = \sum_{n=-\infty}^{\infty} \hat\varphi(n) e^{inx}, \tag{18.117}$$

which we shall prove assuming that $\varphi$ is a rapidly decreasing function.

Proof Since $\varphi$ and $\hat\varphi$ both belong to $S$, the series on both sides of (18.117)
converge absolutely (and so they can be summed in any order), and uniformly with
respect to $x$ on the entire line $\mathbb{R}$. Moreover, since the derivatives of a rapidly
decreasing function are themselves in class $S$, we can conclude that the function
$f(x) = \sum_{n=-\infty}^{\infty} \varphi(x + 2\pi n)$ belongs to $C^{(\infty)}(\mathbb{R}, \mathbb{C})$. The function $f$ is
obviously of period $2\pi$. Let $\{c_k(f)\}$ be its Fourier coefficients in the
orthonormal system $\bigl\{ \frac{1}{\sqrt{2\pi}} e^{ikx};\ k \in \mathbb{Z} \bigr\}$, then

$$c_k(f) := \frac{1}{\sqrt{2\pi}} \int_0^{2\pi} f(x) e^{-ikx}\,dx =
\sum_{n=-\infty}^{\infty} \frac{1}{\sqrt{2\pi}} \int_0^{2\pi} \varphi(x + 2\pi n) e^{-ikx}\,dx =$$
$$= \sum_{n=-\infty}^{\infty} \frac{1}{\sqrt{2\pi}} \int_{2\pi n}^{2\pi(n+1)} \varphi(x) e^{-ikx}\,dx =
\frac{1}{\sqrt{2\pi}} \int_{-\infty}^{\infty} \varphi(x) e^{-ikx}\,dx =: \hat\varphi(k).$$

But $f$ is a smooth $2\pi$-periodic function and so its Fourier series converges to
it at every point $x \in \mathbb{R}$. Hence, at every point $x \in \mathbb{R}$ we have the relation

$$\sum_{n=-\infty}^{\infty} \varphi(x + 2\pi n) = f(x) = \sum_{n=-\infty}^{\infty} c_n(f) \frac{e^{inx}}{\sqrt{2\pi}} =
\frac{1}{\sqrt{2\pi}} \sum_{n=-\infty}^{\infty} \hat\varphi(n) e^{inx}. \qquad □$$

Remark 7 As can be seen from the proof, relations (18.116) and (18.117) by no
means hold only for functions of class $S$. But if $\varphi$ does happen to belong to
$S$, then Eq. (18.117) can be differentiated termwise with respect to $x$ any number
of times, yielding as a corollary new relations between $\varphi, \varphi', \ldots$ and $\hat\varphi$.
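
Relations (18.116) and (18.117) are easy to test numerically on the Gaussian of
Example 7, whose Fourier transform is known in closed form. In the sketch below
the value `a = 0.3`, the truncation index `N`, and the function names are
illustrative assumptions; the infinite sums are replaced by partial sums over
$|n| \le N$, which is harmless here because the terms decay very rapidly.

```python
import cmath
import math

a = 0.3   # parameter of the Gaussian from Example 7 (illustrative value)


def phi(x):
    return math.exp(-a * a * x * x)


def phi_hat(xi):
    # Fourier transform of phi, taken from Example 7.
    return math.exp(-xi * xi / (4.0 * a * a)) / (math.sqrt(2.0) * a)


def lhs(x, N=50):
    """sqrt(2*pi) times the sum over |n| <= N of phi(x + 2*pi*n)."""
    return math.sqrt(2.0 * math.pi) * sum(
        phi(x + 2.0 * math.pi * n) for n in range(-N, N + 1)
    )


def rhs(x, N=50):
    """Sum over |n| <= N of phi_hat(n) * exp(i*n*x)."""
    return sum(phi_hat(n) * cmath.exp(1j * n * x) for n in range(-N, N + 1))


print(lhs(0.0), rhs(0.0))   # the two sides of (18.116)
print(lhs(1.1), rhs(1.1))   # the two sides of (18.117) at x = 1.1
```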


d. Kotel’nikov’s Theorem (Whittaker-Shannon Sampling Theorem) 25

This example, based like the preceding one on a beautiful combination of the Fourier 
series and the Fourier integral, has a direct relation to the theory of information 
transmission in a communication channel. To keep it from appearing artificial, we 
recall that because of the limited capabilities of our sense organs, we are able to 
perceive signals only in a certain range of frequencies. For example, the ear “hears” 
in the range from 20 Hz to 20 kHz. Thus, no matter what the signals are, we, like 
a filter (see Sect. 18.3.1) cut out only a bounded part of their spectra and perceive 
them as band-limited signals (having a bounded spectrum). 

For that reason, we shall assume from the outset that the transmitted or received
signal $f(t)$ (where $t$ is time, $-\infty < t < \infty$) is band-limited, the spectrum being
nonzero only for frequencies whose magnitudes do not exceed a certain critical
value $a > 0$. Thus $\hat f(\omega) \equiv 0$ for $|\omega| > a$, and so for a band-limited function the
representation

$$f(t) = \frac{1}{\sqrt{2\pi}} \int_{-\infty}^{\infty} \hat f(\omega) e^{i\omega t}\,d\omega$$

reduces to the integral over just the interval $[-a, a]$:

$$f(t) = \frac{1}{\sqrt{2\pi}} \int_{-a}^{a} \hat f(\omega) e^{i\omega t}\,d\omega. \tag{18.118}$$

On the closed interval $[-a, a]$ we expand the function $\hat f(\omega)$ in a Fourier series

$$\hat f(\omega) = \sum_{k=-\infty}^{\infty} c_k(\hat f)\,e^{i\frac{\pi\omega}{a}k} \tag{18.119}$$

in the system $\bigl\{ e^{i\frac{\pi\omega}{a}k};\ k \in \mathbb{Z} \bigr\}$ which is orthogonal and complete in that
interval. Taking account of formula (18.118), we find the following simple
expression for the coefficients $c_k(\hat f)$ of this series:

$$c_k(\hat f) := \frac{1}{2a} \int_{-a}^{a} \hat f(\omega) e^{-i\frac{\pi\omega}{a}k}\,d\omega = \frac{\sqrt{2\pi}}{2a} f\Bigl( -\frac{\pi}{a} k \Bigr). \tag{18.120}$$

25 V.A. Kotel’nikov (b. 1908) - Soviet scholar, a well-known specialist in the theory of radio 
communication. 

J.M. Whittaker (1905-1984) - British mathematician who worked mainly in complex analysis. 
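C.E. Shannon (1916-2001) - American mathematician and engineer, one of the founders of
information theory and inventor of the term “bit” as an abbreviation of “binary digit”.

Substituting the series (18.119) into the integral (18.118), taking account of
relations (18.120), we find

$$f(t) = \frac{1}{\sqrt{2\pi}} \int_{-a}^{a} \Bigl( \frac{\sqrt{2\pi}}{2a} \sum_{k=-\infty}^{\infty} f\Bigl( \frac{\pi}{a} k \Bigr) e^{-i\frac{\pi\omega}{a}k} \Bigr) e^{i\omega t}\,d\omega =
\frac{1}{2a} \sum_{k=-\infty}^{\infty} f\Bigl( \frac{\pi}{a} k \Bigr) \int_{-a}^{a} e^{i\omega(t - \frac{\pi}{a}k)}\,d\omega.$$

Calculating these elementary integrals, we arrive at Kotel’nikov’s formula

$$f(t) = \sum_{k=-\infty}^{\infty} f\Bigl( \frac{\pi}{a} k \Bigr) \frac{\sin a\bigl( t - \frac{\pi}{a} k \bigr)}{a\bigl( t - \frac{\pi}{a} k \bigr)}. \tag{18.121}$$

Formula (18.121) shows that, in order to reconstruct a message described by a
band-limited function $f(t)$ whose spectrum is concentrated in the frequency range
$|\omega| \le a$, it suffices to transmit over the channel only the values $f(k\Delta)$ (called
marker values) of the function at equal time intervals $\Delta = \pi/a$.

This proposition, together with formula (18.121) is due to V.A. Kotel’nikov and
is called Kotel’nikov’s theorem or the sampling theorem.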
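
To see formula (18.121) at work, one can reconstruct a band-limited signal from
its marker values. The sketch below assumes the signal $f(t) = (\sin t / t)^2$,
whose spectrum is concentrated in $[-2, 2]$, so that $a = 2$ and $\Delta = \pi/2$; the
function names and the truncation index `N` are illustrative assumptions, and the
series is replaced by the partial sum over $|k| \le N$.

```python
import math

a = 2.0                 # band limit: the spectrum of f vanishes for |omega| > a
delta = math.pi / a     # sampling interval from the text


def f(t):
    # (sin t / t)^2 has spectrum supported in [-2, 2]; chosen as an illustration.
    return 1.0 if t == 0.0 else (math.sin(t) / t) ** 2


def kernel(s):
    """sin(s)/s, extended by continuity to s = 0."""
    return 1.0 if s == 0.0 else math.sin(s) / s


def reconstruct(t, N=500):
    """Partial sum over |k| <= N of Kotel'nikov's series (18.121)."""
    return sum(
        f(k * delta) * kernel(a * (t - k * delta)) for k in range(-N, N + 1)
    )


t = 0.7
print(f(t), reconstruct(t))
```

Because only finitely many samples are used, the two printed values agree only
approximately; the discrepancy shrinks as `N` grows.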

Remark 8 The interpolation formula (18.121) itself was known in mathematics before
Kotel’nikov’s 1933 paper, but this paper was the first to point out the fundamental
significance of the expansion (18.121) for the theory of transmission of continuous
messages over a communication channel. The idea of the derivation of formula
(18.121) given above is also due to Kotel’nikov. In the general case this question
was later studied by the outstanding American engineer and mathematician Claude
Shannon, whose work in 1948 laid the foundations of information theory.

Remark 9 In reality the transmission and receiving time of a communication is also
limited, so that instead of the entire series (18.121) we take one of its partial
sums $\sum_{k=-N}^{N}$. Special research has been devoted to estimating the errors that
thereby arise.

Remark 10 If we assume that the amount of information transmitted over the
communication channel is proportional to the number of marker values, then
according to formula (18.121) the communication channel capacity is proportional to
its frequency bandwidth.


18.3.5 Problems and Exercises 

1. a) Write out the proof of relations (18.93)-(18.96) in detail.

b) Regarding the Fourier transform as a mapping $f \mapsto \hat f$, show that it has the
following frequently used properties:

$$f(at) \mapsto \frac{1}{a} \hat f\Bigl( \frac{\omega}{a} \Bigr)$$

(the change of scale rule);

$$f(t - t_0) \mapsto \hat f(\omega) e^{-i\omega t_0}$$

(time shift of the input signal - the Fourier pre-image - or the translation theorem)

$$[f(t + t_0) \pm f(t - t_0)] \mapsto \begin{cases} \hat f(\omega)\,2\cos\omega t_0, \\ \hat f(\omega)\,2i\sin\omega t_0; \end{cases}$$

$$f(t) e^{\pm i\omega_0 t} \mapsto \hat f(\omega \mp \omega_0)$$

(frequency shift of the Fourier transform);

$$f(t) \cos\omega_0 t \mapsto \frac{1}{2} [\hat f(\omega - \omega_0) + \hat f(\omega + \omega_0)],$$

$$f(t) \sin\omega_0 t \mapsto \frac{1}{2i} [\hat f(\omega - \omega_0) - \hat f(\omega + \omega_0)]$$

(amplitude modulation of a harmonic signal);

$$f(t) \sin^2 \frac{\omega_0 t}{2} \mapsto \frac{1}{4} [2\hat f(\omega) - \hat f(\omega - \omega_0) - \hat f(\omega + \omega_0)].$$
c) Find the Fourier transforms (or, as we say, the Fourier images) of the following
functions:

$$\Pi_A(t) = \begin{cases} \frac{1}{2A} & \text{for } |t| \le A, \\ 0 & \text{for } |t| > A \end{cases}$$

(the rectangular pulse);

$$\Pi_A(t) \cos\omega_0 t$$

(a harmonic signal modulated by a rectangular pulse);

$$\Pi_A(t + 2A) + \Pi_A(t - 2A)$$

(two rectangular pulses of the same polarity);

$$\Pi_A(t - A) - \Pi_A(t + A)$$

(two rectangular pulses of opposite polarity);

$$\Lambda_A(t) = \begin{cases} \frac{1}{A} \bigl( 1 - \frac{|t|}{A} \bigr) & \text{for } |t| \le A, \\ 0 & \text{for } |t| > A \end{cases}$$

(a triangular pulse);

$\cos at^2$ and $\sin at^2$ $(a > 0)$;

$|t|^{-\frac{1}{2}}$ and $|t|^{-\frac{1}{2}} e^{-a|t|}$ $(a > 0)$.

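d) Find the Fourier pre-images of the following functions:

$$\operatorname{sinc} \frac{\omega A}{\pi}, \quad 2i\,\frac{\sin^2 \omega A}{\omega A}, \quad 2 \operatorname{sinc}^2 \frac{\omega A}{\pi},$$

where $\operatorname{sinc} \frac{x}{\pi} := \frac{\sin x}{x}$ is the sample function (cardinal sine).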

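e) Using the preceding results, find the values of the following integrals, which
we have already encountered:

$$\int_{-\infty}^{\infty} \frac{\sin x}{x}\,dx, \quad \int_{-\infty}^{\infty} \frac{\sin^2 x}{x^2}\,dx, \quad
\int_{-\infty}^{\infty} \cos x^2\,dx, \quad \int_{-\infty}^{\infty} \sin x^2\,dx.$$

f) Verify that the Fourier integral of a function $f(t)$ can be written in any of
the following forms:

$$\frac{1}{\sqrt{2\pi}} \int_{-\infty}^{\infty} \hat f(\omega) e^{it\omega}\,d\omega =
\frac{1}{2\pi} \int_{-\infty}^{\infty} d\omega \int_{-\infty}^{\infty} f(x) e^{-i\omega(x - t)}\,dx =
\frac{1}{\pi} \int_0^{\infty} d\omega \int_{-\infty}^{\infty} f(x) \cos\omega(x - t)\,dx.$$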

2. Let $f = f(x, y)$ be a solution of the two-dimensional Laplace equation
$\frac{\partial^2 f}{\partial x^2} + \frac{\partial^2 f}{\partial y^2} = 0$ in the half-plane $y \ge 0$ satisfying the conditions
$f(x, 0) = g(x)$ and $f(x, y) \to 0$ as $y \to +\infty$ for every $x \in \mathbb{R}$.

a) Verify that the Fourier transform $\hat f(\xi, y)$ of $f$ on the variable $x$ has the
form $\hat g(\xi) e^{-y|\xi|}$.

b) Find the Fourier pre-image of the function $e^{-y|\xi|}$ on the variable $\xi$.

c) Now obtain the representation of the function $f$ as a Poisson integral

$$f(x, y) = \frac{1}{\pi} \int_{-\infty}^{\infty} \frac{y}{(x - \xi)^2 + y^2}\,g(\xi)\,d\xi,$$

which we have met in Example 5 of Sect. 17.4.

3. We recall that the $n$th moment of the function $f : \mathbb{R} \to \mathbb{C}$ is the quantity
$M_n(f) = \int_{-\infty}^{\infty} x^n f(x)\,dx$. In particular, if $f$ is the density of a probability
distribution, that is, $f(x) \ge 0$ and $\int_{-\infty}^{\infty} f(x)\,dx = 1$, then $x_0 = M_1(f)$ is the
mathematical expectation of a random variable $x$ with the distribution $f$ and the
variance $\sigma^2 := \int_{-\infty}^{\infty} (x - x_0)^2 f(x)\,dx$ of this random variable can be
represented as $\sigma^2 = M_2(f) - M_1^2(f)$.

Consider the Fourier transform

$$\hat f(\xi) = \int_{-\infty}^{\infty} f(x) e^{-i\xi x}\,dx$$

of the function $f$. By expanding $e^{-i\xi x}$ in a series, show that

a) $\hat f(\xi) = \sum_{n=0}^{\infty} \frac{(-i)^n M_n(f)}{n!} \xi^n$ if, for example, $f \in S$.

b) $M_n(f) = (i)^n \hat f^{(n)}(0)$, $n = 0, 1, \ldots$

c) Now let $f$ be real-valued, and let $\hat f(\xi) = A(\xi) e^{i\varphi(\xi)}$, where $A(\xi)$ is the
absolute value of $\hat f(\xi)$ and $\varphi(\xi)$ is its argument; then $A(\xi) = A(-\xi)$ and
$\varphi(-\xi) = -\varphi(\xi)$. To normalize the problem, assume that $\int_{-\infty}^{\infty} f(x)\,dx = 1$. Verify
that in that case

$$\hat f(\xi) = 1 + i\varphi'(0)\xi + \frac{A''(0) - (\varphi'(0))^2}{2} \xi^2 + o(\xi^2) \quad (\xi \to 0)$$

and

$$x_0 := M_1(f) = -\varphi'(0), \quad \text{and} \quad \sigma^2 = M_2(f) - M_1^2(f) = -A''(0).$$

4. a) Verify that the function $e^{-a|x|}$ $(a > 0)$, like all its derivatives, which are
defined for $x \ne 0$, decreases at infinity faster than any negative power of $|x|$ and
yet this function does not belong to the class $S$.

b) Verify that the Fourier transform of this function is infinitely differentiable on
$\mathbb{R}$, but does not belong to $S$ (and all because $e^{-a|x|}$ is not differentiable at $x = 0$).


5. a) Show that the functions of class $S$ are dense in the space $\mathcal{R}_2(\mathbb{R}^n, \mathbb{C})$ of
functions $f : \mathbb{R}^n \to \mathbb{C}$ whose squares are absolutely integrable, endowed with
the inner product $\langle f, g \rangle = \int_{\mathbb{R}^n} (f \cdot \bar g)(x)\,dx$ and the norm it generates
$\|f\| = \bigl( \int_{\mathbb{R}^n} |f|^2(x)\,dx \bigr)^{1/2}$ and the metric $d(f, g) = \|f - g\|$.

b) Now let us regard $S$ as a metric space $(S, d)$ with this metric $d$ (convergence
in the mean-square sense on $\mathbb{R}^n$). Let $L_2(\mathbb{R}^n, \mathbb{C})$ or, more briefly, $L_2$, denote the
completion of the metric space $(S, d)$ (see Sect. 9.5). Each element $f \in L_2$ is
determined by a sequence $\{\varphi_k\}$ of functions $\varphi_k \in S$ that is a Cauchy sequence in
the sense of the metric $d$.

Show that in that case the sequence $\{\hat\varphi_k\}$ of Fourier images of the functions
$\varphi_k$ is also a Cauchy sequence in $S$ and hence defines a certain element
$\hat f \in L_2$, which it is natural to call the Fourier transform of $f \in L_2$.

c) Show that a vector-space structure and an inner product can be introduced in
a natural way on $L_2$, and in these structures the Fourier transform $L_2 \to L_2$ turns
out to be a linear isometry of $L_2$ onto itself.

d) Using the example of the function $f(x) = \frac{1}{\sqrt{1 + x^2}}$ one can see that if
$f \in \mathcal{R}_2(\mathbb{R}, \mathbb{C})$ we do not necessarily have $f \in \mathcal{R}(\mathbb{R}, \mathbb{C})$. Nevertheless, if
$f \in \mathcal{R}_2(\mathbb{R}, \mathbb{C})$, then, since $f$ is locally integrable, one can consider the function

$$\hat f_A(\xi) = \frac{1}{\sqrt{2\pi}} \int_{-A}^{A} f(x) e^{-i\xi x}\,dx.$$

Verify that $\hat f_A \in C(\mathbb{R}, \mathbb{C})$ and $\hat f_A \in \mathcal{R}_2(\mathbb{R}, \mathbb{C})$.

e) Prove that $\hat f_A$ converges in $L_2$ to some element $\hat f \in L_2$ and
$\|\hat f_A\| \to \|\hat f\| = \|f\|$ as $A \to +\infty$ (this is Plancherel’s theorem 26).

6. The uncertainty principle. Let $\varphi(x)$ and $\psi(p)$ be functions of class $S$ (or
elements of the space $L_2$ of Problem 5), with $\psi = \hat\varphi$ and
$\int_{-\infty}^{\infty} |\varphi|^2(x)\,dx = \int_{-\infty}^{\infty} |\psi|^2(p)\,dp = 1$. In this case the functions $|\varphi|^2$ and
$|\psi|^2$ can be regarded as probability densities for random variables $x$ and $p$
respectively.

a) Show that by a shift in the argument of $\varphi$ (a special choice of the point
from which the argument is measured) one can obtain a new function $\varphi$ such that
$M_1(|\varphi|^2) = \int_{-\infty}^{\infty} x|\varphi|^2(x)\,dx = 0$ without changing the value of $\|\varphi\|$, and then,
without changing the relation $M_1(|\varphi|^2) = 0$ one can, by a similar shift in the
argument of $\psi$ arrange that $M_1(|\psi|^2) = \int_{-\infty}^{\infty} p|\psi|^2(p)\,dp = 0$.

b) For real values of the parameter $\alpha$ consider the quantity

$$\int_{-\infty}^{\infty} \bigl| \alpha x \varphi(x) + \varphi'(x) \bigr|^2\,dx \ge 0$$

and, using Parseval’s equality and the formula $\widehat{\varphi'}(p) = ip\,\hat\varphi(p)$, show that
$\alpha^2 M_2(|\varphi|^2) - \alpha + M_2(|\psi|^2) \ge 0$. (For the definitions of $M_1$ and $M_2$ see
Problem 3.)

c) Obtain from this the relation

$$M_2(|\varphi|^2) \cdot M_2(|\psi|^2) \ge 1/4.$$

This relation shows that the more “concentrated” the function $\varphi$ itself is, the
more “smeared” its Fourier transform, and vice versa (see Examples 1 and 7 and
Problem 7b).

In quantum mechanics this relation, called the uncertainty principle, assumes
a specific physical meaning. For example, it is impossible to measure precisely
both the coordinate of a quantum particle and its momentum. This fundamental
fact (called Heisenberg’s 27 uncertainty principle), is mathematically the same as
the relation between $M_2(|\varphi|^2)$ and $M_2(|\psi|^2)$ found above.

The next three problems give an elementary picture of the Fourier transform of 
generalized functions. 

7. a) Using Example 1, find the spectrum of the signal expressed by the functions

$$\Delta_{\alpha}(t) = \begin{cases} \frac{1}{2\alpha} & \text{for } |t| \le \alpha, \\ 0 & \text{for } |t| > \alpha. \end{cases}$$

b) Examine the variation of the function $\Delta_{\alpha}(t)$ and its spectrum as $\alpha \to +0$
and tell what, in your opinion, should be regarded as the spectrum of a unit pulse,
expressed by the $\delta$-function.


26 M. Plancherel (1885-1967) - Swiss mathematician. 

27 W. Heisenberg (1901-1976) - German physicist, one of the founders of quantum mechanics.

c) Using Example 2, now find the signal $\varphi(t)$ emerging from an ideal
low-frequency filter (with upper frequency limit $a$) in response to a unit pulse $\delta(t)$.

d) Using the result just obtained, now explain the physical meaning of the terms
in the Kotel’nikov series (18.121) and propose a theoretical scheme for transmitting
a band-limited signal $f(t)$, based on Kotel’nikov’s formula (18.121).

8. The space of L. Schwartz. Verify that

a) If $\varphi \in S$ and $P$ is a polynomial, then $(P \cdot \varphi) \in S$.

b) If $\varphi \in S$, then $D^\alpha\varphi \in S$ and $D^\beta(P D^\alpha\varphi) \in S$, where $\alpha$ and $\beta$ are nonnegative
multi-indices and $P$ is a polynomial.

c) We introduce the following notion of convergence in $S$. A sequence $\{\varphi_k\}$ of
functions $\varphi_k \in S$ converges to zero if for all nonnegative multi-indices $\alpha$ and $\beta$ the
sequence of functions $\{x^\beta D^\alpha\varphi_k(x)\}$ converges uniformly to zero on $\mathbb{R}^n$. The relation
$\varphi_k \to \varphi \in S$ will mean that $(\varphi - \varphi_k) \to 0$ in $S$.

The vector space $S$ of rapidly decreasing functions with this convergence is
called the Schwartz space.

Show that if $\varphi_k \to \varphi$ in $S$, then $\hat\varphi_k \to \hat\varphi$ in $S$ as $k \to \infty$. Thus the Fourier transform
is a continuous linear operator on the Schwartz space.

9. The space $S'$ of tempered distributions. The continuous linear functionals defined
on the space $S$ of rapidly decreasing functions are called tempered distributions. The
vector space of such functionals (the conjugate of $S$) is denoted $S'$. The value of the
functional $F \in S'$ on a function $\varphi \in S$ will be denoted $F(\varphi)$.

a) Let $P : \mathbb{R}^n \to \mathbb{C}$ be a polynomial in $n$ variables and $f : \mathbb{R}^n \to \mathbb{C}$ a locally
integrable function admitting the estimate $|f(x)| \le |P(x)|$ at infinity (that is, it may
increase as $x \to \infty$, but only moderately: not faster than power growth). Show that
$f$ can then be regarded as a (regular) element of $S'$ if we set

$$f(\varphi) := \int_{\mathbb{R}^n} f(x)\varphi(x)\,dx \quad (\varphi \in S).$$

b) Multiplication of a tempered distribution $F \in S'$ by an ordinary function $f :
\mathbb{R}^n \to \mathbb{C}$ is defined, as always, by the relation $(fF)(\varphi) := F(f\varphi)$. Verify that for
tempered distributions multiplication is well defined, not only by functions $f \in S$,
but also by polynomials $P : \mathbb{R}^n \to \mathbb{C}$.

c) Differentiation of tempered distributions $F \in S'$ is defined in the traditional
way: $(D^\alpha F)(\varphi) := (-1)^{|\alpha|} F(D^\alpha\varphi)$.

Show that this is correctly defined, that is, if $F \in S'$, then $D^\alpha F \in S'$ for every
nonnegative integer multi-index $\alpha = (\alpha_1, \ldots, \alpha_n)$.

d) If $f$ and $\varphi$ are sufficiently regular functions (for example, functions in $S$),
then, as relation (18.113) shows, the following equality holds:

$$\hat f(\varphi) = \int_{\mathbb{R}^n} \hat f(x)\varphi(x)\,dx = \int_{\mathbb{R}^n} f(x)\hat\varphi(x)\,dx = f(\hat\varphi).$$

This equality (Parseval’s equality) is made the basis of the definition of the
Fourier transform $\hat F$ of a tempered distribution $F \in S'$. By definition we set
$\hat F(\varphi) := F(\hat\varphi)$.

Due to the invariance of $S$ under the Fourier transform, this definition is correct
for every element $F \in S'$.

Show that it is not correct for generalized functions in $\mathcal{D}'(\mathbb{R}^n)$ mapping the space
$\mathcal{D}(\mathbb{R}^n)$ of smooth functions of compact support. This fact explains the role of the
Schwartz space $S$ in the theory of the Fourier transform and its application to
generalized functions.

e) In Problem 7 we acquired a preliminary idea of the Fourier transform of the
$\delta$-function. The Fourier transform of the $\delta$-function could have been sought directly
from the definition of the Fourier transform of a regular function. In that case we
would have found that

$$\hat\delta(\xi) = \frac{1}{(2\pi)^{n/2}} \int_{\mathbb{R}^n} \delta(x)e^{-i(\xi,x)}\,dx = \frac{1}{(2\pi)^{n/2}}.$$

Now show that when we seek the Fourier transform of the tempered distribution
$\delta \in S'(\mathbb{R}^n)$ correctly, that is, starting from the equality $\hat\delta(\varphi) = \delta(\hat\varphi)$, the result (still
the same) is that $\hat\delta(\varphi) = \hat\varphi(0)$, that is, $\hat\delta = \frac{1}{(2\pi)^{n/2}}$. (One can renormalize the Fourier transform
so that this constant equals 1; see Problem 10.)

f) Convergence in $S'$, as always in generalized functions, is understood in
the following sense: ($F_n \to F$ in $S'$ as $n \to \infty$) := ($\forall\varphi \in S$ ($F_n(\varphi) \to F(\varphi)$ as
$n \to \infty$)).

Verify the Fourier inversion formula (the Fourier integral formula) for the
$\delta$-function:

$$\delta(x) = \lim_{A\to+\infty} \frac{1}{(2\pi)^{n/2}} \int_{-A}^{A}\cdots\int_{-A}^{A} \hat\delta(\xi)e^{i(x,\xi)}\,d\xi.$$


g) Let $\delta(x - x_0)$, as usual, denote the shift of the $\delta$-function to the point $x_0$, that
is, $\delta(x - x_0)(\varphi) = \varphi(x_0)$. Verify that the series

$$\sum_{n=-\infty}^{\infty} \delta(x - n) \quad \Bigl(= \lim_{N\to\infty} \sum_{n=-N}^{N} \delta(x - n)\Bigr)$$

converges in $S'(\mathbb{R})$. (Here $\delta \in S'(\mathbb{R})$ and $n \in \mathbb{Z}$.)

h) Using the possibility of differentiating a convergent series of generalized
functions termwise and taking account of the equality from Problem 13f of
Sect. 18.2, show that if $F = \sum_{n=-\infty}^{\infty} \delta(x - n)$, then

$$\hat F = \sqrt{2\pi} \sum_{n=-\infty}^{\infty} \delta(x - 2\pi n).$$

i) Using the relation $\hat F(\varphi) = F(\hat\varphi)$, obtain the Poisson summation formula from
the preceding result.
j) Prove the following relation (the $\theta$-formula)

$$\sum_{n=-\infty}^{\infty} e^{-tn^2} = \sqrt{\frac{\pi}{t}} \sum_{n=-\infty}^{\infty} e^{-\frac{\pi^2}{t}n^2} \quad (t > 0),$$

which plays an important role in the theory of elliptic functions and the theory of
heat conduction.
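Before proving the $\theta$-formula it can be reassuring to test it numerically. The sketch below assumes the formula in the form $\sum_n e^{-tn^2} = \sqrt{\pi/t}\,\sum_n e^{-\pi^2 n^2/t}$ and compares truncated sums of the two sides; the function names `theta_lhs` and `theta_rhs` and the truncation parameter `terms` are illustrative choices, not notation from the text.

```python
import math


def theta_lhs(t, terms=200):
    """Truncated left-hand side: sum of exp(-t n^2) over |n| <= terms."""
    return sum(math.exp(-t * n * n) for n in range(-terms, terms + 1))


def theta_rhs(t, terms=200):
    """Truncated right-hand side: sqrt(pi/t) * sum of exp(-pi^2 n^2 / t)."""
    s = sum(math.exp(-(math.pi ** 2) * n * n / t)
            for n in range(-terms, terms + 1))
    return math.sqrt(math.pi / t) * s


if __name__ == "__main__":
    for t in (0.1, 1.0, 10.0):
        print(t, theta_lhs(t), theta_rhs(t))
```

For small $t$ the left-hand series converges slowly while the right-hand one converges very fast, and for large $t$ the roles are reversed; this is what makes the identity useful in practice.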


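10. If the Fourier transform $\check{\mathcal{F}}[f]$ of a function $f : \mathbb{R} \to \mathbb{C}$ is defined by the formulas

$$\check f(\nu) := \check{\mathcal{F}}[f](\nu) := \int_{-\infty}^{\infty} f(t)e^{-2\pi i\nu t}\,dt,$$

many of the formulas relating to the Fourier transform become particularly simple
and elegant.

a) Verify that $\hat f(u) = \frac{1}{\sqrt{2\pi}}\check f\bigl(\frac{u}{2\pi}\bigr)$.

b) Show that $\check{\mathcal{F}}[\check{\mathcal{F}}[f]](t) = f(-t)$, that is,

$$f(t) = \int_{-\infty}^{\infty} \check f(\nu)e^{2\pi i\nu t}\,d\nu.$$

This is the most natural form of the expansion of $f$ in harmonics of different
frequencies $\nu$, and $\check f(\nu)$ in this expansion is the frequency spectrum of $f$.

c) Verify that $\check\delta = 1$ and $\check 1 = \delta$.

d) Verify that the Poisson summation formula (18.116) now assumes the particularly
elegant form

$$\sum_{n=-\infty}^{\infty} \varphi(n) = \sum_{n=-\infty}^{\infty} \check\varphi(n).$$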

Chapter 19 

Asymptotic Expansions 


The majority of phenomena that we have to deal with can be characterized mathe- 
matically by a certain set of numerical parameters having rather complicated inter- 
relations. However the description of a phenomenon as a rule becomes significantly 
simpler if it is known that some of these parameters or some combination of them 
is very large or, contrariwise, very small. 

Example 1 In describing relative motions occurring with speeds $v$ that are much
smaller than the speed of light ($|v| \ll c$) we may use, instead of the Lorentz
transformations (Example 3 of Sect. 1.3)

$$x' = \frac{x - vt}{\sqrt{1 - (v/c)^2}}, \qquad t' = \frac{t - (v/c^2)x}{\sqrt{1 - (v/c)^2}},$$

the Galilean transformation

$$x' = x - vt, \qquad t' = t,$$

since $v/c \approx 0$.


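Example 2 The period

$$T = 4\sqrt{\frac{l}{g}} \int_0^{\pi/2} \frac{d\theta}{\sqrt{1 - k^2\sin^2\theta}}$$

of oscillations of a pendulum is connected with the maximal angle of deviation $\varphi_0$
from its equilibrium position via the parameter $k^2 = \sin^2\frac{\varphi_0}{2}$ (see Sect. 6.4). If the
oscillations are small, that is, $\varphi_0 \approx 0$, we obtain the simple formula

$$T \approx 2\pi\sqrt{\frac{l}{g}}$$

for the period of such oscillations.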
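To see how good the small-oscillation formula is, one can compare it with the exact period numerically. The sketch below assumes the standard expression $T = 4\sqrt{l/g}\,K(k)$ with $k = \sin(\varphi_0/2)$ and evaluates the complete elliptic integral through the classical identity $K(k) = \pi / (2\,\mathrm{AGM}(1, \sqrt{1-k^2}))$; the default values of `length` and `g` are illustrative assumptions.

```python
import math


def agm(a, b, tol=1e-14):
    """Arithmetic-geometric mean of two positive numbers."""
    while abs(a - b) > tol * a:
        a, b = (a + b) / 2.0, math.sqrt(a * b)
    return a


def period_exact(phi0, length=1.0, g=9.81):
    """Exact period for amplitude phi0 (radians), 0 <= phi0 < pi.

    Uses T = 4*sqrt(l/g)*K(k), K(k) = pi / (2*AGM(1, sqrt(1 - k^2))),
    and sqrt(1 - k^2) = cos(phi0/2) for k = sin(phi0/2).
    """
    return 2.0 * math.pi * math.sqrt(length / g) / agm(1.0, math.cos(phi0 / 2.0))


def period_small(length=1.0, g=9.81):
    """Small-oscillation approximation T = 2*pi*sqrt(l/g)."""
    return 2.0 * math.pi * math.sqrt(length / g)


if __name__ == "__main__":
    for phi0 in (0.01, 0.1, 0.5, 1.0, 2.0):
        ratio = period_exact(phi0) / period_small()
        print(f"phi0 = {phi0:4.2f} rad   T_exact / T_small = {ratio:.6f}")
```

The ratio tends to 1 as $\varphi_0 \to 0$, and the deviation from 1 behaves like $\varphi_0^2/16$ for small amplitudes.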


© Springer-Verlag Berlin Heidelberg 2016

V.A. Zorich, Mathematical Analysis II, Universitext,

DOI 10.1007/978-3-662-48993-2_11




Example 3 Suppose a restoring force acting on a particle of mass $m$ is returning it to its
equilibrium position and that the force is proportional to the displacement (a spring with
spring constant $k$, for example). Suppose also that the resisting force of the medium
is proportional to the square of the velocity (with coefficient of proportionality $\alpha$).
The equation of motion in that case has the following form (see Sect. 5.6):

$$m\ddot x + \alpha\dot x^2 + kx = 0.$$

If the medium “rarefies”, then $\alpha \to 0$ and one may assume that the motion is
approximated by the motion described by the equation

$$m\ddot x + kx = 0$$

(harmonic oscillations with frequency $\sqrt{k/m}$), and if the medium “condenses”, then
$\alpha \to \infty$, and, dividing by $\alpha$, we find in the limit the equation $\dot x^2 = 0$, that is, $x(t) \equiv$
const.


Example 4 If $\pi(x)$ is the number of primes not larger than $x \in \mathbb{R}$, then, as is known
(see Sect. 3.2), for large $x$ the quantity $\pi(x)$ can be found with small relative error
by the formula

$$\pi(x) \approx \frac{x}{\ln x}.$$
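The phrase "small relative error" can be checked on a computer. The following sketch counts primes with a sieve of Eratosthenes and prints both the ratio $\pi(x)/(x/\ln x)$ and the absolute difference; the helper name `prime_count_table` and the chosen sample points are illustrative only.

```python
import math


def prime_count_table(limit):
    """Return a list counts with counts[x] = number of primes <= x, 0 <= x <= limit."""
    is_prime = bytearray([1]) * (limit + 1)
    is_prime[0:2] = b"\x00\x00"          # 0 and 1 are not prime
    for p in range(2, math.isqrt(limit) + 1):
        if is_prime[p]:
            # Strike out multiples of p starting from p*p.
            is_prime[p * p::p] = bytearray(len(range(p * p, limit + 1, p)))
    counts, running = [0] * (limit + 1), 0
    for x in range(limit + 1):
        running += is_prime[x]
        counts[x] = running
    return counts


if __name__ == "__main__":
    counts = prime_count_table(10 ** 5)
    for x in (10 ** 3, 10 ** 4, 10 ** 5):
        approx = x / math.log(x)
        print(x, counts[x], round(approx, 1),
              "ratio:", round(counts[x] / approx, 4),
              "difference:", round(counts[x] - approx, 1))
```

On this range the ratio moves slowly toward 1 while the absolute difference keeps growing, which is exactly the distinction between relative and absolute error.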


Example 5 It would be difficult to find more trivial, yet nevertheless important
relations than

$$\sin x \approx x \quad\text{or}\quad \ln(1 + x) \approx x,$$

in which the relative error becomes smaller as $x$ approaches 0 (see Sect. 5.3). These
relations can be made more precise if desired, namely

$$\sin x \approx x - \frac{1}{3!}x^3, \qquad \ln(1 + x) \approx x - \frac{1}{2}x^2,$$

by adjoining one or more of the following terms obtained from the Taylor series.


Thus the problem is to find a clear, convenient, and essentially correct description 
of a phenomenon being studied using the specifics of the situation that arises when 
some parameter (or combination of parameters) that characterizes the phenomenon 
is small (tends to zero) or, contrariwise, large (tends to infinity). 

Hence, we are once again essentially discussing the theory of limits. 

Problems of this type are called asymptotic problems. They arise, as one can see, 
in practically all areas of mathematics and natural science. 

The solution of an asymptotic problem usually consists of the following stages:
passing to the limit and finding the (main term of the) asymptotics, that is, a
convenient simplified description of the phenomenon; estimating the error that arises in
using the asymptotic formula so found, and determining its range of applicability;
then sharpening the main term of the asymptotics, analogous to the process of
adjoining the next term in Taylor’s formula (but far from being equally algorithmic in
every case).

The methods of solving asymptotic problems (called asymptotic methods) are 
usually closely connected with the specifics of a problem. Among the few rather 
general and at the same time elementary asymptotic methods one finds Taylor’s 
formula, one of the most important relations in differential calculus. 

The present chapter should give the reader a beginning picture of the elementary 
asymptotic methods of analysis. 

In the first section we shall introduce the general concepts and definitions relating 
to elementary asymptotic methods; in the second we shall use them in discussing 
Laplace’s method of constructing the asymptotic expansion of Laplace transforms. 
This method, which was discovered by Laplace in his research on the limit theo- 
rems of probability theory, is an important component of the saddle-point method 
later developed by Riemann, usually discussed in a course of complex analysis. 
Further information on various asymptotic methods of analysis can be found in the 
specialized books cited in the bibliography. These books also contain an extensive 
bibliography on this circle of questions. 


19.1 Asymptotic Formulas and Asymptotic Series 

19.1.1 Basic Definitions 

a. Asymptotic Estimates and Asymptotic Equalities 

For the sake of completeness we begin with some recollections and clarifications. 

Definition 1 Let $f : X \to Y$ and $g : X \to Y$ be real- or complex- or in general
vector-valued functions defined on a set $X$ and let $\mathcal{B}$ be a base in $X$. Then the
relations

$$f = O(g) \quad\text{or}\quad f(x) = O(g(x)), \quad x \in X,$$

$$f = O(g) \quad\text{or}\quad f(x) = O(g(x)) \quad\text{over the base } \mathcal{B},$$

$$f = o(g) \quad\text{or}\quad f(x) = o(g(x)) \quad\text{over the base } \mathcal{B}$$

mean by definition that in the equality $|f(x)| = \alpha(x)|g(x)|$, the real-valued function
$\alpha(x)$ is respectively bounded on $X$, ultimately bounded over the base $\mathcal{B}$, and
infinitesimal over the base $\mathcal{B}$.

These relations are usually called asymptotic estimates (of $f$).

The relation

$$f \sim g \quad\text{or}\quad f(x) \sim g(x) \quad\text{over the base } \mathcal{B},$$

which by definition means that $f(x) = g(x) + o(g(x))$ over the base $\mathcal{B}$, is usually
called asymptotic equivalence or asymptotic equality¹ of the functions over the
base $\mathcal{B}$.

Asymptotic estimates and asymptotic equalities unite in the term asymptotic for- 
mulas. 

Wherever it is not important to indicate the argument of a function the abbreviated
notations $f = o(g)$, $f = O(g)$, or $f \sim g$ are used, and we shall make systematic
use of this abbreviation.

If $f = O(g)$ and simultaneously $g = O(f)$, we write $f \asymp g$ and say that $f$ and
$g$ are quantities of the same order over the given base.

In what we are going to be doing below, $Y = \mathbb{C}$ or $Y = \mathbb{R}$, $X \subset \mathbb{C}$ or $X \subset \mathbb{R}$; $\mathcal{B}$
as a rule is one of the bases $X \ni x \to 0$ or $X \ni x \to \infty$. Using this notation one can
write in particular that

$$\cos x = O(1), \quad x \in \mathbb{R},$$

$$\cos z \ne O(1), \quad z \in \mathbb{C},$$

$$e^z = 1 + z + o(z) \quad\text{as } z \to 0,\ z \in \mathbb{C},$$

$$(1 + x)^\alpha = 1 + \alpha x + o(x) \quad\text{as } x \to 0,\ x \in \mathbb{R},$$

$$\pi(x) = \frac{x}{\ln x} + o\Bigl(\frac{x}{\ln x}\Bigr) \quad\text{as } x \to +\infty,\ x \in \mathbb{R}.$$


Remark 1 In regard to asymptotic equalities it is useful to note that they are only
limiting relations whose use is permitted for computational purposes, but only after
some additional work is done to find an estimate of the remainder. We have already
mentioned this when discussing Taylor’s formula. In addition, one must keep in
mind that asymptotic equivalence in general makes it possible to compute with small
relative error, but not small absolute error. Thus, for example, as $x \to +\infty$, the
difference $\pi(x) - \frac{x}{\ln x}$ does not tend to zero, since $\pi(x)$ jumps by 1 at each prime
integer value of $x$. At the same time, the relative error in replacing $\pi(x)$ by $\frac{x}{\ln x}$ tends
to zero:

$$\frac{o\bigl(\frac{x}{\ln x}\bigr)}{\frac{x}{\ln x}} \to 0 \quad\text{as } x \to +\infty.$$

This circumstance, as we shall see below, leads to asymptotic series that have
computational importance when one considers the relative error but not the absolute
error; for that reason these series are often divergent, in contrast to classical
series, for which the absolute value of the difference between the function being
approximated and the $n$th partial sum of the series tends to zero as $n \to +\infty$.

Let us consider some examples of ways of obtaining asymptotic formulas. 


¹ It is also useful to keep in mind the symbol $\simeq$ often used to denote asymptotic equivalence.
Example 6 The labor involved in computing the values of $n!$ or $\ln n!$ increases as
$n \in \mathbb{N}$ increases. We shall use the fact that $n$ is large, however, and obtain under that
assumption a convenient asymptotic formula for computing $\ln n!$ approximately.

It follows from the obvious relations

$$\int_1^n \ln x\,dx = \sum_{k=2}^{n} \int_{k-1}^{k} \ln x\,dx < \sum_{k=1}^{n} \ln k < \sum_{k=1}^{n} \int_{k}^{k+1} \ln x\,dx = \int_1^{n+1} \ln x\,dx$$

that

$$0 < \ln n! - \int_1^n \ln x\,dx < \int_1^2 \ln x\,dx + \int_n^{n+1} \ln x\,dx < \ln 2(n + 1).$$

But

$$\int_1^n \ln x\,dx = n(\ln n - 1) + 1 = n\ln n - (n - 1),$$

and therefore as $n \to \infty$

$$\ln n! = \int_1^n \ln x\,dx + O(\ln 2(n + 1)) = n\ln n - (n - 1) + O(\ln n) = n\ln n + O(n).$$

Since $O(n) = o(n\ln n)$ when $n \to +\infty$, the relative error of the formula $\ln n! \approx
n\ln n$ tends to zero as $n \to +\infty$.
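The estimate of Example 6 is easy to test numerically. The sketch below uses `math.lgamma(n + 1)` as a reference value for $\ln n!$, checks the two-sided bound $0 < \ln n! - (n\ln n - (n-1)) < \ln 2(n+1)$, and prints the relative error of the cruder formula $\ln n! \approx n\ln n$; the helper names are illustrative.

```python
import math


def log_factorial(n):
    """Reference value of ln(n!) via the log-gamma function."""
    return math.lgamma(n + 1)


def gap(n):
    """ln(n!) minus the integral of ln x over [1, n], i.e. minus n ln n - (n - 1)."""
    return log_factorial(n) - (n * math.log(n) - (n - 1))


def relative_error(n):
    """Relative error of the approximation ln(n!) ~ n ln n."""
    exact = log_factorial(n)
    return (n * math.log(n) - exact) / exact


if __name__ == "__main__":
    for n in (10, 100, 1000, 10 ** 6):
        print(f"n = {n:>7}  gap = {gap(n):8.4f}  "
              f"bound = {math.log(2 * (n + 1)):8.4f}  "
              f"rel. error of n ln n = {relative_error(n):.4f}")
```

The relative error does tend to zero, but only at the slow rate of roughly $1/\ln n$, which shows why one usually keeps the next term $-n$ in practice.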


Example 7 We shall show that as $x \to +\infty$ the function

$$f_n(x) = \int_1^x \frac{e^t}{t^n}\,dt \quad (n \in \mathbb{R})$$

is asymptotically equivalent to the function $g_n(x) = x^{-n}e^x$. Since $g_n(x) \to +\infty$ as
$x \to +\infty$, applying L’Hôpital’s rule we find

$$\lim_{x\to+\infty} \frac{f_n(x)}{g_n(x)} = \lim_{x\to+\infty} \frac{f_n'(x)}{g_n'(x)} = \lim_{x\to+\infty} \frac{x^{-n}e^x}{x^{-n}e^x - nx^{-n-1}e^x} = 1.$$


Example 8 Let us find the asymptotic behavior of the function

$$f(x) = \int_1^x \frac{e^t}{t}\,dt$$

more precisely. It differs from the exponential integral

$$\operatorname{Ei}(x) = \int_{-\infty}^{x} \frac{e^t}{t}\,dt$$

only by a constant term.

Integrating by parts $n$ times in the integral that defines $f$, we obtain

$$f(x) = \Bigl(\frac{e^t}{t} + \frac{e^t}{t^2} + \frac{2!\,e^t}{t^3} + \cdots + \frac{(n-1)!\,e^t}{t^n}\Bigr)\Big|_1^x + n!\int_1^x \frac{e^t}{t^{n+1}}\,dt.$$

This last integral, as shown in Example 7, is $O(x^{-(n+1)}e^x)$ as $x \to +\infty$. Including
in the term $O(x^{-(n+1)}e^x)$ the constant $-e\sum_{k=1}^{n}(k-1)!$ obtained when $t = 1$ is
substituted, we find that

$$f(x) = e^x \sum_{k=1}^{n} \frac{(k-1)!}{x^k} + O\Bigl(\frac{e^x}{x^{n+1}}\Bigr) \quad\text{as } x \to +\infty.$$

The error $O\bigl(\frac{e^x}{x^{n+1}}\bigr)$ in the approximate equality

$$f(x) \approx \sum_{k=1}^{n} \frac{(k-1)!}{x^k}\,e^x$$

is asymptotically infinitesimal compared with each term of the sum, including the
last. At the same time, as $x \to +\infty$ each successive term of the sum is infinitesimal
compared with its predecessor; therefore it is natural to write the continually sharper
sequence of such formulas as a series generated by $f$:

$$f(x) \simeq e^x \sum_{k=1}^{\infty} \frac{(k-1)!}{x^k}.$$

We note that this series obviously diverges for every value of $x \in \mathbb{R}$, so that we
cannot write

$$f(x) = e^x \sum_{k=1}^{\infty} \frac{(k-1)!}{x^k}.$$

Thus we are dealing here with a new and clearly useful asymptotic interpretation
of a series connected, in contrast with the classical case, with the relative
rather than the absolute error of approximation of the function. The partial sums of
such a series, in contrast to the classical case, are used not so much to approximate
the values of the function at specific points as to describe their collective behavior
under the limiting passage in question (which in the present example occurs as
$x \to +\infty$).
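The behavior described in Example 8 can be observed numerically. The sketch below approximates $f(x) = \int_1^x e^t/t\,dt$ by a composite Simpson rule (a choice made here for self-containedness, not a method from the text) and compares it with the partial sums $e^x\sum_{k=1}^{n}(k-1)!/x^k$. For fixed $x$ the relative error first decreases with $n$ and then grows without bound, while for fixed $n$ it decreases as $x$ grows.

```python
import math


def simpson(func, a, b, n=20000):
    """Composite Simpson rule on [a, b] with an even number n of subintervals."""
    h = (b - a) / n
    total = func(a) + func(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * func(a + i * h)
    return total * h / 3.0


def f(x):
    """Numerical value of the integral of e^t / t over [1, x]."""
    return simpson(lambda t: math.exp(t) / t, 1.0, x)


def partial_sum(x, n):
    """n-th partial sum e^x * sum_{k=1..n} (k-1)! / x^k of the asymptotic series."""
    return math.exp(x) * sum(math.factorial(k - 1) / x ** k for k in range(1, n + 1))


def rel_error(x, n):
    """Relative error of the n-th partial sum at the point x."""
    exact = f(x)
    return abs(partial_sum(x, n) - exact) / exact


if __name__ == "__main__":
    for n in (1, 2, 5, 8, 10, 15, 20, 25):
        print(f"x = 10, n = {n:2d}: relative error = {rel_error(10.0, n):.3e}")
    for x in (5.0, 10.0, 20.0):
        print(f"n = 3, x = {x:4.1f}: relative error = {rel_error(x, 3):.3e}")
```

The first loop illustrates divergence of the series at a fixed point; the second illustrates the asymptotic meaning of each truncated formula as $x \to +\infty$.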
b. Asymptotic Sequences and Asymptotic Series

Definition 2 A sequence of asymptotic formulas

$$f(x) = \psi_0(x) + o(\psi_0(x)),$$

$$f(x) = \psi_0(x) + \psi_1(x) + o(\psi_1(x)),$$

$$\cdots$$

$$f(x) = \psi_0(x) + \psi_1(x) + \cdots + \psi_n(x) + o(\psi_n(x)),$$

$$\cdots$$

that are valid over a base $\mathcal{B}$ in the set $X$ where the functions are defined, is written
as the relation

$$f(x) \simeq \psi_0(x) + \psi_1(x) + \cdots + \psi_n(x) + \cdots$$

or, more briefly, as $f(x) \simeq \sum_{k=0}^{\infty} \psi_k(x)$. It is called an asymptotic expansion of $f$ in
the given base $\mathcal{B}$.

It is clear from this definition that in asymptotic expansions we always have

$$o(\psi_n(x)) = \psi_{n+1}(x) + o(\psi_{n+1}(x)) \quad\text{over the base } \mathcal{B},$$

and hence for any $n = 0, 1, 2, \ldots$ we have

$$\psi_{n+1}(x) = o(\psi_n(x)) \quad\text{over the base } \mathcal{B},$$

that is, each successive term of the expansion contributes its correction, which is
asymptotically more precise in comparison with its predecessor.

Asymptotic expansions usually arise in the form of a linear combination

$$c_0\varphi_0(x) + c_1\varphi_1(x) + \cdots + c_n\varphi_n(x) + \cdots$$

of functions of some sequence $\{\varphi_n(x)\}$ that is convenient for the specific problem.

Definition 3 Let $X$ be a set with a base $\mathcal{B}$ defined on it. The sequence $\{\varphi_n(x)\}$
of functions defined on $X$ is called an asymptotic sequence over the base $\mathcal{B}$ if
$\varphi_{n+1}(x) = o(\varphi_n(x))$ over the base $\mathcal{B}$ (for any two adjacent terms $\varphi_n$ and $\varphi_{n+1}$ of
the sequence) and if none of the functions $\varphi_n \in \{\varphi_n(x)\}$ is identically zero on any
element of $\mathcal{B}$.

Remark 2 The condition that $(\varphi_n|_B)(x) \not\equiv 0$ on the elements $B$ of the base $\mathcal{B}$ is
natural, since otherwise all the functions $\varphi_{n+1}, \varphi_{n+2}, \ldots$ would also be zero on $B$
and the system $\{\varphi_n\}$ would be trivial in respect to its asymptotics.


Example 9 The following sequences are obviously asymptotic:

a) $1, x, x^2, \ldots, x^n, \ldots$ as $x \to 0$;

b) $1, \frac{1}{x}, \frac{1}{x^2}, \ldots, \frac{1}{x^n}, \ldots$ as $x \to \infty$;

c) $x^{p_1}, x^{p_2}, \ldots, x^{p_n}, \ldots$
in the base $x \to 0$ if $p_1 < p_2 < \cdots < p_n < \cdots$,
in the base $x \to \infty$ if $p_1 > p_2 > \cdots > p_n > \cdots$;

d) the sequence $\{g(x)\varphi_n(x)\}$ obtained from an asymptotic sequence through
multiplication of all its terms by the same function.

Definition 4 If $\{\varphi_n\}$ is an asymptotic sequence over the base $\mathcal{B}$, then an asymptotic
expansion of the form

$$f(x) \simeq c_0\varphi_0(x) + c_1\varphi_1(x) + \cdots + c_n\varphi_n(x) + \cdots$$

is called an asymptotic expansion or asymptotic series of the function $f$ with respect
to the asymptotic sequence $\{\varphi_n\}$ over the base $\mathcal{B}$.

Remark 3 The concept of an asymptotic series (in the context of power series) was
stated by Poincaré (1886), who made vigorous use of asymptotic expansions in his
work on celestial mechanics. But asymptotic series themselves, like some of the
methods of obtaining them, had been encountered earlier. In regard to the possible
generalization of the concept of an asymptotic expansion in the sense of Poincaré
(which we have discussed in Definitions 2-4) see Problem 5 at the end of this
section.


19.1.2 General Facts About Asymptotic Series 

a. Uniqueness of an Asymptotic Expansion 

When we speak of the asymptotic behavior of a function over a base $\mathcal{B}$, we are
interested only in the nature of the limiting behavior of the function, so that if two
generally different functions $f$ and $g$ are equal on some element of the base $\mathcal{B}$, they
have the same asymptotic behavior over $\mathcal{B}$ and should be considered equal in the
asymptotic sense.

Moreover, if we fix in advance some asymptotic sequence $\{\varphi_n\}$ in terms of which
it is desirable to carry out an asymptotic expansion, we must reckon with the limited
possibilities of any such system of functions $\{\varphi_n\}$. To be specific, there will be
functions that are infinitesimal with respect to every term $\varphi_n$ of the given asymptotic
system.


Example 10 Let $\varphi_n(x) = \frac{1}{x^n}$, $n = 0, 1, \ldots$; then $e^{-x} = o(\varphi_n(x))$ as $x \to +\infty$.


For that reason it is natural to adopt the following definitions. 
Definition 5 If $\{\varphi_n(x)\}$ is an asymptotic sequence over the base $\mathcal{B}$, a function $f$
such that $f(x) = o(\varphi_n(x))$ over $\mathcal{B}$ for each $n = 0, 1, \ldots$ is called an asymptotic zero
with respect to $\{\varphi_n(x)\}$.

Definition 6 Functions $f$ and $g$ are asymptotically equal over the base $\mathcal{B}$ with respect
to a sequence of functions $\{\varphi_n\}$ that is asymptotic over $\mathcal{B}$ if the difference
$f - g$ is an asymptotic zero with respect to $\{\varphi_n\}$.

Proposition 1 (Uniqueness of an asymptotic expansion) Let $\{\varphi_n\}$ be an asymptotic
sequence of functions over a base $\mathcal{B}$.

a) If a function $f$ admits an asymptotic expansion with respect to the sequence
$\{\varphi_n\}$ over $\mathcal{B}$, then that expansion is unique.

b) If the functions $f$ and $g$ admit an asymptotic expansion in the system $\{\varphi_n\}$, then
these expansions are the same if and only if the functions $f$ and $g$ are asymptotically
equal over $\mathcal{B}$ with respect to $\{\varphi_n\}$.

Proof a) Suppose the function $\varphi$ is not identically zero on any element of $\mathcal{B}$.

We shall show that if $f(x) = o(\varphi(x))$ over $\mathcal{B}$, and at the same time $f(x) =
c\varphi(x) + o(\varphi(x))$ over $\mathcal{B}$, then $c = 0$.

Indeed, $|f(x)| \ge |c\varphi(x)| - |o(\varphi(x))| = |c||\varphi(x)| - o(|\varphi(x)|)$ over $\mathcal{B}$, and so if
$|c| > 0$, there exists $B_1 \in \mathcal{B}$ at each point of which $|f(x)| \ge \frac{|c|}{2}|\varphi(x)|$. But if $f(x) =
o(\varphi(x))$ over $\mathcal{B}$, then there exists $B_2 \in \mathcal{B}$ at each point of which $|f(x)| \le \frac{|c|}{3}|\varphi(x)|$.
Hence at each point $x \in B_1 \cap B_2$ we would have to have $\frac{|c|}{2}|\varphi(x)| \le \frac{|c|}{3}|\varphi(x)|$ or,
assuming $|c| \ne 0$, $3|\varphi(x)| \le 2|\varphi(x)|$. But this is impossible if $\varphi(x) \ne 0$ at even one
point of $B_1 \cap B_2$.

Now let us consider the asymptotic expansion of a function $f$ with respect to the
sequence $\{\varphi_n\}$.

Let $f(x) = c_0\varphi_0(x) + o(\varphi_0(x))$ and $f(x) = \tilde c_0\varphi_0(x) + o(\varphi_0(x))$ over $\mathcal{B}$. Subtracting
the second equality from the first, we find that $0 = (c_0 - \tilde c_0)\varphi_0(x) + o(\varphi_0(x))$
over $\mathcal{B}$. But $0 = o(\varphi_0(x))$ over $\mathcal{B}$ and so, by what has been proved, $c_0 - \tilde c_0 = 0$.

If we have proved that $c_0 = \tilde c_0, \ldots, c_{n-1} = \tilde c_{n-1}$ for two expansions of the function
$f$ in the system $\{\varphi_n\}$, then by the equalities

$$f(x) = c_0\varphi_0(x) + \cdots + c_{n-1}\varphi_{n-1}(x) + c_n\varphi_n(x) + o(\varphi_n(x)),$$

$$f(x) = c_0\varphi_0(x) + \cdots + c_{n-1}\varphi_{n-1}(x) + \tilde c_n\varphi_n(x) + o(\varphi_n(x))$$

we find in the same way that $c_n = \tilde c_n$.

By induction we now conclude that a) is true.

b) If $f(x) = c_0\varphi_0(x) + \cdots + c_n\varphi_n(x) + o(\varphi_n(x))$ and $g(x) = c_0\varphi_0(x) + \cdots +
c_n\varphi_n(x) + o(\varphi_n(x))$ over $\mathcal{B}$, then $f(x) - g(x) = o(\varphi_n(x))$ over $\mathcal{B}$ for each $n =
0, 1, \ldots$, and hence the functions $f$ and $g$ are asymptotically equal with respect to
the sequence $\{\varphi_n(x)\}$.

The converse follows from a), since an asymptotic zero, which we take to be the
difference $f - g$, can have only the zero asymptotic expansion. □
Remark 4 We have discussed the question of uniqueness of an asymptotic expansion.
We emphasize, however, that an asymptotic expansion of a function with respect
to a preassigned asymptotic sequence is by no means always possible. Two
functions $f$ and $g$ in general need not always be connected by one of the asymptotic
relations $f = O(g)$, $f = o(g)$ or $f \sim g$ over a base $\mathcal{B}$.

The very general asymptotic Taylor formula, for example, exhibits a specific
class of functions (having derivatives of order up to $n$ at $x = 0$), each of which
admits the asymptotic representation

$$f(x) = f(0) + \frac{1}{1!}f'(0)x + \cdots + \frac{1}{n!}f^{(n)}(0)x^n + o(x^n)$$

as $x \to 0$. But even the function $x^{1/2}$ cannot be expanded asymptotically in the system
$1, x, x^2, \ldots$. Thus one must not identify an asymptotic sequence and an asymptotic
expansion with some canonical basis and the expansion of every asymptotic behavior in it.
There are many more possible types of asymptotic behavior than can be described 
by any fixed asymptotic sequence, so that the description of the asymptotic behav- 
ior of a function is not so much an expansion in terms of a preassigned asymptotic 
system as it is the search for such a system. One cannot, for example, when com- 
puting the indefinite integral of an elementary function, require in advance that the 
result be a composition of certain elementary functions, because it may not be an 
elementary function at all. The search for asymptotic formulas, like the computation 
of indefinite integrals, is of interest only to the extent that the result is simpler and 
more accessible to investigation than the original expression. 


b. Admissible Operations with Asymptotic Formulas 

The elementary arithmetic properties of the symbols $o$ and $O$ (such properties as
$o(g) + o(g) = o(g)$, $o(g) + O(g) = O(g) + O(g) = O(g)$, and the like) have been
studied along with the theory of limits (Proposition 4 of Sect. 3.2). The following
obvious proposition follows from these properties and the definition of an asymptotic
expansion.

Proposition 2 (Linearity of asymptotic expansions) If the functions $f$ and $g$ admit
asymptotic expansions $f \simeq \sum_{n=0}^{\infty} a_n\varphi_n$ and $g \simeq \sum_{n=0}^{\infty} b_n\varphi_n$ with respect to the
asymptotic sequence $\{\varphi_n\}$ over the base $\mathcal{B}$, then a linear combination of them $\alpha f +
\beta g$ admits such an expansion, and $(\alpha f + \beta g) \simeq \sum_{n=0}^{\infty} (\alpha a_n + \beta b_n)\varphi_n$.

Further properties of asymptotic expansions and asymptotic formulas in general 
will involve more and more specialized cases. 

Proposition 3 (Integration of asymptotic equalities) Let $f$ be a continuous function
on the interval $I = [a, \omega[$ (or $I = ]\omega, a]$).

a) If the function $g(x)$ is continuous and nonnegative on $I$ and the integral
$\int_a^\omega g(x)\,dx$ diverges, then the relations

$$f(x) = O(g(x)), \quad f(x) = o(g(x)), \quad f(x) \sim g(x) \quad\text{as } I \ni x \to \omega$$

imply respectively that

$$F(x) = O(G(x)), \quad F(x) = o(G(x)), \quad\text{and}\quad F(x) \sim G(x),$$

where

$$F(x) = \int_a^x f(t)\,dt \quad\text{and}\quad G(x) = \int_a^x g(t)\,dt.$$

b) If the functions $\varphi_n(x)$, $n = 0, 1, \ldots$, which are continuous and positive on
$I = [a, \omega[$, form an asymptotic sequence as $I \ni x \to \omega$ and the integrals $\Phi_n(x) =
\int_x^\omega \varphi_n(t)\,dt$ converge for $x \in I$, then the functions $\Phi_n(x)$, $n = 0, 1, \ldots$ also form an
asymptotic sequence over the base $I \ni x \to \omega$.

c) If the integral $F(x) = \int_x^\omega f(t)\,dt$ converges and $f$ has the asymptotic expansion
$f(x) \simeq \sum_{n=0}^{\infty} c_n\varphi_n(x)$ as $I \ni x \to \omega$ with respect to the asymptotic sequence
$\{\varphi_n(x)\}$ of b), then $F(x)$ has the asymptotic expansion $F(x) \simeq \sum_{n=0}^{\infty} c_n\Phi_n(x)$.


Proof a) If $f(x) = O(g(x))$ as $I \ni x \to \omega$, there exists $x_0 \in I$ and a constant $M$
such that $|f(x)| \le Mg(x)$ for $x \in [x_0, \omega[$. It follows that for $x \in [x_0, \omega[$, we have
$\bigl|\int_a^x f(t)\,dt\bigr| \le \bigl|\int_a^{x_0} f(t)\,dt\bigr| + M\int_{x_0}^x g(t)\,dt = O\bigl(\int_a^x g(t)\,dt\bigr)$.

To prove the other two relations one can use L’Hôpital’s rule (as in Example 7),
taking account of the relation $G(x) = \int_a^x g(t)\,dt \to \infty$ as $I \ni x \to \omega$. As a result,
we find

$$\lim_{I\ni x\to\omega} \frac{F(x)}{G(x)} = \lim_{I\ni x\to\omega} \frac{F'(x)}{G'(x)} = \lim_{I\ni x\to\omega} \frac{f(x)}{g(x)}.$$

b) Since $\Phi_n(x) \to 0$ as $I \ni x \to \omega$ ($n = 0, 1, \ldots$), applying L’Hôpital’s rule again,
we find that

$$\lim_{I\ni x\to\omega} \frac{\Phi_{n+1}(x)}{\Phi_n(x)} = \lim_{I\ni x\to\omega} \frac{\Phi'_{n+1}(x)}{\Phi'_n(x)} = \lim_{I\ni x\to\omega} \frac{\varphi_{n+1}(x)}{\varphi_n(x)} = 0.$$

c) The function $r_n(x)$ in the relation

$$f(x) = c_0\varphi_0(x) + c_1\varphi_1(x) + \cdots + c_n\varphi_n(x) + r_n(x),$$

being the difference of continuous functions on $I$, is itself continuous on $I$, and we
obviously have $R_n(x) = \int_x^\omega r_n(t)\,dt \to 0$ as $I \ni x \to \omega$. But $r_n(x) = o(\varphi_n(x))$ as
$I \ni x \to \omega$ and $\Phi_n(x) \to 0$ as $I \ni x \to \omega$. Therefore, again by L’Hôpital’s rule, it
follows that in the equality

$$F(x) = c_0\Phi_0(x) + c_1\Phi_1(x) + \cdots + c_n\Phi_n(x) + R_n(x)$$

the quantity $R_n(x)$ is $o(\Phi_n(x))$ as $I \ni x \to \omega$. □
Remark 5 Differentiation of asymptotic equalities and asymptotic series is generally
not legitimate.

Example 11 The function $f(x) = e^{-x}\sin(e^x)$ is continuously differentiable on $\mathbb{R}$
and is an asymptotic zero with respect to the asymptotic sequence $\{\frac{1}{x^n}\}$ as $x \to +\infty$.
The derivatives of the functions $\frac{1}{x^n}$, up to a constant factor, again have the form
$\frac{1}{x^k}$. However the function $f'(x) = -e^{-x}\sin(e^x) + \cos(e^x)$ not only fails to be an
asymptotic zero; it doesn’t even have an asymptotic expansion with respect to the
sequence $\{\frac{1}{x^n}\}$ as $x \to +\infty$.
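A short numerical experiment makes Example 11 tangible. The sketch below evaluates $f(x) = e^{-x}\sin(e^x)$ and its derivative at the points $x = \ln(2\pi k)$ and $x = \ln((2k+1)\pi)$, where $\cos(e^x)$ equals $+1$ and $-1$ respectively; these sample points are chosen here for illustration only.

```python
import math


def f(x):
    """f(x) = exp(-x) * sin(exp(x)), an asymptotic zero for {1/x^n} as x -> +oo."""
    return math.exp(-x) * math.sin(math.exp(x))


def f_prime(x):
    """Derivative f'(x) = -exp(-x) * sin(exp(x)) + cos(exp(x))."""
    return -math.exp(-x) * math.sin(math.exp(x)) + math.cos(math.exp(x))


if __name__ == "__main__":
    # x^n * f(x) -> 0 for every fixed n (shown here for n = 5).
    for x in (10.0, 20.0, 50.0):
        print(f"x = {x:5.1f}   x^5 * f(x) = {x ** 5 * f(x): .3e}")
    # f'(x) keeps returning to values near +1 and -1, so it has no limit.
    for k in (10, 100, 1000):
        x_plus = math.log(2 * math.pi * k)
        x_minus = math.log((2 * k + 1) * math.pi)
        print(f"k = {k:4d}   f'(x_plus) = {f_prime(x_plus): .6f}   "
              f"f'(x_minus) = {f_prime(x_minus): .6f}")
```

Since $f'$ does not even tend to a limit, it cannot satisfy $f'(x) = c_0 + o(1)$, which is the first requirement of an expansion in the sequence $\{1/x^n\}$.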


19.1.3 Asymptotic Power Series 

In conclusion, let us examine asymptotic power series in some detail, since they are 
encountered relatively often, although in a rather generalized form, as was the case 
in Example 8. 

We shall study expansions with respect to the sequence $\{x^n;\ n = 0, 1, \ldots\}$, which
is asymptotic as $x \to 0$, and with respect to $\{\frac{1}{x^n};\ n = 0, 1, \ldots\}$, which is asymptotic as
$x \to \infty$. Since these are both the same object up to the change of variable $x = \frac{1}{u}$, we
state the next proposition only for expansions with respect to the first sequence and
then note the specifics of certain of the formulations given in the case of expansions
with respect to the second sequence.

Proposition 4 Let 0 be a limit point of $E$ and let

$$f(x) \simeq a_0 + a_1x + a_2x^2 + \cdots,$$

$$g(x) \simeq b_0 + b_1x + b_2x^2 + \cdots$$

as $E \ni x \to 0$.

Then as $E \ni x \to 0$,

a) $(\alpha f + \beta g) \simeq \sum_{n=0}^{\infty} (\alpha a_n + \beta b_n)x^n$;

b) $(f \cdot g)(x) \simeq \sum_{n=0}^{\infty} c_nx^n$, where $c_n = a_0b_n + a_1b_{n-1} + \cdots + a_nb_0$, $n = 0,
1, \ldots$;

c) if $b_0 \ne 0$, then $\bigl(\frac{f}{g}\bigr)(x) \simeq \sum_{n=0}^{\infty} d_nx^n$, where the coefficients $d_n$ can be found
from the recurrence relations

$$a_0 = b_0d_0, \quad a_1 = b_0d_1 + b_1d_0, \quad \ldots, \quad a_n = \sum_{k=0}^{n} b_kd_{n-k}, \quad \ldots;$$

d) if $E$ is a deleted neighborhood or one-sided neighborhood of 0 and $f$ is continuous
on $E$, then

$$\int_0^x f(t)\,dt \simeq a_0x + \frac{a_1}{2}x^2 + \cdots + \frac{a_{n-1}}{n}x^n + \cdots;$$

e) if in addition to the assumptions of d) we also have $f \in C^{(1)}(E)$ and

$$f'(x) \simeq a'_0 + a'_1x + \cdots,$$

then $a'_n = (n + 1)a_{n+1}$, $n = 0, 1, \ldots$.

Proof a) This is a special case of Proposition 2.

b) Using the properties of $o(\cdot)$ (see Proposition 4 of Sect. 3.2), we find that

$$(f \cdot g)(x) = f(x) \cdot g(x) = \bigl(a_0 + a_1x + \cdots + a_nx^n + o(x^n)\bigr)\bigl(b_0 + b_1x + \cdots + b_nx^n + o(x^n)\bigr) =$$

$$= (a_0b_0) + (a_0b_1 + a_1b_0)x + \cdots + (a_0b_n + a_1b_{n-1} + \cdots + a_nb_0)x^n + o(x^n)$$

as $E \ni x \to 0$.

c) If $b_0 \ne 0$, then $g(x) \ne 0$ for $x$ close to zero, and therefore we can consider the
ratio $\frac{f(x)}{g(x)} = h(x)$. Let us verify that if the coefficients $d_0, \ldots, d_n$ in the representation
$h(x) = d_0 + d_1x + \cdots + d_nx^n + r_n(x)$ have been chosen in accordance with c),
then $r_n(x) = o(x^n)$ as $E \ni x \to 0$. From the identity $f(x) = g(x)h(x)$, we find that

$$a_0 + a_1x + \cdots + a_nx^n + o(x^n) = \bigl(b_0 + b_1x + \cdots + b_nx^n + o(x^n)\bigr)\bigl(d_0 + d_1x + \cdots + d_nx^n + r_n(x)\bigr) =$$

$$= (b_0d_0) + (b_0d_1 + b_1d_0)x + \cdots + (b_0d_n + b_1d_{n-1} + \cdots + b_nd_0)x^n + b_0r_n(x) + o(r_n(x)) + o(x^n),$$

from which it follows that $o(x^n) = b_0r_n(x) + o(r_n(x)) + o(x^n)$, or $r_n(x) = o(x^n)$ as
$E \ni x \to 0$, since $b_0 \ne 0$.

d) This follows from part c) of Proposition 3 if we set $\omega = 0$ there and recall that
$-\int_x^0 f(t)\,dt = \int_0^x f(t)\,dt$.

e) Since the function $f'(x)$ is continuous on $]0, x]$ (or $[x, 0[$) and bounded (it
tends to $a'_0$ as $x \to 0$), the integral $\int_0^x f'(t)\,dt$ exists. Obviously $f(x) = a_0 +
\int_0^x f'(t)\,dt$, since $f(x) \to a_0$ as $x \to 0$. Substituting the asymptotic expansion of
$f'(x)$ into this equality and using what was proved in d), we find that

$$f(x) \simeq a_0 + a'_0x + \frac{a'_1}{2}x^2 + \cdots + \frac{a'_{n-1}}{n}x^n + \cdots.$$

It now follows from the uniqueness of asymptotic expansions (Proposition 1) that
$a'_n = (n + 1)a_{n+1}$, $n = 0, 1, \ldots$. □
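The coefficient rules of parts b) and c) of Proposition 4 are purely algebraic and can be carried out mechanically on truncated coefficient lists. The following sketch implements the product rule $c_n = \sum_{k=0}^{n} a_kb_{n-k}$ and solves the recurrence $a_n = \sum_{k=0}^{n} b_kd_{n-k}$ for $d_n$; the function names are illustrative, and exact rational arithmetic is used to avoid rounding.

```python
from fractions import Fraction


def product_coeffs(a, b):
    """Coefficients c_n = a_0 b_n + a_1 b_{n-1} + ... + a_n b_0 of the product."""
    n = min(len(a), len(b))
    return [sum(a[k] * b[m - k] for k in range(m + 1)) for m in range(n)]


def quotient_coeffs(a, b):
    """Coefficients d_n of f/g, solved from a_n = sum_{k=0..n} b_k d_{n-k} (b_0 != 0)."""
    if b[0] == 0:
        raise ValueError("b_0 must be nonzero")
    n = min(len(a), len(b))
    d = []
    for m in range(n):
        # Terms of the recurrence that involve already known d_0, ..., d_{m-1}.
        known = sum(b[k] * d[m - k] for k in range(1, m + 1))
        d.append((a[m] - known) / b[0])
    return d


if __name__ == "__main__":
    # Taylor coefficients of sin x and cos x up to x^5.
    sin_c = [Fraction(0), Fraction(1), Fraction(0), Fraction(-1, 6), Fraction(0), Fraction(1, 120)]
    cos_c = [Fraction(1), Fraction(0), Fraction(-1, 2), Fraction(0), Fraction(1, 24), Fraction(0)]
    tan_c = quotient_coeffs(sin_c, cos_c)
    print("tan x coefficients:", [str(c) for c in tan_c])
    print("check (tan * cos):", [str(c) for c in product_coeffs(tan_c, cos_c)])
```

In the usage example the quotient of the expansions of $\sin x$ and $\cos x$ reproduces the beginning of the expansion of $\tan x$, and multiplying back by the cosine coefficients returns the sine coefficients.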

Corollary 1 If $U$ is a neighborhood (or one-sided neighborhood) of infinity in $\mathbb{R}$
and the function $f$ is continuous in $U$ and has the asymptotic expansion

$$f(x) \simeq a_0 + \frac{a_1}{x} + \frac{a_2}{x^2} + \cdots + \frac{a_n}{x^n} + \cdots \quad\text{as } U \ni x \to \infty,$$

then the integral

$$\mathcal{F}(x) = \int_x^\infty \Bigl(f(t) - a_0 - \frac{a_1}{t}\Bigr)\,dt$$

over an interval contained in $U$ converges and has the following asymptotic expansion:

$$\mathcal{F}(x) \simeq \frac{a_2}{x} + \frac{a_3}{2x^2} + \cdots + \frac{a_{n+1}}{nx^n} + \cdots \quad\text{as } U \ni x \to \infty.$$

Proof The convergence of the integral is obvious, since

$$f(t) - a_0 - \frac{a_1}{t} \sim \frac{a_2}{t^2} \quad\text{as } U \ni t \to \infty.$$

It remains only to integrate the asymptotic expansion

$$f(t) - a_0 - \frac{a_1}{t} \simeq \frac{a_2}{t^2} + \frac{a_3}{t^3} + \cdots + \frac{a_n}{t^n} + \cdots \quad\text{as } U \ni t \to \infty,$$

citing, for example, Proposition 3c). □


Corollary 2 If in addition to the hypotheses of Corollary 1 it is known that $f \in C^{(1)}(U)$ and $f'$ admits the asymptotic expansion

$$f'(x) \simeq a'_0 + \frac{a'_1}{x} + \frac{a'_2}{x^2} + \cdots + \frac{a'_n}{x^n} + \cdots \quad \text{as } U \ni x \to \infty,$$

then this expansion can be obtained by formally differentiating the expansion of the function $f$ and

$$a'_n = -(n - 1)a_{n-1}, \quad n = 2, 3, \ldots \quad \text{and} \quad a'_0 = a'_1 = 0.$$

Proof Since $f'(x) = a'_0 + \frac{a'_1}{x} + O(1/x^2)$ as $U \ni x \to \infty$, we have

$$f(x) = f(x_0) + \int_{x_0}^{x} f'(t)\,dt = a'_0 x + a'_1 \ln x + O(1)$$

as $U \ni x \to \infty$; and since $f(x) \simeq a_0 + \frac{a_1}{x} + \frac{a_2}{x^2} + \cdots$ and the sequence $x, \ln x, 1, \frac{1}{x}, \frac{1}{x^2}, \ldots$ is an asymptotic sequence as $U \ni x \to \infty$, Proposition 1 enables us to conclude that $a'_0 = a'_1 = 0$. Now, integrating the expansion $f'(x) \simeq \frac{a'_2}{x^2} + \frac{a'_3}{x^3} + \cdots$, by Corollary 1 we obtain the expansion of $f(x)$, and by the uniqueness of the expansion we arrive at the relations $a'_n = -(n - 1)a_{n-1}$ for $n = 2, 3, \ldots$. □


19.1.4 Problems and Exercises 

1. a) Let $h(z) = \sum_{n=0}^{\infty} a_n z^{-n}$ for $|z| > R$, $z \in \mathbb{C}$. Show that then $h(z) \simeq \sum_{n=0}^{\infty} a_n z^{-n}$ as $\mathbb{C} \ni z \to \infty$.

b) Assuming that the required solution $y(x)$ of the equation $y'(x) + y^2(x) = \sin\frac{1}{x^2}$ has an asymptotic expansion $y(x) \simeq \sum_{n=0}^{\infty} c_n x^{-n}$ as $x \to \infty$, find the first three terms of this expansion.

c) Prove that if $f(z) = \sum_{n=0}^{\infty} a_n z^n$ for $|z| < r$, $z \in \mathbb{C}$, and $g(z) \simeq b_1 z + b_2 z^2 + \cdots$ as $\mathbb{C} \ni z \to 0$, then the function $f \circ g$ is defined in some deleted neighborhood of $0 \in \mathbb{C}$ and $(f \circ g)(z) \simeq c_0 + c_1 z + c_2 z^2 + \cdots$ as $\mathbb{C} \ni z \to 0$, where the coefficients $c_0, c_1, \ldots$ can be obtained by substituting the series in the series, just as for convergent power series.

2. Show the following. 

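a) If $f$ is a continuous, positive, monotonic function for $x \ge 0$, then

$$\sum_{k=0}^{n} f(k) = \int_0^n f(x)\,dx + O(f(n)) + O(1) \quad \text{as } n \to \infty;$$

b) $\sum_{k=1}^{n} \frac{1}{k} = \ln n + c + o(1)$ as $n \to \infty$;

c) $\sum_{k=1}^{n} k^{\alpha} (\ln k)^{\beta} \sim \frac{n^{\alpha+1} (\ln n)^{\beta}}{\alpha + 1}$ as $n \to \infty$ for $\alpha > -1$.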

3. Through integration by parts find the asymptotic expansions of the following functions as $x \to +\infty$:

a) $\Gamma_s(x) = \int_x^{+\infty} t^{s-1} e^{-t}\,dt$ - the incomplete gamma function;

b) $\operatorname{erf}(x) = \frac{1}{\sqrt{\pi}} \int_{-x}^{x} e^{-t^2}\,dt$ - the probability error function (we recall that $\int_{-\infty}^{+\infty} e^{-x^2}\,dx = \sqrt{\pi}$ is the Euler-Poisson integral);

c) $F(x) = \int_x^{+\infty} \frac{e^{it}}{t^{\alpha}}\,dt$ if $\alpha > 0$.

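4. Using the result of the preceding problem, find the asymptotic expansions of the following functions as $x \to +\infty$:

a) $\operatorname{Si}(x) = \int_0^x \frac{\sin t}{t}\,dt$ - the sine integral (we recall that $\int_0^{\infty} \frac{\sin x}{x}\,dx = \frac{\pi}{2}$ is the Dirichlet integral).

b) $C(x) = \int_0^x \cos\frac{\pi}{2} t^2\,dt$, $S(x) = \int_0^x \sin\frac{\pi}{2} t^2\,dt$ - the Fresnel integrals (we recall that $\int_0^{+\infty} \cos x^2\,dx = \int_0^{+\infty} \sin x^2\,dx = \frac{1}{2}\sqrt{\frac{\pi}{2}}$).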

5. The following generalization of the concept of an expansion in an asymptotic sequence $\{\varphi_n(x)\}$ introduced by Poincaré and studied above is due to Erdélyi.²

Let $X$ be a set, $\mathcal{B}$ a base in $X$, $\{\varphi_n(x)\}$ an asymptotic sequence of functions on $X$. If the functions $f(x), \psi_0(x), \psi_1(x), \psi_2(x), \ldots$ are such that the equality

$$f(x) = \sum_{k=0}^{n} \psi_k(x) + o(\varphi_n(x)) \quad \text{over the base } \mathcal{B}$$

holds for every $n = 0, 1, \ldots$, we write

$$f(x) \simeq \sum_{n=0}^{\infty} \psi_n(x), \quad \{\varphi_n(x)\} \quad \text{over the base } \mathcal{B},$$

and we say that we have the asymptotic expansion of the function $f$ over the base $\mathcal{B}$ in the sense of Erdélyi.

² A. Erdélyi (1908-1977) - Hungarian/British mathematician.

a) Please note that in Problem 4 you obtained the asymptotic expansion in the sense of Erdélyi if you assume $\varphi_n(x) = x^{-n}$, $n = 0, 1, \ldots$.

b) Show that asymptotic expansions in the sense of Erdélyi do not have the property of uniqueness (the functions $\psi_n$ can be changed).

c) Show that if a set $X$, a base $\mathcal{B}$ in $X$, a function $f$ on $X$, and sequences $\{\psi_n(x)\}$ and $\{\varphi_n(x)\}$, the second of which is asymptotic over the base $\mathcal{B}$, are given, then the expansion

$$f(x) \simeq \sum_{n=0}^{\infty} a_n \psi_n(x), \quad \{\varphi_n(x)\} \quad \text{over the base } \mathcal{B},$$

where $a_n$ are numerical coefficients, is either impossible or unique.

6. Uniform asymptotic estimates. Let $X$ be a set and $\mathcal{B}_X$ a base in $X$, and let $f(x, y)$ and $g(x, y)$ be (vector-valued) functions defined on $X$ and depending on the parameter $y \in Y$. Set $|f(x, y)| = \alpha(x, y)|g(x, y)|$. We say that the asymptotic relations

$$f(x, y) = o(g(x, y)), \quad f(x, y) = O(g(x, y)), \quad f(x, y) \sim g(x, y)$$

are uniform with respect to the parameter $y$ on the set $Y$ if (respectively) $\alpha(x, y) \rightrightarrows 0$ on $Y$ over the base $\mathcal{B}_X$; $\alpha(x, y)$ is ultimately bounded over the base $\mathcal{B}_X$ uniformly with respect to $y \in Y$; and finally $f = \alpha \cdot g + o(g)$, where $\alpha(x, y) \rightrightarrows 1$ on $Y$ over the base $\mathcal{B}_X$.

Show that if we introduce the base $\mathcal{B} = \{B_x \times Y\}$ in $X \times Y$ whose elements are the direct products of the elements $B_x$ of the base $\mathcal{B}_X$ and the set $Y$, then these definitions are equivalent respectively to the following:

$$f(x, y) = o(g(x, y)), \quad f(x, y) = O(g(x, y)), \quad f(x, y) \sim g(x, y)$$

over the base $\mathcal{B}$.

7. Uniform asymptotic expansions. The asymptotic expansion

$$f(x, y) \simeq \sum_{n=0}^{\infty} a_n(y)\varphi_n(x) \quad \text{over the base } \mathcal{B}_X$$

is uniform with respect to the parameter $y$ on $Y$ if the estimate $r_n(x, y) = o(\varphi_n(x))$ over the base $\mathcal{B}_X$ in $X$ holds uniformly on $Y$ in the equalities

$$f(x, y) = \sum_{k=0}^{n} a_k(y)\varphi_k(x) + r_n(x, y), \quad n = 0, 1, \ldots.$$

a) Let $Y$ be a (bounded) measurable set in $\mathbb{R}^n$, and suppose that for each fixed $x \in X$ the functions $f(x, y), a_0(y), a_1(y), \ldots$ are integrable over $Y$. Show that if the asymptotic expansion $f(x, y) \simeq \sum_{n=0}^{\infty} a_n(y)\varphi_n(x)$ over the base $\mathcal{B}_X$ is uniform with respect to the parameter $y \in Y$, then the following asymptotic expansion also holds:

$$\int_Y f(x, y)\,dy \simeq \sum_{n=0}^{\infty} \Bigl( \int_Y a_n(y)\,dy \Bigr) \varphi_n(x) \quad \text{over the base } \mathcal{B}_X.$$

b) Let $Y = [c, d] \subset \mathbb{R}$. Assume that the function $f(x, y)$ is continuously differentiable with respect to $y$ on the closed interval $Y$ for each fixed $x \in X$ and for some $y_0 \in Y$ admits the asymptotic expansion

$$f(x, y_0) \simeq \sum_{n=0}^{\infty} a_n(y_0)\varphi_n(x) \quad \text{over the base } \mathcal{B}_X.$$

Prove that if the asymptotic expansion

$$\frac{\partial f}{\partial y}(x, y) \simeq \sum_{n=0}^{\infty} \alpha_n(y)\varphi_n(x) \quad \text{over the base } \mathcal{B}_X$$

holds uniformly with respect to $y \in Y$ with coefficients $\alpha_n(y)$ that are continuous in $y$, $n = 0, 1, \ldots$, then the original function $f(x, y)$ has an asymptotic expansion $f(x, y) \simeq \sum_{n=0}^{\infty} a_n(y)\varphi_n(x)$ over the base $\mathcal{B}_X$ that is uniform with respect to $y \in Y$, its coefficients $a_n(y)$, $n = 0, 1, \ldots$ are smooth functions of $y$ on the interval $Y$ and $\frac{da_n}{dy}(y) = \alpha_n(y)$.

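8. Let $p(x)$ be a smooth function that is positive on the closed interval $c \le x \le d$.

a) Solve the equation $\frac{\partial^2 u}{\partial x^2}(x, \lambda) = \lambda^2 p(x)u(x, \lambda)$ in the case when $p(x) \equiv 1$ on $[c, d]$.

b) Let $0 < m \le p(x) \le M < +\infty$ on $[c, d]$ and let $u(c, \lambda) = 1$, $\frac{\partial u}{\partial x}(c, \lambda) = 0$. Estimate the quantity $u(x, \lambda)$ from above and below for $x \in [c, d]$.

c) Assuming that $\ln u(x, \lambda) \simeq \sum_{n=0}^{\infty} c_n(x)\lambda^{1-n}$ as $\lambda \to +\infty$, where $c_0(x), c_1(x), \ldots$ are smooth functions and, using the fact that $\bigl(\frac{u'}{u}\bigr)' = \frac{u''}{u} - \bigl(\frac{u'}{u}\bigr)^2$, show that ${c'_0}^2(x) = p(x)$ and $\bigl(c''_{n-1} + \sum_{k=0}^{n} c'_k \cdot c'_{n-k}\bigr)(x) = 0$.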


19.2 The Asymptotics of Integrals (Laplace’s Method) 
19.2.1 The Idea of Laplace’s Method 


In this subsection we shall discuss Laplace’s method - one of the few reasonably general methods of constructing the asymptotics of an integral depending on a parameter. We confine our attention to integrals of the form

$$F(\lambda) = \int_a^b f(x)e^{\lambda S(x)}\,dx, \tag{19.1}$$

where $S(x)$ is a real-valued function and $\lambda$ is a parameter. Such integrals are usually called Laplace integrals.


Example 1 The Laplace transform

$$L(f)(\xi) = \int_0^{+\infty} f(x)e^{-\xi x}\,dx$$

is a special case of a Laplace integral.

Example 2 Laplace himself applied his method to integrals of the form $\int_a^b f(x)\varphi^n(x)\,dx$, where $n \in \mathbb{N}$ and $\varphi(x) > 0$ on $]a, b[$. Such an integral is also a special case of a general Laplace integral (19.1), since $\varphi^n(x) = \exp(n \ln \varphi(x))$.


We shall be interested in the asymptotics of the integral (19.1) for large values of the parameter $\lambda$, more precisely as $\lambda \to +\infty$, $\lambda \in \mathbb{R}$.

So as not to become distracted with secondary issues when describing the basic idea of Laplace’s method, we shall assume that $[a, b] = I$ is a finite closed interval in the integral (19.1), that the functions $f(x)$ and $S(x)$ are smooth on $I$, and that $S(x)$ has a unique, strict maximum $S(x_0)$ at the point $x_0 \in I$. Then the function $\exp(\lambda S(x))$ also has a strict maximum at $x_0$, which rises higher above the other values of this function on the interval $I$ as the value of the parameter $\lambda$ increases. As a result, if $f(x) \not\equiv 0$ in a neighborhood of $x_0$, the entire integral (19.1) can be replaced by the integral over an arbitrarily small neighborhood of $x_0$, thereby admitting a relative error that tends to zero as $\lambda \to +\infty$. This observation is called the localization principle. Reversing the historical sequence of events, one might say that this localization principle for Laplace integrals resembles the principle of local action of approximate identities and the $\delta$-function.

Now that the integral is being taken over only a small neighborhood of $x_0$, the functions $f(x)$ and $S(x)$ can be replaced by the main terms of their Taylor expansions as $I \ni x \to x_0$.

It remains to find the asymptotics of the resulting canonical integral, which can 
be done without any particular difficulty. 

It is in the sequential execution of these steps that the essence of Laplace’s 
method of finding the asymptotics of an integral is to be found. 

Example 3 Let $x_0 = a$, $S'(a) \ne 0$, and $f(a) \ne 0$, which happens, for example, when the function $S(x)$ is monotonically decreasing on $[a, b]$. Under these conditions $f(x) = f(a) + o(1)$ and $S(x) = S(a) + (x - a)S'(a) + o(x - a)$, as $I \ni x \to a$. Carrying out the idea of Laplace’s method, for a small $\varepsilon > 0$ and $\lambda \to +\infty$, we find that

$$F(\lambda) \sim \int_a^{a+\varepsilon} f(x)e^{\lambda S(x)}\,dx \sim f(a)e^{\lambda S(a)} \int_0^{\varepsilon} e^{\lambda t S'(a)}\,dt = -\frac{f(a)e^{\lambda S(a)}}{\lambda S'(a)} \bigl( 1 - e^{\lambda S'(a)\varepsilon} \bigr).$$

Since $S'(a) < 0$, it follows that in the case in question

$$F(\lambda) \sim -\frac{f(a)e^{\lambda S(a)}}{\lambda S'(a)} \quad \text{as } \lambda \to +\infty. \tag{19.2}$$
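Formula (19.2) is easy to test numerically. The following is a rough sketch only, under assumptions that are not in the text: the functions $f(x) = 1 + x$ and $S(x) = -x$ on $[0, 1]$ are an illustrative choice, and the integral is approximated by a composite Simpson rule (the helper names are hypothetical). The printed ratio of the integral to the right-hand side of (19.2) should approach 1 as $\lambda$ grows.

```python
import math

def simpson(g, a, b, n=20000):
    """Composite Simpson rule on [a, b] with an even number n of subintervals."""
    h = (b - a) / n
    total = g(a) + g(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * g(a + i * h)
    return total * h / 3

def endpoint_principal_term(f_a, S_a, dS_a, lam):
    """Right-hand side of (19.2): -f(a) e^{lam S(a)} / (lam S'(a))."""
    return -f_a * math.exp(lam * S_a) / (lam * dS_a)

f = lambda x: 1.0 + x   # illustrative choice, f(0) = 1
S = lambda x: -x        # illustrative choice, S(0) = 0, S'(0) = -1

def ratio(lam):
    """F(lam) divided by the principal term (19.2) for the example above."""
    F = simpson(lambda x: f(x) * math.exp(lam * S(x)), 0.0, 1.0)
    return F / endpoint_principal_term(f(0.0), S(0.0), -1.0, lam)

for lam in (10.0, 100.0, 1000.0):
    print(lam, ratio(lam))
```

For this particular example the integral can also be computed in closed form, and the ratio equals $1 + 1/\lambda$ up to exponentially small terms, so the relative error of the principal term is of order $1/\lambda$.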


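Example 4 Let $a < x_0 < b$. Then $S'(x_0) = 0$, and we assume that $S''(x_0) \ne 0$, that is, $S''(x_0) < 0$, since $x_0$ is a maximum.

Using the expansions $f(x) = f(x_0) + O(x - x_0)$ and $S(x) = S(x_0) + \frac{1}{2}S''(x_0)(x - x_0)^2 + o((x - x_0)^2)$, which hold as $x \to x_0$, we find that for small $\varepsilon > 0$ and $\lambda \to +\infty$

$$F(\lambda) \sim \int_{x_0-\varepsilon}^{x_0+\varepsilon} f(x)e^{\lambda S(x)}\,dx \sim f(x_0)e^{\lambda S(x_0)} \int_{-\varepsilon}^{\varepsilon} e^{\frac{1}{2}\lambda S''(x_0)t^2}\,dt.$$

Making the change of variable $\frac{1}{2}\lambda S''(x_0)t^2 = -u^2$ (since $S''(x_0) < 0$), we obtain

$$\int_{-\varepsilon}^{\varepsilon} e^{\frac{1}{2}\lambda S''(x_0)t^2}\,dt = \sqrt{-\frac{2}{\lambda S''(x_0)}} \int_{-\varphi(\lambda,\varepsilon)}^{\varphi(\lambda,\varepsilon)} e^{-u^2}\,du,$$

where $\varphi(\lambda, \varepsilon) = \sqrt{-\frac{\lambda S''(x_0)}{2}}\,\varepsilon \to +\infty$ as $\lambda \to +\infty$.

Taking account of the equality

$$\int_{-\infty}^{+\infty} e^{-u^2}\,du = \sqrt{\pi},$$

we now find the principal term of the asymptotics of the Laplace integral in this case:

$$F(\lambda) \sim \sqrt{-\frac{2\pi}{\lambda S''(x_0)}}\, f(x_0)e^{\lambda S(x_0)} \quad \text{as } \lambda \to +\infty. \tag{19.3}$$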
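A numerical check of (19.3) can be set up the same way. This is a sketch under stated assumptions, not the author's computation: the choice $f(x) = 1 + x$, $S(x) = x(1 - x)$ on $[0, 1]$ (maximum at $x_0 = 1/2$, $S''(x_0) = -2$) and the Simpson helper are illustrative. For this quadratic $S$ the Gaussian integral can be evaluated exactly, and the ratio of the integral to the principal term works out to $\operatorname{erf}(\sqrt{\lambda}/2)$, which tends to 1 very quickly.

```python
import math

def simpson(g, a, b, n=20000):
    """Composite Simpson rule on [a, b] with an even number n of subintervals."""
    h = (b - a) / n
    total = g(a) + g(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * g(a + i * h)
    return total * h / 3

def interior_principal_term(f_x0, S_x0, d2S_x0, lam):
    """Right-hand side of (19.3): sqrt(-2 pi / (lam S''(x0))) f(x0) e^{lam S(x0)}."""
    return math.sqrt(-2.0 * math.pi / (lam * d2S_x0)) * f_x0 * math.exp(lam * S_x0)

f = lambda x: 1.0 + x           # illustrative choice
S = lambda x: x * (1.0 - x)     # illustrative choice: x0 = 1/2, S(x0) = 1/4, S''(x0) = -2

def ratio(lam):
    """F(lam) divided by the principal term (19.3) for the example above."""
    F = simpson(lambda x: f(x) * math.exp(lam * S(x)), 0.0, 1.0)
    return F / interior_principal_term(f(0.5), S(0.5), -2.0, lam)

for lam in (4.0, 25.0, 100.0):
    print(lam, ratio(lam), math.erf(math.sqrt(lam) / 2.0))
```

The odd part of $f$ about $x_0$ cancels in the symmetric integral, which is why only $f(x_0)$ survives in this example.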


Example 5 If $x_0 = a$, but $S'(x_0) = 0$ and $S''(x_0) < 0$, then, reasoning as in Example 4, we find this time that

$$F(\lambda) \sim \int_a^{a+\varepsilon} f(x)e^{\lambda S(x)}\,dx \sim f(x_0)e^{\lambda S(x_0)} \int_0^{\varepsilon} e^{\frac{1}{2}\lambda S''(x_0)t^2}\,dt,$$

and so

$$F(\lambda) \sim \frac{1}{2}\sqrt{-\frac{2\pi}{\lambda S''(x_0)}}\, f(x_0)e^{\lambda S(x_0)} \quad \text{as } \lambda \to +\infty. \tag{19.4}$$

We have now obtained on a heuristic level the three very useful formulas (19.2)-(19.4) involving the asymptotics of the Laplace integral (19.1).

It is clear from these considerations that Laplace’s method can be used successfully in the study of the asymptotics of any integral

$$\int_X f(x, \lambda)\,dx \quad \text{as } \lambda \to +\infty \tag{19.5}$$

provided (a) the localization principle holds for the integral, that is, the integral can be replaced by one equivalent to it as $\lambda \to +\infty$ extending over arbitrarily small neighborhoods of the distinguished points, and (b) the integrand in the localized integral can be replaced by a simpler one for which the asymptotics is on the one hand the same as that of the integral being investigated and on the other hand easy to find.

If, for example, the function $S(x)$ in the integral (19.1) has several local maxima $x_0, x_1, \ldots, x_n$ on the closed interval $[a, b]$, then, using the additivity of the integral, we replace it with small relative error by the sum of similar integrals taken over neighborhoods $U(x_j)$ of the maxima $x_0, x_1, \ldots, x_n$ so small that each contains only one such point. The asymptotic behavior of the integral

$$\int_{U(x_j)} f(x)e^{\lambda S(x)}\,dx \quad \text{as } \lambda \to +\infty,$$

as already mentioned, is independent of the size of the neighborhood $U(x_j)$ itself, and hence the asymptotic expansion of this integral as $\lambda \to +\infty$ is denoted $F(\lambda, x_j)$ and called the contribution of the point $x_j$ to the asymptotics of the integral (19.1).

In its general formulation the localization principle thus means that the asymptotic behavior of the integral (19.5) is obtained as the sum $\sum_j F(\lambda, x_j)$ of the contributions of all the points of the integrand that are critical in some respect.

For the integral (19.1) these points are the maxima of the function $S(x)$, and, as one can see from formulas (19.2)-(19.4), the main contribution comes entirely from the local maximum points at which the absolute maximum of $S(x)$ on $[a, b]$ is attained.

In the following subsections of this section we shall develop the general consid- 
erations stated here and then consider some useful applications of Laplace’s method. 
For many applications what we have already discussed is sufficient. It will also be 
shown below how to obtain not only the main term of the asymptotics, but also the 
entire asymptotic series. 


19.2.2 The Localization Principle for a Laplace Integral 

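Lemma 1 (Exponential estimate) Let $M = \sup_{a<x<b} S(x) < \infty$, and suppose that for some value $\lambda_0 > 0$ the integral (19.1) converges absolutely. Then it converges absolutely for every $\lambda \ge \lambda_0$ and the following estimate holds for such values of $\lambda$:

$$|F(\lambda)| \le \int_a^b \bigl| f(x)e^{\lambda S(x)} \bigr|\,dx \le Ae^{\lambda M}, \tag{19.6}$$

where $A \in \mathbb{R}$.

Proof Indeed, for $\lambda \ge \lambda_0$,

$$|F(\lambda)| = \Bigl| \int_a^b f(x)e^{\lambda S(x)}\,dx \Bigr| = \Bigl| \int_a^b f(x)e^{\lambda_0 S(x)}e^{(\lambda-\lambda_0)S(x)}\,dx \Bigr| \le e^{(\lambda-\lambda_0)M} \int_a^b \bigl| f(x)e^{\lambda_0 S(x)} \bigr|\,dx = Ae^{\lambda M},$$

where $A = e^{-\lambda_0 M} \int_a^b |f(x)e^{\lambda_0 S(x)}|\,dx$. □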

Lemma 2 (Estimate of the contribution of a maximum point) Suppose the integral (19.1) converges absolutely for some value $\lambda = \lambda_0$, and suppose that in the interior or on the boundary of the interval $I$ there is a point $x_0$ at which $S(x_0) = \sup_{a<x<b} S(x) = M$. If $f(x)$ and $S(x)$ are continuous at $x_0$ and $f(x_0) \ne 0$, then for every $\varepsilon > 0$ and every sufficiently small neighborhood $U_I(x_0)$ of $x_0$ in $I$ we have the estimate

$$\Bigl| \int_{U_I(x_0)} f(x)e^{\lambda S(x)}\,dx \Bigr| \ge Be^{\lambda(S(x_0)-\varepsilon)} \tag{19.7}$$

with a constant $B > 0$, valid for $\lambda \ge \max\{\lambda_0, 0\}$.


Proof For a fixed $\varepsilon > 0$ let us take any neighborhood $U_I(x_0)$ inside which $|f(x)| \ge \frac{1}{2}|f(x_0)|$ and $S(x_0) - \varepsilon \le S(x) \le S(x_0)$. Assuming that $f$ is real-valued, we can now conclude that $f$ is of constant sign inside $U_I(x_0)$. This enables us to write for $\lambda \ge \max\{\lambda_0, 0\}$

$$\Bigl| \int_{U_I(x_0)} f(x)e^{\lambda S(x)}\,dx \Bigr| = \int_{U_I(x_0)} |f(x)|e^{\lambda S(x)}\,dx \ge \int_{U_I(x_0)} \frac{1}{2}|f(x_0)|e^{\lambda(S(x_0)-\varepsilon)}\,dx = Be^{\lambda(S(x_0)-\varepsilon)}.$$

□


Proposition 1 (Localization principle) Suppose the integral (19.1) converges absolutely for a value $\lambda = \lambda_0$, and suppose that inside or on the boundary of the interval $I$ of integration the function $S(x)$ has a unique point $x_0$ of absolute maximum, that is, outside every neighborhood $U(x_0)$ of the point $x_0$ we have

$$\sup_{I \setminus U(x_0)} S(x) < S(x_0).$$

If the functions $f(x)$ and $S(x)$ are continuous at $x_0$ and $f(x_0) \ne 0$, then

$$F(\lambda) = F_{U_I(x_0)}(\lambda)\bigl(1 + O(\lambda^{-\infty})\bigr) \quad \text{as } \lambda \to +\infty, \tag{19.8}$$

where $U_I(x_0)$ is an arbitrary neighborhood of $x_0$ in $I$,

$$F_{U_I(x_0)}(\lambda) := \int_{U_I(x_0)} f(x)e^{\lambda S(x)}\,dx,$$

and $O(\lambda^{-\infty})$ denotes a function that is $o(\lambda^{-n})$ as $\lambda \to +\infty$ for every $n \in \mathbb{N}$.

Proof It follows from Lemma 2 that if the neighborhood $U_I(x_0)$ is sufficiently small, then the following inequality holds ultimately as $\lambda \to +\infty$ for every $\varepsilon > 0$:

$$\bigl| F_{U_I(x_0)}(\lambda) \bigr| > e^{\lambda(S(x_0)-\varepsilon)}. \tag{19.9}$$

At the same time, by Lemma 1 for every neighborhood $U(x_0)$ of the point $x_0$ we have the estimate

$$\int_{I \setminus U(x_0)} |f(x)|e^{\lambda S(x)}\,dx \le Ae^{\lambda\mu} \quad \text{as } \lambda \to +\infty, \tag{19.10}$$

where $A > 0$ and $\mu = \sup_{x \in I \setminus U(x_0)} S(x) < S(x_0)$.

Comparing this estimate with inequality (19.9), it is easy to conclude that inequality (19.9) holds ultimately as $\lambda \to +\infty$ for every neighborhood $U_I(x_0)$ of $x_0$. It now remains only to write

$$F(\lambda) = F_I(\lambda) = F_{U_I(x_0)}(\lambda) + F_{I \setminus U(x_0)}(\lambda),$$

and, citing estimates (19.9) and (19.10), conclude that (19.8) holds. □

Thus it is now established that with a relative error of the order $O(\lambda^{-\infty})$ as $\lambda \to +\infty$ when estimating the asymptotic behavior of the Laplace integral, one can replace it by the integral over an arbitrarily small neighborhood $U_I(x_0)$ of the point $x_0$ where the absolute maximum of $S(x)$ occurs on the interval $I$ of integration.
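The meaning of a relative error of order $O(\lambda^{-\infty})$ can be seen on a model case where everything is explicit. The sketch below is an illustration under assumptions not made in the text: it takes $f \equiv 1$, $S(x) = -x^2$ on $I = [-1, 1]$ and the neighborhood $U = \,]-\delta, \delta[$ with $\delta = 0.2$, and uses the identity $\int_{-c}^{c} e^{-\lambda x^2}\,dx = \sqrt{\pi/\lambda}\,\operatorname{erf}(c\sqrt{\lambda})$. The function name `relative_tail` is hypothetical.

```python
import math

def relative_tail(lam, delta):
    """(F - F_U) / F_U for F(lam) = int_{-1}^{1} e^{-lam x^2} dx and U = ]-delta, delta[.

    The difference erf(sqrt(lam)) - erf(delta*sqrt(lam)) is written through erfc
    to avoid cancellation between two numbers close to 1.
    """
    r = math.sqrt(lam)
    return (math.erfc(delta * r) - math.erfc(r)) / math.erf(delta * r)

delta = 0.2
for lam in (100.0, 400.0, 1600.0):
    rel = relative_tail(lam, delta)
    # Even after multiplication by lam**5 the relative error still tends to zero.
    print(lam, rel, rel * lam ** 5)
```

The same experiment with any other fixed power of $\lambda$ in place of $\lambda^5$ gives a sequence that eventually tends to zero, which is what $O(\lambda^{-\infty})$ expresses.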


19.2.3 Canonical Integrals and Their Asymptotics 

Lemma 3 (Canonical form of the function in the neighborhood of a critical point) If the real-valued function $S(x)$ has smoothness $C^{(n+k)}$ in a neighborhood (or one-sided neighborhood) of a point $x_0 \in \mathbb{R}$, and

$$S'(x_0) = \cdots = S^{(n-1)}(x_0) = 0, \quad S^{(n)}(x_0) \ne 0,$$

and $k \in \mathbb{N}$ or $k = \infty$, then there exist neighborhoods (or one-sided neighborhoods) $I_x$ of $x_0$ and $I_y$ of $0$ in $\mathbb{R}$ and a diffeomorphism $\varphi \in C^{(k)}(I_y, I_x)$ such that

$$S(\varphi(y)) = S(x_0) + sy^n, \quad \text{when } y \in I_y \text{ and } s = \operatorname{sgn} S^{(n)}(x_0).$$

Here

$$\varphi(0) = x_0 \quad \text{and} \quad \varphi'(0) = \Bigl( \frac{n!}{|S^{(n)}(x_0)|} \Bigr)^{1/n}.$$

Proof Using Taylor’s formula with the integral form of the remainder,

$$S(x) = S(x_0) + \frac{(x - x_0)^n}{(n - 1)!} \int_0^1 S^{(n)}(x_0 + t(x - x_0))(1 - t)^{n-1}\,dt,$$

we represent the difference $S(x) - S(x_0)$ in the form

$$S(x) - S(x_0) = (x - x_0)^n r(x),$$

where the function

$$r(x) = \frac{1}{(n - 1)!} \int_0^1 S^{(n)}(x_0 + t(x - x_0))(1 - t)^{n-1}\,dt,$$

by virtue of the theorem on differentiation of an integral with respect to the parameter $x$, belongs to class $C^{(k)}$, and $r(x_0) = \frac{1}{n!}S^{(n)}(x_0) \ne 0$. Hence the function $y = \psi(x) = (x - x_0)\sqrt[n]{|r(x)|}$ also belongs to $C^{(k)}$ in some neighborhood (or one-sided neighborhood) $I_x$ of $x_0$ and is even monotonic, since

$$\psi'(x_0) = \sqrt[n]{|r(x_0)|} = \Bigl( \frac{|S^{(n)}(x_0)|}{n!} \Bigr)^{1/n} \ne 0.$$

In this case the function $\psi$ on $I_x$ has an inverse $\psi^{-1} = \varphi$ defined on the interval $I_y = \psi(I_x)$ containing the point $0 = \psi(x_0)$. Here $\varphi \in C^{(k)}(I_y, I_x)$.

Further, $\varphi'(0) = (\psi'(x_0))^{-1} = \bigl( \frac{n!}{|S^{(n)}(x_0)|} \bigr)^{1/n}$. Finally, by construction $S(\varphi(y)) = S(x_0) + sy^n$, where $s = \operatorname{sgn} r(x_0) = \operatorname{sgn} S^{(n)}(x_0)$. □
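The construction in this proof can be carried out numerically. The following sketch is only an illustration under assumptions not in the text: it takes $S(x) = \cos x$, $x_0 = 0$, $n = 2$ (so $s = -1$), builds $\psi(x) = (x - x_0)\sqrt[n]{|r(x)|}$ directly from the definition of $r$, and inverts it by bisection on $[-1, 1]$, where $\psi$ is increasing. The helper names `make_psi` and `inverse` are hypothetical.

```python
import math

def make_psi(S, x0, n):
    """psi(x) = (x - x0) * |r(x)|**(1/n), where S(x) - S(x0) = (x - x0)**n * r(x)."""
    S0 = S(x0)
    def psi(x):
        if x == x0:
            return 0.0
        r = (S(x) - S0) / (x - x0) ** n
        return (x - x0) * abs(r) ** (1.0 / n)
    return psi

def inverse(psi, y, lo, hi, iters=100):
    """phi(y): solve psi(x) = y by bisection, assuming psi is increasing on [lo, hi]."""
    for _ in range(iters):
        mid = 0.5 * (lo + hi)
        if psi(mid) < y:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

S, x0, n = math.cos, 0.0, 2      # illustrative choice: S'(0) = 0, S''(0) = -1
psi = make_psi(S, x0, n)
phi = lambda y: inverse(psi, y, -1.0, 1.0)

for y in (-0.5, 0.1, 0.3):
    print(y, S(phi(y)), S(x0) - y ** n)   # the last two columns should agree

h = 1e-3
print((phi(h) - phi(-h)) / (2 * h), math.sqrt(2.0))  # phi'(0) against (2!/|S''(0)|)^(1/2)
```

In this example $\psi(x) = \sqrt{2}\sin(x/2)$ in closed form, so the numerical inverse can be compared with $\varphi(y) = 2\arcsin(y/\sqrt{2})$ if desired.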

Remark 1 The cases $n = 1$ or $n = 2$ and $k = 1$ or $k = \infty$ are usually the ones of most interest.

Proposition 2 (Reduction) Suppose the interval of integration $I = [a, b]$ in the integral (19.1) is finite and the following conditions hold:

a) $f, S \in C(I, \mathbb{R})$;

b) $\max_{x \in I} S(x)$ is attained only at the one point $x_0 \in I$;

c) $S \in C^{(n)}(U_I(x_0), \mathbb{R})$ in some neighborhood $U_I(x_0)$ of $x_0$ (inside the interval $I$);

d) $S^{(n)}(x_0) \ne 0$ and if $1 < n$, then $S^{(1)}(x_0) = \cdots = S^{(n-1)}(x_0) = 0$.

Then as $\lambda \to +\infty$ the integral (19.1) can be replaced by an integral of the form

$$R(\lambda) = e^{\lambda S(x_0)} \int_{I_y} r(y)e^{-\lambda y^n}\,dy$$

with a relative error defined by the localization principle (19.8), where $I_y = [-\varepsilon, \varepsilon]$ or $I_y = [0, \varepsilon]$, $\varepsilon$ is an arbitrarily small positive number, and the function $r$ has the same degree of smoothness on $I_y$ that $f$ has in a neighborhood of $x_0$.

Proof Using the localization principle, we replace the integral (19.1) with the integral over a neighborhood $I_x = U_I(x_0)$ of $x_0$ in which the hypotheses of Lemma 3 hold. Making the change of variable $x = \varphi(y)$, we obtain

$$\int_{I_x} f(x)e^{\lambda S(x)}\,dx = \Bigl( \int_{I_y} f(\varphi(y))\varphi'(y)e^{-\lambda y^n}\,dy \Bigr) e^{\lambda S(x_0)}. \tag{19.11}$$

The negative sign in the exponent $(-\lambda y^n)$ comes from the fact that by hypothesis $x_0 = \varphi(0)$ is a maximum. □


The asymptotic behavior of the canonical integrals to which the Laplace integral 
(19.1) reduces in the main cases is given by the following lemma. 

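Lemma 4 (Watson³) Let $\alpha > 0$, $\beta > 0$, $0 < a \le \infty$, and $f \in C([0, a], \mathbb{R})$. Then with respect to the asymptotics of the integral

$$W(\lambda) = \int_0^a x^{\beta-1} f(x)e^{-\lambda x^{\alpha}}\,dx \tag{19.12}$$

as $\lambda \to +\infty$, the following assertions hold:

a) The main term of the asymptotics of (19.12) has the form

$$W(\lambda) = \frac{1}{\alpha} f(0)\Gamma(\beta/\alpha)\lambda^{-\frac{\beta}{\alpha}} + O\bigl(\lambda^{-\frac{\beta+1}{\alpha}}\bigr), \tag{19.13}$$

if it is known that $f(x) = f(0) + O(x)$ as $x \to 0$.

b) If $f(x) = a_0 + a_1 x + \cdots + a_n x^n + O(x^{n+1})$ as $x \to 0$, then

$$W(\lambda) = \frac{1}{\alpha} \sum_{k=0}^{n} a_k \Gamma\Bigl( \frac{k + \beta}{\alpha} \Bigr) \lambda^{-\frac{k+\beta}{\alpha}} + O\bigl(\lambda^{-\frac{n+\beta+1}{\alpha}}\bigr). \tag{19.14}$$

c) If $f$ is infinitely differentiable at $x = 0$, then the following asymptotic expansion holds:

$$W(\lambda) \simeq \frac{1}{\alpha} \sum_{k=0}^{\infty} \frac{f^{(k)}(0)}{k!} \Gamma\Bigl( \frac{k + \beta}{\alpha} \Bigr) \lambda^{-\frac{k+\beta}{\alpha}}, \tag{19.15}$$

which can be differentiated any number of times with respect to $\lambda$.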


³ G.N. Watson (1886-1965) - British mathematician.

Proof We represent the integral (19.12) as a sum of integrals over the intervals $]0, \varepsilon]$ and $[\varepsilon, a[$, where $\varepsilon$ is an arbitrarily small positive number.

By Lemma 1

$$\Bigl| \int_{\varepsilon}^{a} x^{\beta-1} f(x)e^{-\lambda x^{\alpha}}\,dx \Bigr| \le Ae^{-\lambda\varepsilon^{\alpha}} = O(\lambda^{-\infty}) \quad \text{as } \lambda \to +\infty,$$

and therefore

$$W(\lambda) = \int_0^{\varepsilon} x^{\beta-1} f(x)e^{-\lambda x^{\alpha}}\,dx + O(\lambda^{-\infty}) \quad \text{as } \lambda \to +\infty.$$

In case b) we have $f(x) = \sum_{k=0}^{n} a_k x^k + r_n(x)$, where $r_n \in C[0, \varepsilon]$ and $|r_n(x)| \le Cx^{n+1}$ on the interval $[0, \varepsilon]$. Hence

$$W(\lambda) = \sum_{k=0}^{n} a_k \int_0^{\varepsilon} x^{k+\beta-1}e^{-\lambda x^{\alpha}}\,dx + c(\lambda) \int_0^{\varepsilon} x^{n+\beta}e^{-\lambda x^{\alpha}}\,dx + o(\lambda^{-\infty}),$$

where $c(\lambda)$ is bounded as $\lambda \to +\infty$.

By Lemma 1, as $\lambda \to +\infty$,

$$\int_0^{\varepsilon} x^{k+\beta-1}e^{-\lambda x^{\alpha}}\,dx = \int_0^{+\infty} x^{k+\beta-1}e^{-\lambda x^{\alpha}}\,dx + O(\lambda^{-\infty}).$$

But

$$\int_0^{+\infty} x^{k+\beta-1}e^{-\lambda x^{\alpha}}\,dx = \frac{1}{\alpha}\Gamma\Bigl( \frac{k + \beta}{\alpha} \Bigr) \lambda^{-\frac{k+\beta}{\alpha}},$$

from which formula (19.14) and the special case of it, formula (19.13), now follow.

The expansion (19.15) now follows from (19.14) and Taylor’s formula.

The possibility of differentiating (19.15) with respect to $\lambda$ follows from the fact that the derivative of the integral (19.12) with respect to the parameter $\lambda$ is an integral of the same type as (19.12) and for $W'(\lambda)$ one can use formula (19.15) to present explicitly an asymptotic expansion as $\lambda \to +\infty$ that is the same as the one obtained by formal differentiation of the original expansion (19.15). □
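Formula (19.14) can be checked on a concrete integral. The sketch below is illustrative only and rests on assumptions not in the text: it takes $\alpha = 2$, $\beta = 1$, $a = 1$, $f(x) = 1/(1 + x)$ (so $a_k = (-1)^k$), approximates $W(\lambda)$ by a composite Simpson rule, and compares the error of the partial sum with the size of the first omitted term. For this $f$ the remainder satisfies $|r_n(x)| \le x^{n+1}$, so the error should stay below that term up to exponentially small corrections.

```python
import math

def simpson(g, a, b, n=20000):
    """Composite Simpson rule on [a, b] with an even number n of subintervals."""
    h = (b - a) / n
    total = g(a) + g(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * g(a + i * h)
    return total * h / 3

def watson_partial_sum(coeffs, alpha, beta, lam):
    """Finite sum in (19.14): (1/alpha) sum_k a_k Gamma((k+beta)/alpha) lam^(-(k+beta)/alpha)."""
    return sum(a_k * math.gamma((k + beta) / alpha) * lam ** (-(k + beta) / alpha)
               for k, a_k in enumerate(coeffs)) / alpha

alpha, beta, upper, lam, n = 2.0, 1.0, 1.0, 50.0, 3   # illustrative parameters
f = lambda x: 1.0 / (1.0 + x)                          # Taylor coefficients a_k = (-1)^k
coeffs = [(-1.0) ** k for k in range(n + 1)]

W = simpson(lambda x: x ** (beta - 1.0) * f(x) * math.exp(-lam * x ** alpha), 0.0, upper)
err = abs(W - watson_partial_sum(coeffs, alpha, beta, lam))
# Size of the first omitted term, i.e. the O(lam^(-(n+beta+1)/alpha)) scale in (19.14).
bound = math.gamma((n + 1 + beta) / alpha) * lam ** (-(n + 1 + beta) / alpha) / alpha
print(W, err, bound)
```

With $\beta = 1$ the factor $x^{\beta-1}$ is identically 1, which keeps the integrand smooth at $x = 0$; for $\beta < 1$ a plain Simpson rule would not be appropriate because of the endpoint singularity.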


Example 6 Consider the Laplace transform

$$F(\lambda) = \int_0^{+\infty} f(x)e^{-\lambda x}\,dx,$$

which we have already encountered in Example 1. If this integral converges absolutely for some value $\lambda = \lambda_0$ and the function $f$ is infinitely differentiable at $x = 0$, then by formula (19.15) we find that

$$F(\lambda) \simeq \sum_{k=0}^{\infty} f^{(k)}(0)\lambda^{-(k+1)}$$

as $\lambda \to +\infty$.
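The expansion in Example 6 is asymptotic and need not converge. The following sketch illustrates this under assumptions not in the text: it takes $f(x) = 1/(1 + x)$, for which $f^{(k)}(0) = (-1)^k k!$, fixes $\lambda = 10$, computes $F(\lambda)$ by a composite Simpson rule on $[0, 5]$ (beyond $x = 5$ the integrand is below $e^{-50}$), and lists the errors of the partial sums. The errors first decrease and then grow without bound as more terms are added.

```python
import math

def simpson(g, a, b, n=20000):
    """Composite Simpson rule on [a, b] with an even number n of subintervals."""
    h = (b - a) / n
    total = g(a) + g(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * g(a + i * h)
    return total * h / 3

lam = 10.0   # illustrative value of the parameter
F = simpson(lambda x: math.exp(-lam * x) / (1.0 + x), 0.0, 5.0)

def partial_sum(N, lam):
    """sum_{k=0}^{N} f^{(k)}(0) lam^{-(k+1)} with f^{(k)}(0) = (-1)^k k! for f(x) = 1/(1+x)."""
    return sum((-1) ** k * math.factorial(k) * lam ** (-(k + 1)) for k in range(N + 1))

errors = [abs(F - partial_sum(N, lam)) for N in range(31)]
best = min(range(31), key=lambda N: errors[N])
print(F, best, errors[best], errors[30])
```

For a fixed number of terms the error still tends to zero as $\lambda \to +\infty$, which is all that the definition of an asymptotic expansion requires.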
19.2.4 The Principal Term of the Asymptotics of a Laplace Integral

Theorem 1 (A typical principal term of the asymptotics) Suppose the interval of integration $I = [a, b]$ in the integral (19.1) is finite, $f, S \in C(I, \mathbb{R})$, and $\max_{x \in I} S(x)$ is attained only at one point $x_0 \in I$.

Suppose it is also known that $f(x_0) \ne 0$, $f(x) = f(x_0) + O(x - x_0)$ for $I \ni x \to x_0$, and the function $S$ belongs to $C^{(k)}$ in a neighborhood of $x_0$.

The following statements hold.

a) If $x_0 = a$, $k = 2$, and $S'(x_0) \ne 0$ (that is, $S'(x_0) < 0$), then

$$F(\lambda) = \frac{f(x_0)}{-S'(x_0)} e^{\lambda S(x_0)}\lambda^{-1}\bigl[1 + O(\lambda^{-1})\bigr] \quad \text{as } \lambda \to +\infty; \tag{19.2'}$$

b) if $a < x_0 < b$, $k = 3$, and $S''(x_0) \ne 0$ (that is, $S''(x_0) < 0$), then

$$F(\lambda) = \sqrt{\frac{2\pi}{-S''(x_0)}}\, f(x_0)e^{\lambda S(x_0)}\lambda^{-1/2}\bigl[1 + O(\lambda^{-1/2})\bigr] \quad \text{as } \lambda \to +\infty; \tag{19.3'}$$

c) if $x_0 = a$, $k = 3$, $S'(a) = 0$, and $S''(a) \ne 0$ (that is, $S''(a) < 0$), then

$$F(\lambda) = \sqrt{\frac{\pi}{-2S''(x_0)}}\, f(x_0)e^{\lambda S(x_0)}\lambda^{-1/2}\bigl[1 + O(\lambda^{-1/2})\bigr] \quad \text{as } \lambda \to +\infty. \tag{19.4'}$$


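Proof Using the localization principle and making the change of variable $x = \varphi(y)$ shown in Lemma 3, according to the reduction in Proposition 2, we arrive at the following relations:

a) $F(\lambda) = e^{\lambda S(x_0)} \Bigl( \int_0^{\varepsilon} (f \circ \varphi)(y)\varphi'(y)e^{-\lambda y}\,dy + O(\lambda^{-\infty}) \Bigr)$;

b) $F(\lambda) = e^{\lambda S(x_0)} \Bigl( \int_{-\varepsilon}^{\varepsilon} (f \circ \varphi)(y)\varphi'(y)e^{-\lambda y^2}\,dy + O(\lambda^{-\infty}) \Bigr) =$

$= e^{\lambda S(x_0)} \Bigl( \int_0^{\varepsilon} \bigl( (f \circ \varphi)(y)\varphi'(y) + (f \circ \varphi)(-y)\varphi'(-y) \bigr)e^{-\lambda y^2}\,dy + O(\lambda^{-\infty}) \Bigr)$;

c) $F(\lambda) = e^{\lambda S(x_0)} \Bigl( \int_0^{\varepsilon} (f \circ \varphi)(y)\varphi'(y)e^{-\lambda y^2}\,dy + O(\lambda^{-\infty}) \Bigr)$.

Under the requirements stated above, the function $(f \circ \varphi)\varphi'$ satisfies all the hypotheses of Watson’s lemma. It now remains only to apply Watson’s lemma (formula (19.14) for $n = 0$) and to recall the expressions for $\varphi(0)$ and $\varphi'(0)$ indicated in Lemma 3. □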


Thus we have justified formulas (19.2)-(19.4) together with the remarkably simple, clear and effective recipe that led us to these formulas in Sect. 19.2.1.

Now let us consider some examples of the application of this theorem.


Example 7 The asymptotics of the gamma function. The function

$$\Gamma(\lambda + 1) = \int_0^{+\infty} t^{\lambda}e^{-t}\,dt \quad (\lambda > -1)$$

can be represented as a Laplace integral

$$\Gamma(\lambda + 1) = \int_0^{+\infty} e^{-t}e^{\lambda \ln t}\,dt,$$

and if for $\lambda > 0$ we make the change of variable $t = \lambda x$, we arrive at the integral

$$\Gamma(\lambda + 1) = \lambda^{\lambda+1} \int_0^{+\infty} e^{-\lambda(x - \ln x)}\,dx,$$

which can be studied using the methods of the theorem.

The function $S(x) = \ln x - x$ has a unique maximum $x = 1$ on the interval $]0, +\infty[$, and $S''(1) = -1$. By the localization principle (Proposition 1) and assertion b) of Theorem 1, we conclude that

$$\Gamma(\lambda + 1) = \sqrt{2\pi\lambda}\Bigl( \frac{\lambda}{e} \Bigr)^{\lambda}\bigl[1 + O(\lambda^{-1/2})\bigr] \quad \text{as } \lambda \to +\infty.$$

In particular, recalling that $\Gamma(n + 1) = n!$ for $n \in \mathbb{N}$, we obtain the classical Stirling’s formula⁴

$$n! = \sqrt{2\pi n}\,(n/e)^n\bigl[1 + O(n^{-1/2})\bigr] \quad \text{as } n \to \infty,\ n \in \mathbb{N}.$$
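The quality of the principal term for the gamma function can be observed directly. The sketch below is an illustration only; the function name `stirling_ratio` is hypothetical, and logarithms are used (through `math.lgamma`) so that large values of $\lambda$ do not overflow. The observed relative error behaves like $1/(12\lambda)$, a known refinement of Stirling's formula, which is consistent with, and sharper than, the $O(\lambda^{-1/2})$ estimate supplied by Theorem 1.

```python
import math

def stirling_ratio(lam):
    """Gamma(lam + 1) / (sqrt(2 pi lam) (lam / e)^lam), computed through logarithms."""
    log_stirling = 0.5 * math.log(2.0 * math.pi * lam) + lam * (math.log(lam) - 1.0)
    return math.exp(math.lgamma(lam + 1.0) - log_stirling)

for lam in (10.0, 100.0, 1000.0):
    r = stirling_ratio(lam)
    # (r - 1) * lam should settle near 1/12 = 0.0833...
    print(lam, r, (r - 1.0) * lam)
```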

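Example 8 The asymptotics of the Bessel function

$$I_n(x) = \frac{1}{\pi} \int_0^{\pi} e^{x\cos\theta}\cos n\theta\,d\theta,$$

where $n \in \mathbb{N}$. Here $f(\theta) = \cos n\theta$, $S(\theta) = \cos\theta$, $\max_{0 \le \theta \le \pi} S(\theta) = S(0) = 1$, $S'(0) = 0$, and $S''(0) = -1$, so that by assertion c) of Theorem 1

$$I_n(x) = \frac{e^x}{\sqrt{2\pi x}}\bigl[1 + O(x^{-1/2})\bigr] \quad \text{as } x \to +\infty.$$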
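The asymptotic formula of Example 8 can be compared with a direct evaluation of the integral. This is a sketch under assumptions not in the text: the integral is approximated by a composite Simpson rule, and the factor $e^{-x}$ is moved inside the integrand to avoid large intermediate values. The helper names are hypothetical.

```python
import math

def simpson(g, a, b, n=20000):
    """Composite Simpson rule on [a, b] with an even number n of subintervals."""
    h = (b - a) / n
    total = g(a) + g(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * g(a + i * h)
    return total * h / 3

def bessel_I_scaled(n, x):
    """e^{-x} I_n(x) from the representation (1/pi) int_0^pi e^{x cos t} cos(n t) dt."""
    g = lambda t: math.exp(x * (math.cos(t) - 1.0)) * math.cos(n * t)
    return simpson(g, 0.0, math.pi) / math.pi

def ratio(n, x):
    """I_n(x) divided by the principal term e^x / sqrt(2 pi x)."""
    return bessel_I_scaled(n, x) * math.sqrt(2.0 * math.pi * x)

for x in (50.0, 200.0, 800.0):
    print(x, ratio(0, x), ratio(1, x))
```

The printed ratios should move toward 1 as $x$ increases, for each fixed order $n$.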

Example 9 Let $f \in C^{(1)}([a, b], \mathbb{R})$, $S \in C^{(2)}([a, b], \mathbb{R})$, with $S(x) > 0$ on $[a, b]$, and $\max_{a \le x \le b} S(x)$ is attained only at the one point $x_0 \in [a, b]$. If $f(x_0) \ne 0$, $S'(x_0) = 0$, and $S''(x_0) \ne 0$, then, rewriting the integral

$$F(\lambda) = \int_a^b f(x)[S(x)]^{\lambda}\,dx$$

in the form of a Laplace integral

$$F(\lambda) = \int_a^b f(x)e^{\lambda \ln S(x)}\,dx,$$

on the basis of assertions b) and c) of Theorem 1, we find that as $\lambda \to +\infty$

$$F(\lambda) = \varepsilon f(x_0)\sqrt{\frac{2\pi}{-S''(x_0)}}\,[S(x_0)]^{\lambda+1/2}\lambda^{-1/2}\bigl[1 + O(\lambda^{-1/2})\bigr],$$

where $\varepsilon = 1$ if $a < x_0 < b$ and $\varepsilon = 1/2$ if $x_0 = a$ or $x_0 = b$.

⁴ See also Problem 10 of Sect. 7.3.

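Example 10 The asymptotics of the Legendre polynomials

$$P_n(x) = \frac{1}{\pi} \int_0^{\pi} \bigl( x + \sqrt{x^2 - 1}\cos\theta \bigr)^n\,d\theta$$

in the domain $x > 1$ as $n \to \infty$, $n \in \mathbb{N}$, can be obtained as a special case of the preceding example when $f \equiv 1$,

$$S(\theta) = x + \sqrt{x^2 - 1}\cos\theta, \quad \max_{0 \le \theta \le \pi} S(\theta) = S(0) = x + \sqrt{x^2 - 1},$$

$$S'(0) = 0, \quad S''(0) = -\sqrt{x^2 - 1}.$$

Thus,

$$P_n(x) = \frac{\bigl( x + \sqrt{x^2 - 1} \bigr)^{n+1/2}}{\sqrt{2\pi n}\,\sqrt[4]{x^2 - 1}}\bigl[1 + O(n^{-1/2})\bigr] \quad \text{as } n \to +\infty,\ n \in \mathbb{N}.$$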
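The asymptotic formula for the Legendre polynomials can be compared with exact values. The sketch below is an illustration under assumptions not in the text: $P_n(x)$ is computed by the standard three-term recurrence $(k + 1)P_{k+1}(x) = (2k + 1)xP_k(x) - kP_{k-1}(x)$ rather than by the integral, the point $x = 2$ is an arbitrary choice, and $n$ is kept small enough that the values fit into double precision. The function names are hypothetical.

```python
import math

def legendre(n, x):
    """P_n(x) by the recurrence (k+1) P_{k+1} = (2k+1) x P_k - k P_{k-1}."""
    p_prev, p = 1.0, x
    if n == 0:
        return p_prev
    for k in range(1, n):
        p_prev, p = p, ((2 * k + 1) * x * p - k * p_prev) / (k + 1)
    return p

def principal_term(n, x):
    """(x + sqrt(x^2-1))^(n+1/2) / (sqrt(2 pi n) (x^2-1)^(1/4)), for x > 1."""
    root = math.sqrt(x * x - 1.0)
    return (x + root) ** (n + 0.5) / (math.sqrt(2.0 * math.pi * n) * math.sqrt(root))

x = 2.0   # illustrative point in the domain x > 1
for n in (25, 100, 400):
    print(n, legendre(n, x) / principal_term(n, x))
```

The ratios should approach 1 as $n$ grows; much larger $n$ at this $x$ would require working with logarithms, since $(x + \sqrt{x^2 - 1})^n$ eventually overflows.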


19.2.5 * Asymptotic Expansions of Laplace Integrals 

Theorem 1 gives only the principal terms of the characteristic asymptotics of a Laplace integral (19.1) and even that under the condition that $f(x_0) \ne 0$. On the whole this is of course a typical situation, and for that reason Theorem 1 is undoubtedly a valuable result. However, Watson’s lemma shows that the asymptotics of a Laplace integral can sometimes be brought to an asymptotic expansion. Such a possibility is especially important when $f(x_0) = 0$ and Theorem 1 gives no result.

It is naturally impossible to get rid of the hypothesis $f(x_0) \ne 0$ completely, without replacing it with anything, while remaining within the limits of Laplace’s method: after all, if $f(x) \equiv 0$ in a neighborhood of a maximum $x_0$ of the function $S(x)$ or if $f(x)$ tends to zero very rapidly as $x \to x_0$, then $x_0$ may not be responsible for the asymptotics of the integral. Now that we have arrived at a certain type of asymptotic sequence $\{e^{\lambda c}\lambda^{-p_k}\}$ $(p_0 < p_1 < \cdots)$ as $\lambda \to +\infty$ as a result of the considerations we have studied, we can speak of an asymptotic zero in relation to such a sequence and, without assuming that $f(x_0) \ne 0$, we can state the localization principle as follows: Up to an asymptotic zero with respect to the asymptotic sequence $\{e^{\lambda S(x_0)}\lambda^{-p_k}\}$ $(p_0 < p_1 < \cdots)$ the asymptotic behavior of the Laplace integral (19.1) as $\lambda \to +\infty$ equals the asymptotics of the portion of this integral taken over an arbitrarily small neighborhood of the point $x_0$, provided that this point is the unique maximum of the function $S(x)$ on the interval of integration.

However, we shall not go back and re-examine these questions in order to sharpen them. Rather, assuming $f$ and $S$ belong to $C^{(\infty)}$, we shall give a derivation of the corresponding asymptotic expansions using Lemma 1 on the exponential estimate, Lemma 3 on the change of variable, and Watson’s lemma (Lemma 4).

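Theorem 2 (Asymptotic expansion) Let $I = [a, b]$ be a finite interval, $f, S \in C(I, \mathbb{R})$, and assume $\max_{x \in I} S(x)$ is attained only at the point $x_0 \in I$ and $f, S \in C^{(\infty)}(U_I(x_0), \mathbb{R})$ in some neighborhood $U_I(x_0)$ of $x_0$. Then in relation to the asymptotics of the integral (19.1) the following assertions hold.

a) If $x_0 = a$, $S^{(m)}(a) \ne 0$, $S^{(j)}(a) = 0$ for $1 \le j < m$, then

$$F(\lambda) \simeq \lambda^{-1/m}e^{\lambda S(a)} \sum_{k=0}^{\infty} a_k \lambda^{-k/m} \quad \text{as } \lambda \to +\infty, \tag{19.16}$$

where

$$a_k = \frac{(-1)^{k+1}m^k}{k!}\,\Gamma\Bigl( \frac{k + 1}{m} \Bigr) \Bigl( h(x, a)\frac{d}{dx} \Bigr)^k \bigl( f(x)h(x, a) \bigr)\Big|_{x=a},$$

$$h(x, a) = (S(a) - S(x))^{1-\frac{1}{m}} / S'(x).$$

b) If $a < x_0 < b$, $S^{(2m)}(x_0) \ne 0$, and $S^{(j)}(x_0) = 0$ for $1 \le j < 2m$, then

$$F(\lambda) \simeq \lambda^{-1/2m}e^{\lambda S(x_0)} \sum_{k=0}^{\infty} c_k \lambda^{-k/m} \quad \text{as } \lambda \to +\infty, \tag{19.17}$$

where

$$c_k = 2\,\frac{(-1)^{2k+1}(2m)^{2k}}{(2k)!}\,\Gamma\Bigl( \frac{2k + 1}{2m} \Bigr) \Bigl( h(x, x_0)\frac{d}{dx} \Bigr)^{2k} \bigl( f(x)h(x, x_0) \bigr)\Big|_{x=x_0},$$

$$h(x, x_0) = (S(x_0) - S(x))^{1-\frac{1}{2m}} / S'(x).$$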


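c) If $f^{(n)}(x_0) \ne 0$ and $f(x) \sim \frac{1}{n!}f^{(n)}(x_0)(x - x_0)^n$ as $x \to x_0$, then the main term of the asymptotics in the cases a) and b) respectively has the form

$$F(\lambda) = \frac{1}{m}\lambda^{-\frac{n+1}{m}}e^{\lambda S(a)}\,\Gamma\Bigl( \frac{n + 1}{m} \Bigr) \Bigl( \frac{m!}{|S^{(m)}(a)|} \Bigr)^{\frac{n+1}{m}} \Bigl[ \frac{1}{n!}f^{(n)}(a) + O\bigl(\lambda^{-\frac{1}{m}}\bigr) \Bigr], \tag{19.18}$$

$$F(\lambda) = \frac{1}{m}\lambda^{-\frac{n+1}{2m}}e^{\lambda S(x_0)}\,\Gamma\Bigl( \frac{n + 1}{2m} \Bigr) \Bigl( \frac{(2m)!}{|S^{(2m)}(x_0)|} \Bigr)^{\frac{n+1}{2m}} \Bigl[ \frac{1}{n!}f^{(n)}(x_0) + O\bigl(\lambda^{-\frac{1}{2m}}\bigr) \Bigr]. \tag{19.19}$$

d) The expansions (19.16) and (19.17) can be differentiated with respect to $\lambda$ any number of times.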


Proof It follows from Lemma 1 that under these hypotheses the integral (19.1) can be replaced by an integral over an arbitrarily small neighborhood of $x_0$ up to a quantity of the form $e^{\lambda S(x_0)}O(\lambda^{-\infty})$ as $\lambda \to +\infty$.

Making the change of variable $x = \varphi(y)$ from Lemma 3 in such a neighborhood, we bring the last integral into the form

$$e^{\lambda S(x_0)} \int_{I_y} (f \circ \varphi)(y)\varphi'(y)e^{-\lambda y^{\alpha}}\,dy, \tag{19.20}$$

where $I_y = [0, \varepsilon]$, $\alpha = m$ if $x_0 = a$, and $I_y = [-\varepsilon, \varepsilon]$, $\alpha = 2m$ if $a < x_0 < b$.

The neighborhood in which the change of variable $x = \varphi(y)$ took place can be assumed so small that both functions $f$ and $S$ are infinitely differentiable in it. Then the resulting integrand $(f \circ \varphi)(y)\varphi'(y)$ in the integral (19.20) can also be assumed infinitely differentiable.

If $I_y = [0, \varepsilon]$, that is, when $x_0 = a$, Watson’s lemma is immediately applicable to the integral (19.20) and the existence of the expansion (19.16) is thereby proved.

If $I_y = [-\varepsilon, \varepsilon]$, that is, in the case $a < x_0 < b$, we bring the integral (19.20) into the form

$$e^{\lambda S(x_0)} \int_0^{\varepsilon} \bigl[ (f \circ \varphi)(y)\varphi'(y) + (f \circ \varphi)(-y)\varphi'(-y) \bigr] e^{-\lambda y^{2m}}\,dy, \tag{19.21}$$

and, once again applying Watson’s lemma, we obtain the expansion (19.17).

The possibility of differentiating the expansions (19.16) and (19.17) follows from the fact that under our assumptions the integral (19.1) can be differentiated with respect to $\lambda$ to yield a new integral satisfying the hypotheses of the theorem. We write out the expansions (19.16) and (19.17) for it, and we can verify immediately that these expansions really are the same as those obtained by formal differentiation of the expansions (19.16) and (19.17) for the original integrals.

We now take up the formulas for the coefficients $a_k$ and $c_k$. By Watson’s lemma $a_k = \frac{1}{m}\frac{1}{k!}\frac{d^k\Phi}{dy^k}(0)\,\Gamma\bigl( \frac{k+1}{m} \bigr)$, where $\Phi(y) = (f \circ \varphi)(y)\varphi'(y)$.

However, taking account of the relations

$$S(\varphi(y)) - S(a) = -y^m,$$

$$S'(x)\varphi'(y) = -my^{m-1},$$

$$\varphi'(y) = -m(S(a) - S(x))^{1-\frac{1}{m}} / S'(x),$$

$$\frac{d}{dy} = \varphi'(y)\frac{d}{dx},$$

$$\Phi(y) = f(x)\varphi'(y),$$

we obtain

$$\frac{d^k\Phi}{dy^k}(0) = (-m)^{k+1} \Bigl( h(x, a)\frac{d}{dx} \Bigr)^k \bigl( f(x)h(x, a) \bigr)\Big|_{x=a},$$

where $h(x, a) = (S(a) - S(x))^{1-\frac{1}{m}} / S'(x)$.

Formulas for the coefficients $c_k$ can be obtained similarly by applying Watson’s lemma to the integral (19.21).

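Setting $\psi(y) = f(\varphi(y))\varphi'(y) + f(\varphi(-y))\varphi'(-y)$, we can write, as $\lambda \to +\infty$,

$$\int_0^{\varepsilon} \psi(y)\,e^{-\lambda y^{2m}}\,dy \simeq \frac{1}{2m} \sum_{n=0}^{\infty} \frac{\psi^{(n)}(0)}{n!}\,\Gamma\Bigl(\frac{n+1}{2m}\Bigr)\lambda^{-\frac{n+1}{2m}}.$$

But $\psi^{(2k+1)}(0) = 0$ since $\psi(y)$ is an even function; therefore this last asymptotic
expansion can be rewritten as

$$\int_0^{\varepsilon} \psi(y)\,e^{-\lambda y^{2m}}\,dy \simeq \frac{1}{2m} \sum_{k=0}^{\infty} \frac{\psi^{(2k)}(0)}{(2k)!}\,\Gamma\Bigl(\frac{2k+1}{2m}\Bigr)\lambda^{-\frac{2k+1}{2m}}.$$

It remains only to note that $\psi^{(2k)}(0) = 2\Phi^{(2k)}(0)$, where $\Phi(y) = f(\varphi(y))\varphi'(y)$.
The formula for $c_k$ can now be obtained from the already established formula for $a_k$
by replacing $k$ with $2k$ and doubling the result of the substitution.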

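To obtain the principal terms (19.18) and (19.19) in the asymptotic expansions
(19.16) and (19.17) under the condition $f(x) = \frac{1}{n!} f^{(n)}(x_0)(x - x_0)^n + O\bigl((x - x_0)^{n+1}\bigr)$
indicated in c), where $f^{(n)}(x_0) \neq 0$, it suffices to recall that $x = \varphi(y)$,
$x_0 = \varphi(0)$, $x - x_0 = \varphi'(0)y + O(y^2)$, that is,

$$(f \circ \varphi)(y) = y^n \Bigl(\frac{f^{(n)}(x_0)}{n!}\bigl(\varphi'(0)\bigr)^{n} + O(y)\Bigr)$$

and

$$(f \circ \varphi)(y)\varphi'(y) = y^n \Bigl(\frac{f^{(n)}(x_0)}{n!}\bigl(\varphi'(0)\bigr)^{n+1} + O(y)\Bigr)$$

as $y \to 0$, since $\varphi'(0) = \bigl(\frac{m!}{|S^{(m)}(a)|}\bigr)^{1/m} \neq 0$ if $x_0 = a$ and
$\varphi'(0) = \bigl(\frac{(2m)!}{|S^{(2m)}(x_0)|}\bigr)^{1/2m} \neq 0$ if $a < x_0 < b$.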

It now remains only to substitute these expressions into the integrals (19.20) and 
(19.21) respectively and use formula (19.13) from Watson’s lemma. □ 


Remark 2 We again get formula (19.2') from formula (19.18) when $n = 0$ and
$m = 1$.
Similarly, when $n = 0$ and $m = 1$ formula (19.19) yields (19.3') again.

Finally, Eq. (19.4') comes from Eq. (19.18) with n = 0 and m = 2. 

All of this, of course, assumes the hypotheses of Theorem 2. 

Remark 3 Theorem 2 applies to the case where the function $S(x)$ has a unique
maximum on the interval $I = [a, b]$. If there are several such points $x_1, \ldots, x_n$, the
integral (19.1) is partitioned into a sum of such integrals, each of whose asymptotics is
described by Theorem 2. That is, in this case the asymptotic behavior is obtained as
the sum $\sum_{j=1}^{n} F(\lambda, x_j)$ of the contributions of these maximum points.

It is easy to see that when this happens, some or even all of the terms may cancel 
one another. 

Example 11 If $S \in C^{(\infty)}(\mathbb{R}, \mathbb{R})$ and $S(x) \to -\infty$ as $x \to \infty$, then



Hence, in this case such an interference of the contributions must necessarily occur. 
From the formal point of view this example may seem unconvincing, since previ- 
ously we had been considering the case of a finite interval of integration. However, 
those doubts are removed by the following important remark. 

Remark 4 To simplify what were already very cumbersome statements in
Theorems 1 and 2 we assumed that the interval of integration $I$ was finite and
that the integral (19.1) was a proper integral. In fact, however, if the inequality
$\sup_{I \setminus U(x_0)} S(x) < S(x_0)$ holds for a neighborhood $U(x_0)$ of the maximum point
$x_0 \in I$, then Lemma 1 enables us to conclude that the integrals over intervals strictly
outside of $U(x_0)$ are exponentially small in comparison with $e^{\lambda S(x_0)}$ as $\lambda \to +\infty$
(naturally, under the assumption that the integral (19.1) converges absolutely for at
least one value $\lambda = \lambda_0$).

Thus both Theorem 1 and Theorem 2 are also applicable to improper integrals if 
the conditions just mentioned are met. 

Remark 5 Due to their cumbersome nature, the formulas for the coefficients ob- 
tained in Theorem 2 can normally be used only for obtaining the first few terms of 
the asymptotics needed in specific computations. It is extremely rare that one can 
obtain the general form of the asymptotic expansion of even a simpler function than 
appears in Theorem 2 from these formulas for the coefficients $a_k$ and $c_k$. Nevertheless,
such situations do arise. To clarify the formulas themselves, let us consider the
following examples. 

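Example 12 The asymptotic behavior of the function

$$\operatorname{Erf}(x) = \int_x^{+\infty} e^{-u^2}\,du$$

as $x \to +\infty$ is easy to obtain through integration by parts:

$$\operatorname{Erf}(x) = \frac{e^{-x^2}}{2x} - \frac{1}{2}\int_x^{+\infty} u^{-2} e^{-u^2}\,du = \frac{e^{-x^2}}{2x} - \frac{e^{-x^2}}{2^2 x^3} + \frac{3}{2^2}\int_x^{+\infty} u^{-4} e^{-u^2}\,du = \cdots,$$

from which, after obvious estimates, it follows that

$$\operatorname{Erf}(x) \simeq \frac{e^{-x^2}}{2x} \sum_{k=0}^{\infty} \frac{(-1)^k (2k-1)!!}{2^k}\,x^{-2k} \quad\text{as } x \to +\infty. \qquad (19.22)$$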


Let us now obtain this expansion from Theorem 2. 

By the change of variable $u = xt$ we arrive at the representation

$$\operatorname{Erf}(x) = x \int_1^{+\infty} e^{-x^2 t^2}\,dt.$$

Setting $\lambda = x^2$ here and denoting the variable of integration, as in Theorem 2, by $x$,
we reduce the problem to finding the asymptotic behavior of the integral

$$F(\lambda) = \int_1^{\infty} e^{-\lambda x^2}\,dx, \qquad (19.23)$$

since $\operatorname{Erf}(x) = xF(x^2)$.

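When Remark 4 is taken into account, the integral (19.23) satisfies the hypotheses
of Theorem 2: $S(x) = -x^2$, $S'(x) = -2x < 0$ for $1 \le x < +\infty$, $S'(1) = -2$,
$S(1) = -1$.

Thus, $x_0 = a = 1$, $m = 1$, $f(x) \equiv 1$, $h(x, a) = \frac{1}{-2x}$, $h(x, a)\frac{d}{dx} = \frac{1}{-2x}\frac{d}{dx}$.

Hence,

$$\Bigl(\frac{1}{-2x}\frac{d}{dx}\Bigr)^{0}\Bigl(\frac{1}{-2x}\Bigr) = -\frac{1}{2}x^{-1},$$

$$\Bigl(\frac{1}{-2x}\frac{d}{dx}\Bigr)^{1}\Bigl(\frac{1}{-2x}\Bigr) = \Bigl(-\frac{1}{2}\Bigr)^{2}(-1)\,x^{-3},$$

$$\Bigl(\frac{1}{-2x}\frac{d}{dx}\Bigr)^{2}\Bigl(\frac{1}{-2x}\Bigr) = \Bigl(-\frac{1}{2}\Bigr)^{3}(-1)(-3)\,x^{-5},$$

$$\cdots$$

$$\Bigl(\frac{1}{-2x}\frac{d}{dx}\Bigr)^{k}\Bigl(\frac{1}{-2x}\Bigr) = -\frac{(2k-1)!!}{2^{k+1}}\,x^{-(2k+1)}.$$

Setting $x = 1$, we find that

$$a_k = \frac{(-1)^{k+1}}{k!}\,\Gamma(k+1)\Bigl(-\frac{(2k-1)!!}{2^{k+1}}\Bigr) = (-1)^k\,\frac{(2k-1)!!}{2^{k+1}}.$$

Now writing out the asymptotic expansion (19.16) for the integral (19.23) taking
account of the relation $\operatorname{Erf}(x) = xF(x^2)$, we obtain the expansion (19.22) for the
function $\operatorname{Erf}(x)$ as $x \to +\infty$.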
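The expansion (19.22) can be checked numerically. The following is a minimal sketch, not part of the original text: it assumes that the function Erf(x) of this example, the integral of e^{-u^2} from x to infinity, equals (sqrt(pi)/2) erfc(x) in terms of the standard complementary error function, and the helper names are illustrative only.

```python
import math

def double_factorial_odd(k):
    """Return (2k-1)!! with the convention (-1)!! = 1."""
    result = 1
    for j in range(1, 2 * k, 2):
        result *= j
    return result

def erf_tail_series(x, n_terms):
    """Partial sum of (19.22): e^{-x^2}/(2x) * sum_k (-1)^k (2k-1)!! / (2^k x^{2k})."""
    total = 0.0
    for k in range(n_terms):
        total += (-1) ** k * double_factorial_odd(k) / (2 ** k * x ** (2 * k))
    return math.exp(-x * x) / (2 * x) * total

def erf_tail_exact(x):
    """Integral of e^{-u^2} from x to infinity, via the standard library erfc."""
    return math.sqrt(math.pi) / 2 * math.erfc(x)

for x in (2.0, 4.0):
    exact = erf_tail_exact(x)
    for n in (1, 2, 3):
        rel_err = abs(erf_tail_series(x, n) - exact) / exact
        print(f"x={x}, terms={n}, relative error={rel_err:.2e}")
```

For a fixed number of terms the relative error shrinks as x grows, which is what an asymptotic (not necessarily convergent) expansion promises.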


Example 13 In Example 7, starting from the representation

$$\Gamma(\lambda + 1) = \lambda^{\lambda+1} \int_0^{+\infty} e^{-\lambda(x - \ln x)}\,dx, \qquad (19.24)$$

we obtained the principal term of the asymptotics of the function $\Gamma(\lambda + 1)$ as $\lambda \to +\infty$.
Let us now sharpen the formula obtained earlier, using Theorem 2b).

To simplify the notation a bit, let us replace $x$ by $x + 1$ in the integral (19.24).
We then find that

$$\Gamma(\lambda + 1) = \lambda^{\lambda+1} e^{-\lambda} \int_{-1}^{+\infty} e^{\lambda(\ln(1+x) - x)}\,dx$$

and the question reduces to studying the asymptotics of the integral

$$F(\lambda) = \int_{-1}^{+\infty} e^{\lambda(\ln(1+x) - x)}\,dx \qquad (19.25)$$

as $\lambda \to +\infty$. Here $S(x) = \ln(1 + x) - x$, $S'(x) = \frac{1}{1+x} - 1$, $S'(0) = 0$, that is, $x_0 = 0$,
$S''(x) = -\frac{1}{(1+x)^2}$, $S''(0) = -1 \neq 0$. That is, taking account of Remark 4, we see that
the hypotheses b) of Theorem 2 are satisfied, where we must also set $f(x) \equiv 1$ and
$m = 1$, since $S''(0) \neq 0$.

In this case the function $h(x, x_0) = h(x)$ has the following form:

$$h(x) = -\frac{1+x}{x}\bigl(x - \ln(1 + x)\bigr)^{1/2}.$$

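If we wish to find the first two terms of the asymptotics, we need to compute the
following at $x = 0$:

$$\Bigl(h(x)\frac{d}{dx}\Bigr)^{0}\bigl(h(x)\bigr) = h(x),$$

$$\Bigl(h(x)\frac{d}{dx}\Bigr)^{1}\bigl(h(x)\bigr) = h(x)\frac{dh}{dx}(x),$$

$$\Bigl(h(x)\frac{d}{dx}\Bigr)^{2}\bigl(h(x)\bigr) = \Bigl(h(x)\frac{d}{dx}\Bigr)\Bigl(h(x)\frac{dh}{dx}(x)\Bigr) = h(x)\Bigl[\Bigl(\frac{dh}{dx}\Bigr)^{2}(x) + h(x)\frac{d^2h}{dx^2}(x)\Bigr].$$

This computation, as one can see, is easily done if we find the values $h(0)$, $h'(0)$,
$h''(0)$, which in turn can be obtained from the Taylor expansion of $h(x)$, $x \ge 0$, in a
neighborhood of 0:

$$h(x) = -\frac{1+x}{x}\Bigl[x - \Bigl(x - \frac{1}{2}x^2 + \frac{1}{3}x^3 - \frac{1}{4}x^4 + O(x^5)\Bigr)\Bigr]^{1/2} =$$

$$= -\frac{1+x}{x}\Bigl[\frac{1}{2}x^2 - \frac{1}{3}x^3 + \frac{1}{4}x^4 + O(x^5)\Bigr]^{1/2} =$$

$$= -\frac{1+x}{\sqrt{2}}\Bigl[1 - \frac{2}{3}x + \frac{2}{4}x^2 + O(x^3)\Bigr]^{1/2} =$$

$$= -\frac{1+x}{\sqrt{2}}\Bigl(1 - \frac{1}{3}x + \frac{7}{36}x^2 + O(x^3)\Bigr) =$$

$$= -\frac{1}{\sqrt{2}} - \frac{\sqrt{2}}{3}x + \frac{5}{36\sqrt{2}}x^2 + O(x^3).$$

Thus, $h(0) = -\frac{1}{\sqrt{2}}$, $h'(0) = -\frac{\sqrt{2}}{3}$, $h''(0) = \frac{5}{18\sqrt{2}}$, so that

$$\Bigl(h(x)\frac{d}{dx}\Bigr)^{0}\bigl(h(x)\bigr)\Big|_{x=0} = -\frac{1}{\sqrt{2}}, \qquad \Bigl(h(x)\frac{d}{dx}\Bigr)^{2}\bigl(h(x)\bigr)\Big|_{x=0} = -\frac{1}{12\sqrt{2}}.$$

Substituting these values into the formulas for $c_0$ and $c_1$ from Theorem 2, we find that, as $\lambda \to +\infty$,

$$F(\lambda) = \sqrt{2\pi}\,\lambda^{-1/2}\Bigl(1 + \frac{1}{12}\lambda^{-1} + O(\lambda^{-2})\Bigr),$$

that is, as $\lambda \to +\infty$,

$$\Gamma(\lambda + 1) = \sqrt{2\pi\lambda}\,\Bigl(\frac{\lambda}{e}\Bigr)^{\lambda}\Bigl(1 + \frac{1}{12}\lambda^{-1} + O(\lambda^{-2})\Bigr). \qquad (19.26)$$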
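Formula (19.26) lends itself to a quick numerical sanity check. The sketch below is an illustration only (the function names are not from the text): it compares the two-term right-hand side of (19.26) with the library gamma function and prints the relative error multiplied by the square of the parameter, which should stay bounded if the remainder really is of second order.

```python
import math

def stirling_two_terms(lam):
    """Right-hand side of (19.26) without the O(lambda^-2) remainder."""
    return math.sqrt(2 * math.pi * lam) * (lam / math.e) ** lam * (1 + 1 / (12 * lam))

def relative_error(lam):
    """Relative deviation of the two-term formula from Gamma(lam + 1)."""
    exact = math.gamma(lam + 1)
    return abs(stirling_two_terms(lam) - exact) / exact

for lam in (5.0, 10.0, 20.0, 40.0):
    # If the remainder is O(lambda^-2), then lambda^2 * error should stay bounded.
    print(lam, relative_error(lam), lam ** 2 * relative_error(lam))
```

The scaled error settles near 1/288, the coefficient of the next term of Stirling's series quoted in Problem 2d) of this section.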


It is useful to keep in mind that the asymptotic expansions (19.16) and (19.17) can 
also be found by following the proof of Theorem 2 without invoking the expressions 
for the coefficients shown in the statement of Theorem 2. 

As an example, we once again obtain the asymptotics of the integral (19.25), but 
in a slightly different way. 

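Using the localization principle and making a change of variable $x = \varphi(y)$ in a
neighborhood of zero such that $0 = \varphi(0)$, $S(\varphi(y)) = \ln(1 + \varphi(y)) - \varphi(y) = -y^2$,
we reduce the problem to studying the asymptotics of the integral

$$\int_{-\varepsilon}^{\varepsilon} \varphi'(y)\,e^{-\lambda y^2}\,dy = \int_0^{\varepsilon} \psi(y)\,e^{-\lambda y^2}\,dy,$$

where $\psi(y) = \varphi'(y) + \varphi'(-y)$. The asymptotic expansion of this last integral can
be obtained from Watson's lemma

$$\int_0^{\varepsilon} \psi(y)\,e^{-\lambda y^2}\,dy \simeq \frac{1}{2} \sum_{k=0}^{\infty} \frac{\psi^{(k)}(0)}{k!}\,\Gamma\Bigl(\frac{k+1}{2}\Bigr)\lambda^{-(k+1)/2} \quad\text{as } \lambda \to +\infty,$$

which by the relations $\psi^{(2k+1)}(0) = 0$, $\psi^{(2k)}(0) = 2\varphi^{(2k+1)}(0)$ yields the asymptotic
series

$$\sum_{k=0}^{\infty} \frac{\varphi^{(2k+1)}(0)}{(2k)!}\,\Gamma\Bigl(k + \frac{1}{2}\Bigr)\lambda^{-(k+1/2)} = \lambda^{-1/2}\sqrt{\pi} \sum_{k=0}^{\infty} \frac{\varphi^{(2k+1)}(0)}{k!\,2^{2k}}\,\lambda^{-k}.$$

Thus for the integral (19.25) we obtain the following asymptotic expansion

$$F(\lambda) \simeq \lambda^{-1/2}\sqrt{\pi} \sum_{k=0}^{\infty} \frac{\varphi^{(2k+1)}(0)}{k!\,2^{2k}}\,\lambda^{-k}, \qquad (19.27)$$

where $x = \varphi(y)$ is a smooth function such that $x - \ln(1 + x) = y^2$ in a neighborhood
of zero (for both $x$ and $y$).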

If we wish to know the first two terms of the asymptotics, we must put the specific
values $\varphi'(0)$ and $\varphi^{(3)}(0)$ into formula (19.27).

It may be of some use to illustrate the following device for computing these 
values, which can be used generally to obtain the Taylor expansion of an inverse 
function from the expansion of the direct function. 

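Assuming $x > 0$ and $y > 0$, from the relation

$$x - \ln(1 + x) = y^2$$

we obtain successively

$$\frac{1}{2}x^2\Bigl(1 - \frac{2}{3}x + \frac{1}{2}x^2 + O(x^3)\Bigr) = y^2,$$

$$x = \sqrt{2}\,y\Bigl(1 - \frac{2}{3}x + \frac{1}{2}x^2 + O(x^3)\Bigr)^{-1/2} =$$

$$= \sqrt{2}\,y\Bigl(1 + \frac{1}{3}x - \frac{1}{12}x^2 + O(x^3)\Bigr) =$$

$$= \sqrt{2}\,y + \frac{\sqrt{2}}{3}\,yx - \frac{\sqrt{2}}{12}\,yx^2 + O(yx^3).$$

But $x \sim \sqrt{2}\,y$ as $y \to 0$ ($x \to 0$), and therefore, using the representation of $x$
already found, one can continue this computation and find that as $y \to 0$

$$x = \sqrt{2}\,y + \frac{\sqrt{2}}{3}\,y\Bigl(\sqrt{2}\,y + \frac{\sqrt{2}}{3}\,yx + O(y^3)\Bigr) - \frac{\sqrt{2}}{12}\,y\bigl(\sqrt{2}\,y\bigr)^2 + O(y^4) =$$

$$= \sqrt{2}\,y + \frac{2}{3}y^2 + \frac{2}{9}y^2 x - \frac{\sqrt{2}}{6}y^3 + O(y^4) =$$

$$= \sqrt{2}\,y + \frac{2}{3}y^2 + \frac{2}{9}y^2\bigl(\sqrt{2}\,y\bigr) - \frac{\sqrt{2}}{6}y^3 + O(y^4) =$$

$$= \sqrt{2}\,y + \frac{2}{3}y^2 + \frac{\sqrt{2}}{18}y^3 + O(y^4).$$

Thus for the quantities $\varphi'(0)$ and $\varphi^{(3)}(0)$ of interest to us we find the following
values: $\varphi'(0) = \sqrt{2}$, $\varphi^{(3)}(0) = \frac{\sqrt{2}}{3}$.

Substituting them into formula (19.27), we find that

$$F(\lambda) = \lambda^{-1/2}\sqrt{2\pi}\Bigl(1 + \frac{1}{12}\lambda^{-1} + O(\lambda^{-2})\Bigr) \quad\text{as } \lambda \to +\infty,$$

from which we again obtain formula (19.26).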
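The inverse-function expansion found above can be verified numerically. This is a minimal sketch with illustrative names, assuming only the defining relation x - ln(1 + x) = y^2 and the three-term Taylor polynomial of the inverse function with coefficients sqrt(2), 2/3 and sqrt(2)/18: it substitutes the polynomial into the relation and watches how fast the residual decays.

```python
import math

SQRT2 = math.sqrt(2.0)

def phi_truncated(y):
    """Three-term Taylor polynomial of x = phi(y) found in the text."""
    return SQRT2 * y + (2.0 / 3.0) * y ** 2 + (SQRT2 / 18.0) * y ** 3

def residual(y):
    """How far x - ln(1 + x) is from y^2 when x is the truncated series."""
    x = phi_truncated(y)
    return (x - math.log1p(x)) - y * y

for y in (0.1, 0.05, 0.025):
    # The truncation error in x is O(y^4); since d/dx (x - ln(1+x)) ~ x ~ y,
    # the residual should shrink like y^5.
    print(y, residual(y), residual(y) / y ** 5)
```

Halving y reduces the residual by a factor close to 32, consistent with all three coefficients being correct; a wrong cubic coefficient would leave a residual of order y^4 instead.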

In conclusion we shall make two more remarks on the problems discussed in this 
section. 

Remark 6 (Laplace's method in the multidimensional case) We note that Laplace's
method can also be successfully applied in studying the asymptotics of multiple
Laplace integrals

$$F(\lambda) = \int_X f(x)\,e^{\lambda S(x)}\,dx,$$

in which $x \in \mathbb{R}^n$, $X$ is a domain in $\mathbb{R}^n$, and $f$ and $S$ are real-valued functions in $X$.
Lemma 1 on the exponential estimate holds for such integrals, and by this lemma
the study of the asymptotics of such an integral reduces to studying the asymptotics
of a part of it

$$\int_{U(x_0)} f(x)\,e^{\lambda S(x)}\,dx,$$

taken over a neighborhood of a maximum point $x_0$ of the function $S(x)$.

If this is a nondegenerate maximum, that is, $\det S''(x_0) \neq 0$, then by Morse's lemma
(see Sect. 8.6 of Part 1) there exists a change of variable $x = \varphi(y)$ such that
$S(x_0) - S(\varphi(y)) = |y|^2$, where $|y|^2 = (y^1)^2 + \cdots + (y^n)^2$. Thus the question reduces to the
canonical integral

$$\int (f \circ \varphi)(y)\,\det \varphi'(y)\,e^{-\lambda |y|^2}\,dy,$$

which in the case of smooth functions $f$ and $S$ can be studied by applying Fubini's
theorem and using Watson's lemma proved above (see Problems 8-11 in this
connection).

Remark 7 (The stationary phase method) In a wider interpretation, Laplace’s 
method, as we have already noted, consists of the following: 

1° a certain localization principle (Lemma 1 on the exponential estimate), 

2° a method of locally reducing an integral to canonical form (Morse’s lemma), 
and 

3° a description of the asymptotics of canonical integrals (Watson’s lemma). 

We have met the idea of localization previously in our study of approximate iden- 
tities, and also in studying Fourier series and the Fourier transform (the Riemann- 
Lebesgue lemma, smoothness of a function and the rate at which its Fourier trans- 
form decreases, convergence of Fourier series and integrals). 

Integrals of the form

$$F(\lambda) = \int_X f(x)\,e^{i\lambda S(x)}\,dx,$$

where $x \in \mathbb{R}^n$, called Fourier integrals, occupy an important place in mathematics
and its applications. A Fourier integral differs from a Laplace integral only in the
modest factor $i$ in the exponent. This leads, however, to the relation $|e^{i\lambda S(x)}| = 1$
when $\lambda$ and $S(x)$ are real, and hence the idea of a dominant maximum is not
applicable to the study of the asymptotics of a Fourier integral.

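Let $X = [a, b] \subset \mathbb{R}^1$, $f \in C_0^{(\infty)}([a, b], \mathbb{R})$ (that is, $f$ is of compact support on
$[a, b]$), $S \in C^{(\infty)}([a, b], \mathbb{R})$ and $S'(x) \neq 0$ on $[a, b]$.

Integrating by parts and using the Riemann-Lebesgue lemma (see Problem 12),
we find that

$$\int_a^b f(x)\,e^{i\lambda S(x)}\,dx = \frac{1}{i\lambda}\int_a^b \frac{f(x)}{S'(x)}\,de^{i\lambda S(x)} =$$

$$= -\frac{1}{i\lambda}\int_a^b \frac{d}{dx}\Bigl(\frac{f}{S'}\Bigr)(x)\,e^{i\lambda S(x)}\,dx =$$

$$= \frac{1}{\lambda}\int_a^b f_1(x)\,e^{i\lambda S(x)}\,dx = \cdots =$$

$$= \frac{1}{\lambda^n}\int_a^b f_n(x)\,e^{i\lambda S(x)}\,dx = o(\lambda^{-n}) \quad\text{as } \lambda \to \infty,$$

where $f_1, \ldots, f_n$ denote the smooth functions of compact support that arise at the successive steps.

Thus if $S'(x) \neq 0$ on the closed interval $[a, b]$, then because of the constantly
increasing frequencies of oscillation of the function $e^{i\lambda S(x)}$ as $\lambda \to \infty$, the Fourier
integral over the closed interval $[a, b]$ turns out to be a quantity of type $O(\lambda^{-\infty})$.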

The function $S(x)$ in the Fourier integral is called the phase function. Thus the
Fourier integral has its own localization principle called the stationary phase
principle. According to this principle, the asymptotic behavior of the Fourier integral
as $\lambda \to \infty$ (when $f \in C_0^{(\infty)}$) is the same as the asymptotics of the Fourier integral
taken over a neighborhood $U(x_0)$ of a stationary point $x_0$ of the phase function (that
is, a point $x_0$ at which $S'(x_0) = 0$) up to a quantity $O(\lambda^{-\infty})$.

After this, by a change of variable the question reduces to the canonical integral

$$E(\lambda) = \int_0^a x^{\beta-1} f(x)\,e^{i\lambda x^{\alpha}}\,dx,$$

whose asymptotic behavior is described by a special lemma of Erdélyi, which plays
the same role for the Fourier integral that Watson's lemma plays for the Laplace
integral.

This scheme for investigating the asymptotics of a Fourier integral is called the 
stationary phase method. 

The nature of the localization principle in the stationary phase method is com- 
pletely different from its nature in the case of the Laplace integral, but the general 
scheme of Laplace’s method, as one can see, remains applicable even here. 

Certain details relating to the stationary phase method will be found in Prob- 
lems 12-17. 
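The contrast between a phase without stationary points and a phase with one can be seen in a small numerical experiment. The sketch below is only an illustration under stated assumptions: the amplitude is the standard smooth bump exp(-1/(1 - x^2)) on (-1, 1), the integrals are approximated by a plain midpoint rule, the two phases x and x^2 are chosen for convenience, and the principal term is the one quoted in Problem 14a) of this section. None of the function names come from the text.

```python
import cmath
import math

def bump(x):
    """Smooth function with compact support in (-1, 1)."""
    return math.exp(-1.0 / (1.0 - x * x)) if abs(x) < 1.0 else 0.0

def fourier_integral(phase, lam, n=20000):
    """Midpoint-rule approximation of the integral of bump(x) e^{i lam phase(x)} over [-1, 1]."""
    h = 2.0 / n
    total = 0j
    for j in range(n):
        x = -1.0 + (j + 0.5) * h
        total += bump(x) * cmath.exp(1j * lam * phase(x))
    return total * h

def linear_phase(x):
    """S(x) = x, so S'(x) = 1: no stationary point."""
    return x

def quadratic_phase(x):
    """S(x) = x^2, so S'(0) = 0 and S''(0) = 2: one nondegenerate stationary point."""
    return x * x

def principal_term(lam):
    """sqrt(2 pi / (lam |S''(0)|)) e^{i (pi/4) sgn S''(0) + i lam S(0)} f(0) for S(x) = x^2."""
    return math.sqrt(math.pi / lam) * cmath.exp(1j * math.pi / 4) * bump(0.0)

for lam in (25.0, 100.0):
    no_stationary = abs(fourier_integral(linear_phase, lam))
    stationary = fourier_integral(quadratic_phase, lam)
    print(lam, no_stationary, abs(stationary), abs(principal_term(lam)))
```

With the linear phase the integral becomes negligible very quickly, while with the quadratic phase it decays only like the inverse square root of the parameter and follows the principal term closely.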


19.2.6 Problems and Exercises 


Laplace’s Method in the One-Dimensional Case 

1. a) For $\alpha > 0$ the function $h(x) = e^{-\lambda x^{\alpha}}$ attains its maximum when $x = 0$. Here
$h(x)$ is a quantity of order 1 in a $\delta$-neighborhood of $x = 0$ of size $\delta = O(\lambda^{-1/\alpha})$.
Using Lemma 1, show that if $0 < \delta < 1$, then the integral

$$W(\lambda) = \int_{c(\lambda,\delta)}^{a} x^{\beta-1} f(x)\,e^{-\lambda x^{\alpha}}\,dx,$$

where $c(\lambda, \delta) = \lambda^{\frac{\delta-1}{\alpha}}$, has order $O(e^{-A\lambda^{\delta}})$ as $\lambda \to +\infty$, $A$ being a positive constant.
b) Prove that if the function $f$ is continuous at $x = 0$, then

$$W(\lambda) = \alpha^{-1}\,\Gamma(\beta/\alpha)\,[f(0) + o(1)]\,\lambda^{-\beta/\alpha} \quad\text{as } \lambda \to +\infty.$$

c) In Theorem 1a), the hypothesis $f(x) = f(x_0) + O(x - x_0)$ can be weakened
and replaced by the condition that $f$ be continuous at $x_0$. Show that when this
is done the same principal term of the asymptotics is obtained, but in general not
Eq. (19.2') itself, in which $O(x - x_0)$ is now replaced by $o(1)$.

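2. a) The Bernoulli numbers $B_{2k}$ are defined by the relations

$$\frac{1}{t} - \frac{1}{1 - e^{-t}} = -\frac{1}{2} - \sum_{k=1}^{\infty} \frac{B_{2k}}{(2k)!}\,t^{2k-1}, \quad |t| < 2\pi.$$

It is known that

$$\Bigl(\frac{\Gamma'}{\Gamma}\Bigr)(x) = \ln x + \int_0^{\infty} \Bigl(\frac{1}{t} - \frac{1}{1 - e^{-t}}\Bigr) e^{-tx}\,dt.$$

Show that

$$\Bigl(\frac{\Gamma'}{\Gamma}\Bigr)(x) \simeq \ln x - \frac{1}{2x} - \sum_{k=1}^{\infty} \frac{B_{2k}}{2k}\,x^{-2k} \quad\text{as } x \to +\infty.$$

b) Prove that as $x \to +\infty$

$$\ln \Gamma(x) \simeq \Bigl(x - \frac{1}{2}\Bigr)\ln x - x + \frac{1}{2}\ln 2\pi + \sum_{k=1}^{\infty} \frac{B_{2k}}{2k(2k-1)}\,x^{-2k+1}.$$

This asymptotic expansion is called Stirling's series.

c) Using Stirling's series, obtain the first two terms of the asymptotics of
$\Gamma(x + 1)$ as $x \to +\infty$ and compare your result with what was obtained in
Example 13.

d) Following the method of Example 13 and independently of it using Stirling's
series, show that

$$\Gamma(x + 1) = \sqrt{2\pi x}\,\Bigl(\frac{x}{e}\Bigr)^{x}\Bigl(1 + \frac{1}{12x} + \frac{1}{288x^2} + O\Bigl(\frac{1}{x^3}\Bigr)\Bigr) \quad\text{as } x \to +\infty.$$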

3. a) Let $f \in C([0, a], \mathbb{R})$, $S \in C^{(1)}([0, a], \mathbb{R})$, $S(x) > 0$ on $[0, a]$, and suppose
$S(x)$ attains its maximum at $x = 0$, with $S'(0) \neq 0$. Show that if $f(0) \neq 0$, then

$$I(\lambda) := \int_0^a f(x)\,S^{\lambda}(x)\,dx \sim -\frac{f(0)}{\lambda S'(0)}\,S^{\lambda+1}(0) \quad\text{as } \lambda \to +\infty.$$

b) Obtain the asymptotic expansion

$$I(\lambda) \simeq S^{\lambda+1}(0) \sum_{k=0}^{\infty} a_k \lambda^{-(k+1)} \quad\text{as } \lambda \to +\infty,$$

if it is known in addition that $f, S \in C^{(\infty)}([0, a], \mathbb{R})$.

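4. a) Show that

$$\int_0^{\pi/2} \sin^n t\,dt = \sqrt{\frac{\pi}{2n}}\,\bigl(1 + O(n^{-1})\bigr) \quad\text{as } n \to +\infty.$$

b) Express this integral in terms of Eulerian integrals and show that for $n \in \mathbb{N}$ the
integral $\int_0^{\pi/2} \sin^{2n} t\,dt$ equals $\frac{(2n-1)!!}{(2n)!!} \cdot \frac{\pi}{2}$.

c) Obtain Wallis' formula $\pi = \lim_{n \to \infty} \frac{1}{n}\Bigl(\frac{(2n)!!}{(2n-1)!!}\Bigr)^{2}$.

d) Find the second term in the asymptotic expansion of the original integral as
$n \to +\infty$.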

5. a) Show that $\int_{-1}^{1} (1 - x^2)^n\,dx \sim \sqrt{\frac{\pi}{n}}$ as $n \to +\infty$.

b) Find the next term in the asymptotics of this integral. 

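6. Show that if $\alpha > 0$, then as $x \to +\infty$

$$\int_0^{+\infty} t^{-\alpha t}\,x^{t}\,dt \sim \sqrt{\frac{2\pi}{e\alpha}}\;x^{\frac{1}{2\alpha}} \exp\Bigl(\frac{\alpha}{e}\,x^{\frac{1}{\alpha}}\Bigr).$$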


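7. a) Find the principal term of the asymptotics of the integral

$$\int_0^{+\infty} (1 + t)^n e^{-nt}\,dt \quad\text{as } n \to +\infty.$$

b) Using this result and the identity $k!\,n^{-k} = n \int_0^{+\infty} e^{-nt} t^k\,dt$, show that

$$\sum_{k=0}^{n} c_n^k\,k!\,n^{-k} = \sqrt{\frac{\pi n}{2}}\,\bigl(1 + O(n^{-1})\bigr) \quad\text{as } n \to +\infty.$$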

Laplace’s Method in the Multidimensional Case 

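8. The exponential estimate lemma. Let $M = \sup_{x \in D} S(x)$, and suppose that for
some $\lambda = \lambda_0$ the integral

$$F(\lambda) = \int_{D \subset \mathbb{R}^n} f(x)\,e^{\lambda S(x)}\,dx \qquad (*)$$

converges absolutely. Show that it then converges absolutely for $\lambda \ge \lambda_0$ and

$$|F(\lambda)| \le \int_D \bigl|f(x)\,e^{\lambda S(x)}\bigr|\,dx \le A e^{\lambda M} \quad (\lambda \ge \lambda_0),$$

where $A$ is a positive constant.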
9. Morse's lemma. Let $x_0$ be a nondegenerate critical point of the function $S(x)$,
$x \in \mathbb{R}^n$, defined and belonging to class $C^{(\infty)}$ in a neighborhood of $x_0$. Then there
exist neighborhoods $U$ and $V$ of $x = x_0$ and $y = 0$ and a diffeomorphism $\varphi : V \to U$
of class $C^{(\infty)}(V, U)$ such that

$$S(\varphi(y)) = S(x_0) + \frac{1}{2}\sum_{j=1}^{n} \nu_j\,(y^j)^2,$$

where $\det \varphi'(0) = 1$, $\nu_1, \ldots, \nu_n$ are the eigenvalues of the matrix $S''_{xx}(x_0)$, and
$y^1, \ldots, y^n$ are the coordinates of $y \in \mathbb{R}^n$.

Prove this slightly more specific form of Morse's lemma starting from Morse's
lemma itself, which is discussed in Sect. 8.6 of Part 1.

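10. Asymptotics of a canonical integral.

a) Let $t = (t_1, \ldots, t_n)$, $V = \{t \in \mathbb{R}^n \mid |t_j| \le \delta,\ j = 1, 2, \ldots, n\}$, and
$a \in C^{(\infty)}(V, \mathbb{R})$. Consider the function

$$F_1(\lambda, t') = \int_{-\delta}^{\delta} a(t_1, \ldots, t_n)\,e^{-\frac{\lambda \nu_1}{2} t_1^2}\,dt_1,$$

where $t' = (t_2, \ldots, t_n)$ and $\nu_1 > 0$. Show that $F_1(\lambda, t') \simeq \sum_{k=0}^{\infty} a_k(t')\,\lambda^{-(k+\frac{1}{2})}$
as $\lambda \to +\infty$. This expansion is uniform in $t' \in V' = \{t' \in \mathbb{R}^{n-1} \mid |t_j| \le \delta,\ j = 2, \ldots, n\}$
and $a_k \in C^{(\infty)}(V', \mathbb{R})$ for every $k = 0, 1, \ldots$.

b) Multiplying $F_1(\lambda, t')$ by $e^{-\frac{\lambda \nu_2}{2} t_2^2}$ and justifying the termwise integration of
the corresponding asymptotic expansion, obtain the asymptotic expansion of the
function

$$F_2(\lambda, t'') = \int_{-\delta}^{\delta} F_1(\lambda, t')\,e^{-\frac{\lambda \nu_2}{2} t_2^2}\,dt_2 \quad\text{as } \lambda \to +\infty,$$

where $t'' = (t_3, \ldots, t_n)$, $\nu_2 > 0$.

c) Prove that for the function

$$A(\lambda) = \int_{-\delta}^{\delta} \cdots \int_{-\delta}^{\delta} a(t_1, \ldots, t_n)\,e^{-\frac{\lambda}{2}\sum_{j=1}^{n} \nu_j t_j^2}\,dt_1 \cdots dt_n,$$

where $\nu_j > 0$, $j = 1, \ldots, n$, the following asymptotic expansion holds:

$$A(\lambda) \simeq \lambda^{-n/2} \sum_{k=0}^{\infty} a_k \lambda^{-k} \quad\text{as } \lambda \to +\infty,$$

where $a_0 = \sqrt{\frac{(2\pi)^n}{\nu_1 \cdots \nu_n}}\;a(0)$.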
11. The asymptotics of the Laplace integral in the multidimensional case.

a) Let $D$ be a closed bounded domain in $\mathbb{R}^n$, $f, S \in C(D, \mathbb{R})$, and suppose
$\max_{x \in D} S(x)$ is attained only at one interior point $x_0$ of $D$. Let $f$ and $S$ be $C^{(\infty)}$ in
some neighborhood of $x_0$ and $\det S''(x_0) \neq 0$.

Prove that if the integral (*) converges absolutely for some value $\lambda = \lambda_0$, then

$$F(\lambda) \simeq e^{\lambda S(x_0)}\,\lambda^{-n/2} \sum_{k=0}^{\infty} a_k \lambda^{-k} \quad\text{as } \lambda \to +\infty,$$

and this expansion can be differentiated with respect to $\lambda$ any number of times, and
its principal term has the form

$$F(\lambda) = e^{\lambda S(x_0)}\,\lambda^{-n/2} \sqrt{\frac{(2\pi)^n}{|\det S''(x_0)|}}\;\bigl(f(x_0) + O(\lambda^{-1})\bigr).$$

b) Verify that if instead of the relation $f, S \in C^{(\infty)}$ all we know is that $f \in C$
and $S \in C^{(3)}$ in a neighborhood of $x_0$, then the principal term of the asymptotics as
$\lambda \to +\infty$ remains the same with $O(\lambda^{-1})$ replaced by $o(1)$ as $\lambda \to +\infty$.


The Stationary Phase Method in the One-Dimensional Case 

12. Generalization of the Riemann-Lebesgue lemma.

a) Prove the following generalization of the Riemann-Lebesgue lemma.

Let $S \in C^{(1)}([a, b], \mathbb{R})$ and $S'(x) \neq 0$ on $[a, b] =: I$. Then for every function $f$
that is absolutely integrable on the interval $I$ the following relation holds:

$$F(\lambda) = \int_a^b f(x)\,e^{i\lambda S(x)}\,dx \to 0 \quad\text{as } \lambda \to \infty,\ \lambda \in \mathbb{R}.$$

b) Verify that if it is known in addition that $f \in C^{(n+1)}(I, \mathbb{R})$ and
$S \in C^{(n+2)}(I, \mathbb{R})$, then as $\lambda \to \infty$

$$F(\lambda) = \sum_{k=0}^{n} (-1)^k (i\lambda)^{-(k+1)} \Bigl[\Bigl(\frac{1}{S'(x)}\frac{d}{dx}\Bigr)^{k} \frac{f(x)}{S'(x)}\Bigr] e^{i\lambda S(x)}\Big|_a^b + o\bigl(\lambda^{-(n+1)}\bigr).$$

c) Write out the principal term of the asymptotics of the function $F(\lambda)$ as
$\lambda \to \infty$, $\lambda \in \mathbb{R}$.

d) Show that if $S \in C^{(\infty)}(I, \mathbb{R})$, and $f|_{[a,c]} \in C^{(2)}[a, c]$, $f|_{[c,b]} \in C^{(2)}[c, b]$, but
$f \notin C^{(2)}[a, b]$, then the function $F(\lambda)$ is not necessarily $o(\lambda^{-1})$ as $\lambda \to \infty$.

e) Prove that when $f, S \in C^{(\infty)}(I, \mathbb{R})$, the function $F(\lambda)$ admits an asymptotic
series expansion as $\lambda \to \infty$.

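f) Find asymptotic expansions as $\lambda \to \infty$, $\lambda \in \mathbb{R}$, for the following integrals:
$\int_0^{\infty} (1 + x)^{-\alpha}\,\psi_j(x, \lambda)\,dx$, $j = 1, 2, 3$, if $\alpha > 0$ and $\psi_1 = e^{i\lambda x}$, $\psi_2 = \cos \lambda x$, and
$\psi_3 = \sin \lambda x$.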
13. The localization principle.

a) Let $I = [a, b] \subset \mathbb{R}$, $f \in C_0^{(\infty)}(I, \mathbb{R})$, $S \in C^{(\infty)}(I, \mathbb{R})$, and $S'(x) \neq 0$ on $I$.
Prove that in this case

$$F(\lambda) := \int_a^b f(x)\,e^{i\lambda S(x)}\,dx = O(|\lambda|^{-\infty}) \quad\text{as } \lambda \to \infty.$$

b) Suppose $f \in C_0^{(\infty)}(I, \mathbb{R})$, $S \in C^{(\infty)}(I, \mathbb{R})$, and $x_1, \ldots, x_m$ is a finite set of
stationary points of $S(x)$ outside which $S'(x) \neq 0$ on $I$. We denote by $F(\lambda, x_j)$
the integral of the function $f(x)e^{i\lambda S(x)}$ over a neighborhood $U(x_j)$ of the point $x_j$,
$j = 1, \ldots, m$, not containing any other critical points in its closure. Prove that

$$F(\lambda) = \sum_{j=1}^{m} F(\lambda, x_j) + O(|\lambda|^{-\infty}) \quad\text{as } \lambda \to \infty.$$

14. Asymptotics of the Fourier integral in the one-dimensional case.

a) In a reasonably general situation finding the asymptotics of a one-dimensional
Fourier integral can be reduced through the localization principle to describing the
asymptotics of the canonical integral

$$E(\lambda) = \int_0^a x^{\beta-1} f(x)\,e^{i\lambda x^{\alpha}}\,dx,$$

for which the following lemma holds.

Erdélyi's lemma Let $\alpha \ge 1$, $\beta > 0$, $f \in C^{(\infty)}([0, a], \mathbb{R})$ and $f^{(k)}(a) = 0$,
$k = 0, 1, 2, \ldots$. Then

$$E(\lambda) \simeq \sum_{k=0}^{\infty} a_k\,\lambda^{-\frac{k+\beta}{\alpha}} \quad\text{as } \lambda \to +\infty,$$

where

$$a_k = \frac{1}{\alpha}\,\Gamma\Bigl(\frac{k+\beta}{\alpha}\Bigr)\,e^{i\frac{\pi}{2}\frac{k+\beta}{\alpha}}\,\frac{f^{(k)}(0)}{k!},$$

and this expansion can be differentiated any number of times with respect to $\lambda$.

Using Erdélyi's lemma, prove the following assertion.

Let $I = [x_0 - \delta, x_0 + \delta]$ be a finite closed interval, let $f, S \in C^{(\infty)}(I, \mathbb{R})$ with
$f \in C_0(I, \mathbb{R})$, and let $S$ have a unique stationary point $x_0$ on $I$, where $S'(x_0) = 0$
but $S''(x_0) \neq 0$. Then as $\lambda \to +\infty$

$$F(\lambda, x_0) := \int_{x_0-\delta}^{x_0+\delta} f(x)\,e^{i\lambda S(x)}\,dx \simeq e^{i\frac{\pi}{4}\operatorname{sgn} S''(x_0)}\,e^{i\lambda S(x_0)}\,\lambda^{-\frac{1}{2}} \sum_{k=0}^{\infty} a_k \lambda^{-k}$$

and the principal term of the asymptotics has the form

$$F(\lambda, x_0) = \sqrt{\frac{2\pi}{\lambda |S''(x_0)|}}\;e^{i(\frac{\pi}{4}\operatorname{sgn} S''(x_0) + \lambda S(x_0))}\,\bigl(f(x_0) + O(\lambda^{-1})\bigr).$$

b) Consider the Bessel function of integer order $n \ge 0$:

$$J_n(x) = \frac{1}{\pi}\int_0^{\pi} \cos(x \sin \varphi - n\varphi)\,d\varphi.$$

Show that

$$J_n(x) = \sqrt{\frac{2}{\pi x}}\,\Bigl[\cos\Bigl(x - \frac{n\pi}{2} - \frac{\pi}{4}\Bigr) + O(x^{-1})\Bigr] \quad\text{as } x \to +\infty.$$


The Stationary Phase Method in the Multidimensional Case 

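15. The localization principle.

a) Prove the following assertion. Let $D$ be a domain in $\mathbb{R}^n$, $f \in C_0^{(\infty)}(D, \mathbb{R})$,
$S \in C^{(\infty)}(D, \mathbb{R})$, $\operatorname{grad} S(x) \neq 0$ for $x \in \operatorname{supp} f$, and

$$F(\lambda) := \int_D f(x)\,e^{i\lambda S(x)}\,dx. \qquad (**)$$

Then for every $k \in \mathbb{N}$ there exists a positive constant $A(k)$ such that the estimate
$|F(\lambda)| \le A(k)\lambda^{-k}$ holds for $\lambda \ge 1$, and hence $F(\lambda) = O(\lambda^{-\infty})$ as $\lambda \to +\infty$.

b) Suppose as before that $f \in C_0^{(\infty)}(D, \mathbb{R})$, $S \in C^{(\infty)}(D, \mathbb{R})$, but $S$ has a finite
number of critical points $x_1, \ldots, x_m$ outside which $\operatorname{grad} S(x) \neq 0$. We denote by
$F(\lambda, x_j)$ the integral of the function $f(x)e^{i\lambda S(x)}$ over a neighborhood $U(x_j)$ of $x_j$
whose closure contains no critical points except $x_j$. Prove that

$$F(\lambda) = \sum_{j=1}^{m} F(\lambda, x_j) + O(\lambda^{-\infty}) \quad\text{as } \lambda \to +\infty.$$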

16. Reduction to a canonical integral. If $x_0$ is a nondegenerate critical point of
the function $S \in C^{(\infty)}(D, \mathbb{R})$ defined in a domain $D \subset \mathbb{R}^n$, then by Morse's lemma
(see Problem 9) there exists a local change of variable $x = \varphi(y)$ such that $x_0 = \varphi(0)$,
$S(\varphi(y)) = S(x_0) + \frac{1}{2}\sum_{j=1}^{n} \varepsilon_j (y^j)^2$, where $\varepsilon_j = \pm 1$, $y = (y^1, \ldots, y^n)$, and
$\det \varphi'(y) > 0$.

Using the localization principle (Problem 15), now show that if $f \in C_0^{(\infty)}(D, \mathbb{R})$,
$S \in C^{(\infty)}(D, \mathbb{R})$, and $S$ has at most a finite number of critical points in $D$, all
nondegenerate, then the study of the asymptotics of the integral (**) reduces to studying
the asymptotics of the special integral

$$\Phi(\lambda) := \int_{-\delta}^{\delta} \cdots \int_{-\delta}^{\delta} \psi(y^1, \ldots, y^n)\,e^{\frac{i\lambda}{2}\sum_{j=1}^{n} \varepsilon_j (y^j)^2}\,dy^1 \cdots dy^n.$$


17. Asymptotics of a Fourier integral in the multidimensional case. Using Erdélyi's
lemma (Problem 14a)) and the scenario described in Problem 10, prove that if $D$
is a domain in $\mathbb{R}^n$, $f, S \in C^{(\infty)}(D, \mathbb{R})$, $\operatorname{supp} f$ is a compact subset of $D$, and $x_0$ is
the only critical point of $S$ in $D$ and is nondegenerate, then for the integral (**) the
following asymptotic expansion holds as $\lambda \to +\infty$:

$$F(\lambda) \simeq \lambda^{-n/2}\,e^{i\lambda S(x_0)} \sum_{k=0}^{\infty} a_k \lambda^{-k},$$

which can be differentiated any number of times with respect to $\lambda$.
The main term of the asymptotics has the form

$$F(\lambda) = \Bigl(\frac{2\pi}{\lambda}\Bigr)^{n/2} \exp\Bigl[i\lambda S(x_0) + \frac{i\pi}{4}\operatorname{sgn} S''(x_0)\Bigr] \times |\det S''(x_0)|^{-1/2}\,\bigl[f(x_0) + O(\lambda^{-1})\bigr] \quad\text{as } \lambda \to +\infty.$$

Here $S''(x_0)$ is the symmetric and by hypothesis nonsingular matrix of second
derivatives of the function $S$ at $x_0$ (the Hessian), and $\operatorname{sgn} S''(x_0)$ is the signature of
this matrix (or the quadratic form corresponding to it), that is, the difference $\nu_+ - \nu_-$
between the number of positive and negative eigenvalues of the matrix $S''(x_0)$.


Topics and Questions for Midterm Examinations 


1 Series and Integrals Depending on a Parameter 

1. The Cauchy criterion for convergence of a series. The comparison theorem and
the basic sufficient conditions for convergence (majorant, integral, Abel-Dirichlet).
The series $\zeta(s) = \sum_{n=1}^{\infty} n^{-s}$.

2. Uniform convergence of families and series of functions. The Cauchy criterion 
and the basic sufficient conditions for uniform convergence of a series of functions 
(M-test, Abel-Dirichlet). 

3. Sufficient conditions for two limiting passages to commute. Continuity, integra- 
tion, and differentiation and passage to the limit. 

4 . The region of convergence and the nature of convergence of a power series. The 
Cauchy-Hadamard formula. Abel’s (second) theorem. Taylor expansions of the ba- 
sic elementary functions. Euler’s formula. Differentiation and integration of a power 
series. 

5. Improper integrals. The Cauchy criterion and the basic sufficient conditions for 
convergence (M-test, Abel-Dirichlet). 

6. Uniform convergence of an improper integral depending on a parameter. The 
Cauchy criterion and the basic sufficient conditions for uniform convergence (ma- 
jorant, Abel-Dirichlet). 

7. Continuity, differentiation, and integration of a proper integral depending on a 
parameter. 

8. Continuity, differentiation, and integration of an improper integral depending on 
a parameter. The Dirichlet integral. 

9. The Eulerian integrals. Domains of definition, differential properties, reduction 
formulas, various representations, interconnections. The Poisson integral. 

10 . Approximate identities. The theorem on convergence of the convolution. The 
classical Weierstrass theorem on uniform approximation of a continuous function 
by an algebraic polynomial. 


2 Problems Recommended as Midterm Questions 

Problem 1 $P$ is a polynomial. Compute $\bigl(e^{t\frac{d}{dx}}\bigr)P(x)$.

Problem 2 Verify that the vector-valued function $e^{tA}x_0$ is a solution of the Cauchy
problem $\dot{x} = Ax$, $x(0) = x_0$. (Here $\dot{x} = Ax$ is a system of equations defined by the
matrix $A$.)

Problem 3 Find up to order $o(1/n^3)$ the asymptotics of the positive roots
$\lambda_1 < \lambda_2 < \cdots < \lambda_n < \cdots$ of the equation $\sin x + 1/x = 0$ as $n \to \infty$.

Problem 4 a) Show that $\ln 2 = 1 - 1/2 + 1/3 - \cdots$. How many terms of this series
must be taken to determine $\ln 2$ within $10^{-3}$?

b) Verify that $\frac{1}{2}\ln\frac{1+t}{1-t} = t + \frac{1}{3}t^3 + \frac{1}{5}t^5 + \cdots$. Using this expansion it becomes
convenient to compute $\ln x$ by setting $x = \frac{1+t}{1-t}$.

c) Setting $t = 1/3$ in b), obtain the equality

$$\frac{1}{2}\ln 2 = \frac{1}{3} + \frac{1}{3}\Bigl(\frac{1}{3}\Bigr)^{3} + \frac{1}{5}\Bigl(\frac{1}{3}\Bigr)^{5} + \cdots.$$

How many terms of this series must one take to find $\ln 2$ within $10^{-3}$? Compare
this with the result of a).

This is one of the methods of improving convergence.
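The gain in convergence speed can be observed experimentally. The following sketch is only an empirical illustration with hypothetical helper names: it counts how many terms each series needs before the actual error first drops below the tolerance, which is not the same as the a priori remainder estimate the problem asks for.

```python
import math
from itertools import count

def alternating_partial_sums():
    """Partial sums of 1 - 1/2 + 1/3 - ... (converges to ln 2)."""
    s = 0.0
    for k in count(1):
        s += (-1) ** (k + 1) / k
        yield s

def artanh_partial_sums(t):
    """Partial sums of 2 (t + t^3/3 + t^5/5 + ...), which equals ln((1+t)/(1-t))."""
    s = 0.0
    for k in count(0):
        s += 2 * t ** (2 * k + 1) / (2 * k + 1)
        yield s

def first_hit(partial_sums, target, tol):
    """Number of terms in the first partial sum whose error is below tol."""
    for n, s in enumerate(partial_sums, start=1):
        if abs(s - target) < tol:
            return n

LN2 = math.log(2.0)
print(first_hit(alternating_partial_sums(), LN2, 1e-3))
print(first_hit(artanh_partial_sums(1.0 / 3.0), LN2, 1e-3))
```

The alternating series needs several hundred terms, whereas the series with t = 1/3 needs only a handful.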

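Problem 5 Verify that in the sense of Abel summation

a) $1 - 1 + 1 - \cdots = \frac{1}{2}$.

b) $\sum_{k=1}^{\infty} \sin k\varphi = \frac{1}{2}\cot\frac{\varphi}{2}$, $\varphi \neq 2\pi n$, $n \in \mathbb{Z}$.

c) $\frac{1}{2} + \sum_{k=1}^{\infty} \cos k\varphi = 0$, $\varphi \neq 2\pi n$, $n \in \mathbb{Z}$.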

Problem 6 Prove Hadamard's lemma:

a) If $f \in C^{(1)}(U(x_0))$, then $f(x) = f(x_0) + \varphi(x)(x - x_0)$, where $\varphi \in C(U(x_0))$
and $\varphi(x_0) = f'(x_0)$.

b) If $f \in C^{(n)}(U(x_0))$, then

$$f(x) = f(x_0) + \frac{1}{1!}f'(x_0)(x - x_0) + \cdots + \frac{1}{(n-1)!}f^{(n-1)}(x_0)(x - x_0)^{n-1} + \varphi(x)(x - x_0)^n,$$

where $\varphi \in C(U(x_0))$ and $\varphi(x_0) = \frac{1}{n!}f^{(n)}(x_0)$.

c) What do these relations look like in coordinate form, when $x = (x^1, \ldots, x^n)$,
that is, when $f$ is a function of $n$ variables?
Problem 7 a) Verify that the function

$$J_0(x) = \frac{1}{\pi}\int_0^1 \frac{\cos xt}{\sqrt{1 - t^2}}\,dt$$

satisfies Bessel's equation $y'' + \frac{1}{x}y' + y = 0$.

b) Try to solve this equation using power series.

c) Find the power-series expansion of the function $J_0(x)$.


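Problem 8 Verify that the following asymptotic expansions hold

a) $\Gamma(\alpha, x) := \int_x^{+\infty} t^{\alpha-1} e^{-t}\,dt \simeq e^{-x} \sum_{k=1}^{\infty} \frac{\Gamma(\alpha)}{\Gamma(\alpha - k + 1)}\,x^{\alpha-k}$,

b) $\operatorname{Erf}(x) := \int_x^{+\infty} e^{-t^2}\,dt \simeq \frac{1}{2}\sqrt{\pi}\,e^{-x^2} \sum_{k=1}^{\infty} \frac{1}{\Gamma(3/2 - k)\,x^{2k-1}}$

as $x \to +\infty$.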


Problem 9 a) Following Euler, obtain the result that the series $1 - 1!\,x + 2!\,x^2 - 3!\,x^3 + \cdots$
is connected with the function

$$S(x) := \int_0^{+\infty} \frac{e^{-t}}{1 + xt}\,dt.$$

b) Does this series converge?

c) Does it give the asymptotic expansion of $S(x)$ as $x \to 0$?

Problem 10 a) A linear device $A$ whose characteristics are constant over time
responds to a signal $\delta(t)$ in the form of a $\delta$-function by giving out the signal
(function) $E(t)$. What will the response of this device be to an input signal $f(t)$,
$-\infty < t < +\infty$?

b) Can the input signal $f$ always be recovered uniquely from the transformed
signal $\hat{f} := Af$?


3 Integral Calculus (Several Variables) 

1. Riemann integral on an n -dimensional interval. Lebesgue criterion for existence 
of the integral. 

2. Darboux criterion for existence of the integral of a real-valued function on an 
n -dimensional interval. 

3. Integral over a set. Jordan measure (content) of a set and its geometric mean- 
ing. Lebesgue criterion for existence of the integral over a Jordan-measurable set. 
Linearity and additivity of the integral. 

4. Estimates of the integral. 

5. Reduction of a multiple integral to an iterated integral. Fubini’s theorem and its 
most important corollaries. 


6. Formula for change of variables in a multiple integral. Invariance of measure and 
the integral. 

7. Improper multiple integrals: basic definitions, majorant criterion for conver- 
gence, canonical integrals. Computation of the Euler-Poisson integral. 

8. Surfaces of dimension $k$ in $\mathbb{R}^n$ and basic methods of defining them. Abstract
$k$-dimensional manifolds. Boundary of a $k$-dimensional manifold as a
$(k - 1)$-dimensional manifold without boundary.

9. Orientable and nonorientable manifolds. Methods of defining the orientation of
an abstract manifold and a (hyper)surface in $\mathbb{R}^n$.

Orientability of the boundary of an orientable manifold. Orientation induced on 
the boundary from the manifold. 

10. Tangent vectors and the tangent space to a manifold at a point. Interpretation of 
a tangent vector as a differential operator. 

11. Differential forms in a region $D \subset \mathbb{R}^n$. Examples: differential of a function,
work form, flux form. Coordinate expression of a differential form. Exterior deriva- 
tive operator. 

12. Mapping of objects and the adjoint mapping of functions on these objects. 
Transformation of points and vectors of tangent spaces at these points under a 
smooth mapping. Transfer of functions and differential forms under a smooth map- 
ping. A recipe for carrying out the transfer of forms in coordinate form. 

13. Commutation of transfer of differential forms with exterior multiplication and 
differentiation. Differential forms on a manifold. Invariance (unambiguous nature) 
of operations on differential forms. 

14. A scheme for computing work and flux. Integral of a $k$-form over a $k$-dimensional
smooth oriented surface, taking account of orientation. Independence
of the integral of the choice of parametrization. General definition of the integral of
a differential $k$-form over a $k$-dimensional compact oriented manifold.

15. Green’s formula on a square, its derivation, interpretation, and expression in the
language of integrals of the corresponding differential forms. The general Stokes
formula. Reduction to a $k$-dimensional interval and proof for a $k$-dimensional interval.
The classical integral formulas of analysis as particular versions of the general
Stokes formula.

16. The volume element on $\mathbb{R}^n$ and on a surface. Dependence of the volume element
on orientation. The integral of first kind and its independence of orientation. Area
and mass of a material surface as an integral of first kind. Expression of the volume
element of a $k$-dimensional surface $S^k \subset \mathbb{R}^n$ in local parameters and the expression
of the volume element of a hypersurface $S^{n-1} \subset \mathbb{R}^n$ in Cartesian coordinates of the
ambient space.

17. Basic differential operators of field theory (grad, curl, div) and their connection
with the exterior derivative operator $d$ in oriented Euclidean space $\mathbb{R}^3$.

18. Expression of work and flux of a field as integrals of first kind. The basic integral
formulas of field theory in $\mathbb{R}^3$ as the vector expression of the classical integral
formulas of analysis.

19. A potential field and its potential. Exact and closed forms. A necessary differential
condition for a form to be exact and for a vector field to be a potential field. Its
sufficiency in a simply connected domain. Integral criterion for exactness of 1-forms
and vector fields.

20. Local exactness of a closed form (the Poincaré lemma). Global analysis. Homology
and cohomology. De Rham’s theorem (statement).

21. Examples of the application of the Stokes (Gauss-Ostrogradskii) formula:
derivation of the basic equations of the mechanics of continuous media. Physical 
meaning of the gradient, curl, and divergence. 

22. Hamilton’s nabla operator and work with it. The gradient, curl, and divergence
in triorthogonal curvilinear coordinates. 


4 Problems Recommended for Studying the Midterm Topics 

The numbers followed by closing parentheses below refer to the topics 1-22 just
listed. Each closing parenthesis is followed by section numbers (for example,
13.4 means Sect. 4 of Chap. 13), which in turn are separated by a dash from the
numbers of the problems from that section related to the topic from the list above.

1) 11.1–2,3; 2) 11.1–4; 3) 11.2–1,3,4; 4) 11.3–1,2,3,4; 5) 11.4–6,7 and
13.2–6; 6) 11.5–9 and 12.5–5,6; 7) 11.6–1,5,7; 8) 12.1–2,3 and 12.4–1,4; 9)
12.2–1,2,3,4 and 12.5–11; 10) 15.3–1,2; 11) 12.5–9 and 15.3–3; 12) 15.3–4;
13) 12.5–8,10; 14) 13.1–3,4,5,9; 15) 13.1–1,10,13,14; 16) 12.4–10 and 13.2–5;
17) 14.1–1,2; 18) 14.2–1,2,3,4,8; 19) 14.3–7,13,14; 20) 14.3–11,12; 21)
13.3–1 and 14.1–8; 22) 14.1–4,5,6.


Examination Topics 


1 Series and Integrals Depending on a Parameter 

1. Cauchy criterion for convergence of a series. Comparison theorem and the basic
sufficient conditions for convergence (majorant, integral, Abel-Dirichlet). The
series $\zeta(s) = \sum_{n=1}^{\infty} n^{-s}$.

2. Uniform convergence of families and series of functions. Cauchy criterion and
the basic sufficient conditions for uniform convergence of a series of functions
($M$-test, Abel-Dirichlet).

3. Sufficient conditions for commutativity of two limiting passages. Continuity, in- 
tegration, and differentiation and passage to the limit. 

4. Region of convergence and the nature of convergence of a power series.
Cauchy-Hadamard formula. Abel’s (second) theorem. Taylor expansions of the basic
elementary functions. Euler's formula. Differentiation and integration of a power
series.

5. Improper integrals. Cauchy criterion and the basic sufficient conditions for con- 
vergence (majorant, Abel-Dirichlet). 

6. Uniform convergence of an improper integral depending on a parameter. Cauchy
criterion and the basic sufficient conditions for uniform convergence ($M$-test,
Abel-Dirichlet).

7. Continuity, differentiation, and integration of a proper integral depending on a 
parameter. 

8. Continuity, differentiation, and integration of an improper integral depending on 
a parameter. Dirichlet integral. 

9. Eulerian integrals. Domains of definition, differential properties, reduction for- 
mulas, various representations, interconnections. Poisson integral. 

10. Approximate identities. Theorem on convergence of the convolution. Classical
Weierstrass theorem on uniform approximation of a continuous function by an
algebraic polynomial.

11. Vector spaces with an inner product. Continuity of the inner product and algebraic
properties connected with it. Orthogonal and orthonormal systems of vectors.
Pythagorean theorem. Fourier coefficients and Fourier series. Examples of inner
products and orthogonal systems in spaces of functions.

12. Orthogonal complement. Extremal property of Fourier coefficients. Bessel’s in- 
equality and convergence of the Fourier series. Conditions for completeness of an 
orthonormal system. Method of least squares. 

13. Classical (trigonometric) Fourier series in real and complex form.
Riemann-Lebesgue lemma. Localization principle and convergence of a Fourier series at
a point. Example: expansion of $\cos(\alpha x)$ in a Fourier series and the expansion of
$\sin(\pi x)/(\pi x)$ in an infinite product.

14. Smoothness of a function, rate of decrease of its Fourier coefficients, and rate 
of convergence of its Fourier series. 

15. Completeness of the trigonometric system and mean convergence of a trigono- 
metric Fourier series. 

16. Fourier transform and the Fourier integral (the inversion formula). Example:
computation of $\hat f$ for $f(x) := \exp(-a^2 x^2)$.

17. Fourier transform and the derivative operator. Smoothness of a function and the 
rate of decrease of its Fourier transform. Parseval’s equality. The Fourier transform 
as an isometry of the space of rapidly decreasing functions. 

18. Fourier transform and convolution. Solution of the one-dimensional heat equa- 
tion. 

19. Recovery of a transmitted signal from the spectral function of a device and the 
signal received. Sampling theorem (Kotel’nikov-Shannon formula). 

20. Asymptotic sequences and asymptotic series. Example: asymptotic expansion 
of Ei(x). Difference between convergent and asymptotic series. Asymptotic Laplace 
integral (principal term). Stirling’s formula. 


2 Integral Calculus (Several Variables) 

1. Riemann integral on an n -dimensional interval. Lebesgue criterion for existence 
of the integral. 

2. Darboux criterion for the existence of the integral of a real-valued function on an 
n -dimensional interval. 

3. Integral over a set. Jordan measure (content) of a set and its geometric mean- 
ing. Lebesgue criterion for existence of the integral over a Jordan-measurable set. 
Linearity and additivity of the integral. 

4. Estimates of the integral. 

5. Reduction of a multiple integral to an iterated integral. Fubini’s theorem and its 
most important corollaries. 

6. Formula for change of variables in a multiple integral. Invariance of measure and 
the integral. 

7. Improper multiple integrals: basic definitions, the majorant criterion for conver- 
gence, canonical integrals. Computation of the Euler-Poisson integral. 
8. Surfaces of dimension $k$ in $\mathbb{R}^n$ and the basic methods of defining them. Abstract
$k$-dimensional manifolds. Boundary of a $k$-dimensional manifold as a $(k-1)$-dimensional
manifold without boundary.

9. Orientable and nonorientable manifolds. Methods of defining the orientation of
an abstract manifold and a (hyper)surface in $\mathbb{R}^n$.

Orientability of the boundary of an orientable manifold. Orientation on the bound- 
ary induced from the manifold. 

10. Tangent vectors and the tangent space to a manifold at a point. Interpretation of 
a tangent vector as a differential operator. 

11. Differential forms in a region $D \subset \mathbb{R}^n$. Examples: differential of a function,
work form, flux form. Coordinate expression of a differential form. Exterior derivative
operator.

12. Mapping of objects and the adjoint mapping of functions on these objects. 
Transformation of points and vectors of tangent spaces at these points under a 
smooth mapping. Transfer of functions and differential forms under a smooth map- 
ping. A recipe for carrying out the transfer of forms in coordinate form. 

13. Commutation of the transfer of differential forms with exterior multiplication 
and differentiation. Differential forms on a manifold. Invariance (unambiguous na- 
ture) of operations on differential forms. 

14. A scheme for computing work and flux. Integral of a $k$-form over a $k$-dimensional
smooth oriented surface. Taking account of orientation. Independence
of the integral of the choice of parametrization. General definition of the integral of
a differential $k$-form over a $k$-dimensional compact oriented manifold.

15. Green’s formula on a square, its derivation, interpretation, and expression in the
language of integrals of the corresponding differential forms. General Stokes formula.
Reduction to a $k$-dimensional interval and proof for a $k$-dimensional interval.
Classical integral formulas of analysis as particular versions of the general Stokes
formula.

16. Volume element on $\mathbb{R}^n$ and on a surface. Dependence of volume element on
orientation. The integral of first kind and its independence of orientation. Area and
mass of a material surface as an integral of first kind. Expression of volume element
of a $k$-dimensional surface $S^k \subset \mathbb{R}^n$ in local parameters and expression of volume
element of a hypersurface $S^{n-1} \subset \mathbb{R}^n$ in Cartesian coordinates of the ambient space.

17. Basic differential operators of field theory (grad, curl, div) and their connection
with the exterior derivative operator $d$ in oriented Euclidean space $\mathbb{R}^3$.

18. Expression of work and flux of a field as integrals of first kind. Basic integral
formulas of field theory in $\mathbb{R}^3$ as the vector expression of the classical integral
formulas of analysis.

19. A potential field and its potential. Exact and closed forms. A necessary differ- 
ential condition for a form to be exact and for a vector field to be a potential field. Its 
sufficiency in a simply connected domain. Integral criterion for exactness of 1 -forms 
and vector fields. 

20. Local exactness of a closed form (Poincaré’s lemma). Global analysis. Homology
and cohomology. De Rham theorem (formulation).

21. Examples of the application of the Stokes (Gauss-Ostrogradskii) formula: 
derivation of the basic equations of the mechanics of continuous media. Physical 
meaning of the gradient, curl, and divergence. 

22. Hamilton’s Nabla operator, and computation of work with it. Gradient, curl and 
divergence in a 3-dimensional orthogonal system of curvilinear coordinates. 



Examination Problems 

(Series and Integrals Depending on a Parameter) 


1. We shall consider a sequence of real-valued functions $\{f_n\}$ defined on the interval
$[0, 1]$, for example.

a) What types of convergence for a sequence of functions do you know? 

b) Provide the definition of each of them. 

c) What are the relations between them? (Prove the relation or give an explana- 
tory example when there is no such relation.) 

2. Let $f$ be a periodic function with period $2\pi$. Suppose it is identically zero on the
interval $]-\pi, 0[$ and $f(x) = 2x$ on the interval $[0, \pi]$. Calculate the sum $S$ of the
standard trigonometric Fourier series of this function.

3. a) We know the expansion in power series of the function $(1 + x)^{-1}$ (geometric
progression). Obtain from it the expansion in a power series of the function $\ln(1 + x)$
and justify your steps.

b) What is the radius of convergence of the obtained series? 

c) Does this series converge at $x = 1$, and if so, is its sum equal to $\ln 2$? Why?

4. a) It is known that the spectral function (characteristic function) $p$ of a linear
device (operator) $A$ is everywhere nonzero. How can we find the transmitted signal
$f$ if we know the function $p$ and the received signal $g = Af$?

b) Let the function $p$ be defined by $p(\omega) = 1$ for $|\omega| \le 10$ and $p(\omega) = 0$ for
$|\omega| > 10$. Suppose that we know the spectrum $\hat g$ (Fourier transform) of the received
signal $g$ and that it is exactly $\hat g(\omega) = 1$ for $|\omega| \le 1$ and $\hat g(\omega) = 0$ for $|\omega| > 1$. Finally,
suppose that it is also known that the input signal $f$ contains no frequencies
other than those transmitted by the device $A$ (i.e., none beyond the
frequencies $|\omega| \le 10$). Find the input signal $f$.

5. Using Euler’s $\Gamma$ function and Laplace’s method, obtain the very useful asymptotic
Stirling’s formula $n! \sim \sqrt{2\pi n}\,\bigl(\frac{n}{e}\bigr)^n$.
Intermediate Problems

(Integral Calculus of Several Variables) 


1. Compute the values of the following forms $\omega$ in $\mathbb{R}^n$ on the given set of vectors.

a) $\omega = x^1\,dx^1$ applied to the vector $\xi = (1, 2, 3) \in T\mathbb{R}^3_{(3,2,1)}$.

b) $\omega = dx^1 \wedge dx^3 + x^1\,dx^2 \wedge dx^4$ applied to the ordered pair of vectors
$(\xi_1, \xi_2)$, where $\xi_1, \xi_2 \in T\mathbb{R}^4_{(1,0,0,0)}$. (Set $\xi_i = (\xi_i^1, \dots, \xi_i^4)$.)

2. Let $f^1, \dots, f^n$ be smooth functions with argument $x = (x^1, \dots, x^n) \in \mathbb{R}^n$.
Express the form $df^1 \wedge \cdots \wedge df^n$ in terms of the forms $dx^1, \dots, dx^n$.

3. Let $F$ be a vector field of a force acting on a domain $D \subset \mathbb{R}^3$. By the action
of this vector field an object was transferred along a smooth path $\gamma \subset D$ from the
point $a \in D$ to the point $b \in D$. Calculate the work done by the vector field in this
process.

a) Write the formula for the calculation of this work as an integral of the first
type and as an integral of the second type (i.e., in terms of $ds$ and $dx$, $dy$, $dz$,
respectively).

b) Prove that in the case of the gravitational vector field, this work does not 
depend on the path and that it is equal to ... ? 

4. Consider the following problem about the flux of a vector field. 

a) One has the vector field $V$ (for instance, the velocity field of some
current) on the domain $D \subset \mathbb{R}^3$. Write a formula for the calculation of the flux of
the vector field $V$ through the oriented surface $S = S_+ \subset D$ as an integral of the first
type and as an integral of the second type (i.e., in terms of $d\sigma$ and $dy \wedge dz$, $dz \wedge dx$,
$dx \wedge dy$, respectively).

b) Consider a convex polyhedral domain $D \subset \mathbb{R}^3$. On each of its faces is
constructed a vector pointing in the direction of the exterior normal with magnitude equal
to the area of the corresponding face. Physics states that the sum of these vectors is
equal to zero (otherwise, we could build a perpetual motion device). Mathematics
agrees. Prove this fact.

c) Deduce Archimedes’s law by a direct computation (calculate the buoyancy 
force acting on a submerged body in a bathtub completely filled with water, for 
example, as the resulting pressure on the surface of the body). 
Appendix A
Series as a Tool 
(Introductory Lecture) 


When a geological deposit is discovered, it is explored and then exploited. In math- 
ematics, it is also like that. Axiomatics and useful formalisms arise as the result of 
solving concrete questions and problems. They do not fall down from the sky, as it 
seems to inexperienced students when everything starts with axioms. 

This course is largely dedicated to series, i.e., basically limits of sequences. We 
shall give at least an initial idea of how and where this tool works, in order to con- 
vince ourselves that the study of this remarkably effective machinery, namely the 
theory of series, does not reduce to the abstract study of the convergence of series 
(the existence of a limit). 


A.1 Getting Ready

A.1.1 The Small Bug on the Rubber Rope 

(Problem proposed by the academician L.B. Okun to the academician A.D. Sakharov.)¹

Problem 1 You hold one end of a 1 km long rubber rope. A small bug crawls toward 
you from the other end, which is fixed, with a speed of 1 cm/s. As soon as it crawls 
one centimeter, you stretch the rubber rope another kilometer every time. Does the 
insect ever reach your hand? And if it does, how long will it take? 


¹ Martin Gardner, in his book Time Travel and Other Mathematical Bewilderments (New York:
W. H. Freeman & Company, 1987, English, p. 295), writes, “This delightful problem, which has
the flavor of a Zeno paradox, was devised by Denys Wilquin of New Caledonia. It appeared first in
December 1972 in Pierre Berloquin’s lively puzzle column in the French monthly Science et Vie.”

A.1.2 Integral and Estimation of Sums

After some thinking, it may occur to you that the following sum might be useful in
finding the answer: $S_n = 1 + \frac{1}{2} + \frac{1}{3} + \cdots + \frac{1}{n}$.

Problem 2 Recall the integral, and show that $S_n - 1 < \int_1^n \frac{1}{x}\,dx < S_{n-1}$.
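
The two-sided estimate in Problem 2 can be checked numerically before proving it. The following is only a sketch: the helper names `harmonic` and `check_bounds` are introduced here for illustration, and the integral of $1/x$ over $[1, n]$ is taken to be $\ln n$.

```python
import math

def harmonic(n):
    """Return S_n = 1 + 1/2 + ... + 1/n."""
    return sum(1.0 / k for k in range(1, n + 1))

def check_bounds(n):
    """Check S_n - 1 < ln n < S_{n-1} for an integer n >= 2."""
    integral = math.log(n)  # value of the integral of 1/x from 1 to n
    return harmonic(n) - 1 < integral < harmonic(n - 1)

for n in (2, 10, 1000):
    print(n, harmonic(n) - 1, math.log(n), harmonic(n - 1), check_bounds(n))
```

A numerical check of this kind is no substitute for the comparison of areas under the graph of $1/x$, but it shows how slowly $S_n$ grows.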


A.1.3 From Monkeys to Doctors of Science Altogether in $10^6$ Years

Littlewood in his famous book Littlewood’s Miscellany, speaking about large numbers,
wrote that $10^6$ years is the time needed to convert monkeys into doctors of
science.²

Problem 3 Would the little bug arrive in time for the thesis defense or at least before 
the end of the universe? 
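
One possible way to get an order of magnitude for Problem 3 is sketched below. It rests on an assumed model that the text does not spell out: during second $k$ the rope is $k$ km long, so the bug gains the fraction $1/(k \cdot 10^5)$ of the rope, and it arrives once $S_n \ge 10^5$. The constants `ROPE_CM` and `SECONDS_PER_YEAR` are illustrative names, and the bounds come from the inequality of Problem 2.

```python
import math

ROPE_CM = 100_000            # 1 km expressed in centimetres
SECONDS_PER_YEAR = 3.156e7   # rough value, only for an order of magnitude

# Assumed model: the bug arrives at the first n with S_n >= ROPE_CM.
# From S_n - 1 < ln n and ln(n + 1) < S_n it follows that
#     exp(ROPE_CM - 1) < n <= exp(ROPE_CM),
# so we work with decimal logarithms instead of the huge number itself.
log10_n_low = (ROPE_CM - 1) / math.log(10)
log10_n_high = ROPE_CM / math.log(10)

# Convert the lower bound from seconds to years.
log10_years_low = log10_n_low - math.log10(SECONDS_PER_YEAR)

print("log10 of the time in seconds lies between", log10_n_low, "and", log10_n_high)
print("log10 of the time in years is at least", log10_years_low)
```

Under these assumptions the exponent can be compared directly with the exponent 6 in Littlewood's $10^6$ years.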


A.2 The Exponential Function 

A.2.1 Power Series Expansion of the Functions exp, sin, cos 

According to Taylor’s formula with remainder in Lagrange’s form, one has

$$e^x = 1 + \frac{1}{1!}x + \frac{1}{2!}x^2 + \cdots + \frac{1}{n!}x^n + r_n(x),$$

where $r_n(x) = \frac{1}{(n+1)!}e^{\xi}x^{n+1}$ and $|\xi| < |x|$;

$$\cos x = 1 - \frac{1}{2!}x^2 + \frac{1}{4!}x^4 - \cdots + \frac{(-1)^n}{(2n)!}x^{2n} + r_{2n}(x),$$

where $r_{2n}(x) = \frac{1}{(2n+1)!}\cos\bigl(\xi + \frac{\pi}{2}(2n+1)\bigr)x^{2n+1}$ and $|\xi| < |x|$;

$$\sin x = x - \frac{1}{3!}x^3 + \frac{1}{5!}x^5 - \cdots + \frac{(-1)^n}{(2n+1)!}x^{2n+1} + r_{2n+1}(x),$$

where $r_{2n+1}(x) = \frac{1}{(2n+2)!}\sin\bigl(\xi + \frac{\pi}{2}(2n+2)\bigr)x^{2n+2}$ and $|\xi| < |x|$. Since for every
fixed value $x \in \mathbb{R}$, the remainder in each of the above formulas clearly tends to zero


² John E. Littlewood, Littlewood’s Miscellany. Cambridge: Cambridge University Press, 1986,
English, p. 212.
as $n \to \infty$, we can write

$$e^x = 1 + \frac{1}{1!}x + \frac{1}{2!}x^2 + \frac{1}{3!}x^3 + \frac{1}{4!}x^4 + \frac{1}{5!}x^5 + \cdots + \frac{1}{n!}x^n + \cdots,$$

$$\cos x = 1 - \frac{1}{2!}x^2 + \frac{1}{4!}x^4 - \cdots + \frac{(-1)^n}{(2n)!}x^{2n} + \cdots,$$

$$\sin x = x - \frac{1}{3!}x^3 + \frac{1}{5!}x^5 - \cdots + \frac{(-1)^n}{(2n+1)!}x^{2n+1} + \cdots.$$


A.2.2 Exit to the Complex Domain and Euler’s Formula

We substitute the complex number $ix$ for $x$ in the right-hand side of the first of these
equalities. Then, after some simple arithmetic manipulations, we obtain Euler’s
remarkable relation

$$e^{ix} = \cos x + i \sin x.$$

Setting $x = \pi$, we find that $e^{i\pi} + 1 = 0$. This is the famous equation connecting
the fundamental constants of mathematics: $e$ from analysis, $i$ from algebra, $\pi$ from
geometry, 1 from arithmetic, and 0 from logic.

We defined the function exp for purely imaginary values of the argument and
obtained Euler’s formula $e^{ix} = \cos x + i \sin x$, from which, clearly, it also follows
that

$$\cos x = \frac{1}{2}\bigl(e^{ix} + e^{-ix}\bigr) \quad\text{and}\quad \sin x = \frac{1}{2i}\bigl(e^{ix} - e^{-ix}\bigr).$$
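
Euler's formula can be tested numerically by summing the exponential series at a purely imaginary argument. The sketch below assumes a fixed number of terms (40, chosen here only because it is ample for moderate $|x|$); the function name `exp_series` is introduced for illustration.

```python
import math

def exp_series(z, terms=40):
    """Partial sum 1 + z/1! + z^2/2! + ... with `terms` terms."""
    total = 0j
    term = 1 + 0j          # current term z^k / k!, starting with k = 0
    for k in range(terms):
        total += term
        term *= z / (k + 1)
    return total

x = 1.3
lhs = exp_series(1j * x)                      # series evaluated at ix
rhs = complex(math.cos(x), math.sin(x))       # cos x + i sin x
print(abs(lhs - rhs))

# The special case x = pi: e^{i pi} + 1 should vanish up to rounding.
print(abs(exp_series(1j * math.pi) + 1))
```

Separating the even and odd powers of $ix$ in the partial sums is exactly the "simple arithmetic manipulation" that produces the cosine and sine series.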


A.2.3 The Exponential Function as a Limit

We know that $\bigl(1 + \frac{x}{n}\bigr)^n \to e^x$ as $n \to \infty$ for $x \in \mathbb{R}$. It is natural to assume that
$e^z = \lim_{n\to\infty}\bigl(1 + \frac{z}{n}\bigr)^n$, where now $z = x + iy$ is an arbitrary complex number.
A computation of this limit gives $e^z = e^x(\cos y + i \sin y)$.

Problem 4 Verify this and obtain a formula for $\cos z$ and $\sin z$.
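
Before carrying out the computation in Problem 4, one can watch the limit numerically. This is only an illustration, with the sample point $z = 0.5 + 2i$ and the helper name `compound` chosen arbitrarily.

```python
import math

def compound(z, n):
    """Return (1 + z/n)^n for a complex number z and a positive integer n."""
    return (1 + z / n) ** n

z = complex(0.5, 2.0)
# The claimed limit e^x (cos y + i sin y) for z = x + iy.
target = math.exp(z.real) * complex(math.cos(z.imag), math.sin(z.imag))

for n in (10, 1000, 10**6):
    print(n, abs(compound(z, n) - target))
```

The printed errors shrink roughly like $1/n$, which is consistent with the limit but says nothing about why the modulus tends to $e^x$ and the argument to $y$; that is the content of the problem.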


A.2.4 Multiplication of Series and the Basic Property
of the Exponential Function

The expression $e^z = e^x(\cos y + i \sin y)$ for $e^{x+iy}$ can be naturally obtained from
the relation $e^{x+iy} = e^x e^{iy}$ if it is valid for complex values of the argument of the
function exp.
We shall prove this by direct multiplication. Let $u$ and $v$ be complex numbers.
Setting $e^u := \sum_{k=0}^{\infty} \frac{1}{k!}u^k$ and $e^v := \sum_{m=0}^{\infty} \frac{1}{m!}v^m$, we find that

$$e^u \cdot e^v = \Bigl(\sum_{k=0}^{\infty} \frac{1}{k!}u^k\Bigr)\cdot\Bigl(\sum_{m=0}^{\infty} \frac{1}{m!}v^m\Bigr) = \sum_{k=0}^{\infty}\sum_{m=0}^{\infty} \frac{1}{k!\,m!}u^k v^m = \sum_{n=0}^{\infty}\ \sum_{k+m=n} \frac{1}{k!\,m!}u^k v^m = \sum_{n=0}^{\infty} \frac{1}{n!}(u+v)^n = e^{u+v}.$$

We used here the fact that $\sum_{k+m=n} \frac{n!}{k!\,m!}u^k v^m = (u+v)^n$, provided that $uv = vu$.


A.2.5 Exponential of a Matrix and the Role of Commutativity 


What happens if in the expression

$$e^A = 1 + \frac{1}{1!}A + \frac{1}{2!}A^2 + \cdots + \frac{1}{n!}A^n + \cdots,$$

we consider $A$ a square matrix, and 1 is the identity matrix of the same size? For
example, if $A$ is the identity matrix, then it is easy to check that $e^A$ turns out to be a
diagonal matrix, with elements $e$ on the main diagonal.


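Problem 5 a) Calculate exp $A$ for the following matrices $A$:

$$\dots,\qquad \begin{pmatrix} 0 & 0 \\ 1 & 0 \end{pmatrix},\qquad \begin{pmatrix} 0 & 1 \\ 0 & 0 \end{pmatrix}.$$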

b) Let $A_1$ and $A_2$ be the last two matrices of order two. Find $e^{A_1}$, $e^{A_2}$ and check
that $e^{A_1} \cdot e^{A_2} \ne e^{A_1 + A_2}$. What is going on here?

c) Show that $e^{tA} = I + tA + o(t)$ as $t \to 0$.

d) Check that $\det(I + tA) = 1 + t \cdot (\operatorname{tr} A) + o(t)$, where $\operatorname{tr} A$ is the trace of the
square matrix $A$.

e) Prove the important relationship $\det e^A = e^{\operatorname{tr} A}$.
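
Parts b) and e) of Problem 5 lend themselves to a quick experiment. The sketch below sums the exponential series for small matrices stored as nested lists; the helper names (`mat_mul`, `mat_add`, `mat_exp`, `det2`) and the test matrix `B` are assumptions made for this illustration, and truncating at 60 terms is adequate only for matrices of modest size and norm.

```python
import math

def mat_mul(a, b):
    """Product of two square matrices given as nested lists."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def mat_add(a, b):
    """Entrywise sum of two matrices of the same shape."""
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]

def mat_exp(a, terms=60):
    """Partial sum I + A/1! + A^2/2! + ... of the exponential series."""
    n = len(a)
    result = [[float(i == j) for j in range(n)] for i in range(n)]
    term = [row[:] for row in result]           # holds A^k / k!
    for k in range(1, terms):
        term = [[x / k for x in row] for row in mat_mul(term, a)]
        result = mat_add(result, term)
    return result

def det2(m):
    """Determinant of a 2 x 2 matrix."""
    return m[0][0] * m[1][1] - m[0][1] * m[1][0]

A1 = [[0, 0], [1, 0]]
A2 = [[0, 1], [0, 0]]
product = mat_mul(mat_exp(A1), mat_exp(A2))   # e^{A1} e^{A2}
exp_sum = mat_exp(mat_add(A1, A2))            # e^{A1 + A2}
print(product, exp_sum)

B = [[1.0, 2.0], [3.0, 4.0]]                  # arbitrary test matrix, tr B = 5
print(det2(mat_exp(B)), math.exp(5.0))
```

The two printed matrices differ because $A_1 A_2 \ne A_2 A_1$, which is precisely the commutativity hypothesis used in the multiplication of series above.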


A.2.6 Exponential of Operators and Taylor’s Formula 

Let $P(x)$ be a polynomial and $A = \frac{d}{dx}$ the differentiation operator. Then
$(AP)(x) = \frac{dP}{dx}(x) = P'(x)$.



Problem 6 a) Check that the relation $\exp\bigl(t\frac{d}{dx}\bigr)P(x) = P(x + t)$ is what you know
as Taylor’s formula.

b) By the way, how many terms of the series $e^x$ do you have to consider in order
to obtain a polynomial that allows you to calculate $e^x$ on the interval $[-3, 5]$ with
an accuracy up to $10^{-2}$?
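
Part a) of Problem 6 can be illustrated on a concrete polynomial. In the sketch below a polynomial is stored as a list of coefficients (lowest degree first), a representation chosen here for convenience; the names `derivative`, `evaluate`, and `shift_by_series` are hypothetical. Since repeated differentiation eventually annihilates a polynomial, the operator series has only finitely many nonzero terms.

```python
import math

def derivative(coeffs):
    """Coefficients (lowest degree first) of P' given those of P."""
    return [k * c for k, c in enumerate(coeffs)][1:]

def evaluate(coeffs, x):
    """Value of the polynomial with the given coefficients at x."""
    return sum(c * x**k for k, c in enumerate(coeffs))

def shift_by_series(coeffs, x, t):
    """Sum over k of t^k / k! * P^(k)(x), i.e. exp(t d/dx) applied to P at x."""
    total, k, current = 0.0, 0, list(coeffs)
    while current:                       # stops once the derivative is zero
        total += t**k / math.factorial(k) * evaluate(current, x)
        current = derivative(current)
        k += 1
    return total

P = [1.0, -2.0, 0.0, 3.0]                # P(x) = 1 - 2x + 3x^3
print(shift_by_series(P, 0.7, 1.9), evaluate(P, 0.7 + 1.9))
```

The two printed numbers agree up to rounding, which is Taylor's formula for $P$ centered at $x$ and evaluated at $x + t$.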


A.3 Newton’s Binomial 

A.3.1 Expansion in Power Series of the Function $(1 + x)^\alpha$

Newton knew that the binomial expansion

$$(1 + x)^\alpha = 1 + \frac{\alpha}{1!}x + \frac{\alpha(\alpha - 1)}{2!}x^2 + \cdots + \frac{\alpha(\alpha - 1)\cdots(\alpha - n + 1)}{n!}x^n + \cdots$$

is valid for every natural number $\alpha$, and he then observed that this formula remains
valid for arbitrary $\alpha$, except that the number of terms in the sum may be infinite.

For instance, $(1 + x)^{-1} = 1 - x + x^2 - x^3 + \cdots$ if $|x| < 1$.
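
The behaviour of the binomial series for different exponents can be explored with a few lines of code. This is a sketch under the assumption $|x| < 1$ for non-natural $\alpha$; the function name `binomial_series` and the cutoff of 60 terms are illustrative choices.

```python
def binomial_series(alpha, x, terms=60):
    """Partial sum of 1 + alpha*x + alpha*(alpha-1)/2! * x^2 + ..."""
    total = 0.0
    term = 1.0    # alpha*(alpha-1)*...*(alpha-n+1)/n! * x^n, starting at n = 0
    for n in range(terms):
        total += term
        term *= (alpha - n) * x / (n + 1)
    return total

print(binomial_series(3, 0.5), (1 + 0.5) ** 3)        # finite sum for natural alpha
print(binomial_series(-1, 0.5), 1 / (1 + 0.5))        # geometric progression
print(binomial_series(0.5, 0.3), (1 + 0.3) ** 0.5)    # square root
```

For a natural exponent the factor $(\alpha - n)$ vanishes at $n = \alpha$, so every later term is zero and the sum is finite, as in Newton's original formula.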


A.3.2 Integration of a Series and Expansion of $\ln(1 + x)$

By integrating the last series over the interval $[0, x]$, we find that

$$\ln(1 + x) = x - \frac{1}{2}x^2 + \frac{1}{3}x^3 - \cdots \quad\text{for } |x| < 1.$$


A.3.3 Expansion of the Functions $(1 + x^2)^{-1}$ and $\arctan x$

Analogously, we write the expansion $(1 + x^2)^{-1} = 1 - x^2 + x^4 - x^6 + \cdots$, we
integrate its terms over the interval $[0, x]$, and we obtain

$$\arctan x = x - \frac{1}{3}x^3 + \frac{1}{5}x^5 - \cdots.$$

If we set $x = 1$, this expansion seems to imply that $\frac{\pi}{4} = 1 - \frac{1}{3} + \frac{1}{5} - \frac{1}{7} + \cdots$.

Perhaps this is true (and certainly it is), but we have the feeling that we are al- 
ready going beyond the limits of what is permitted. The following example will only 
reinforce our concerns. 
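
A numerical experiment gives some reassurance about the value at $x = 1$, although it proves nothing. The sketch below compares partial sums of the arctangent series with the library value; the name `arctan_partial` is introduced here for illustration.

```python
import math

def arctan_partial(x, terms):
    """Partial sum x - x^3/3 + x^5/5 - ... with the given number of terms."""
    return sum((-1) ** k * x ** (2 * k + 1) / (2 * k + 1) for k in range(terms))

# Inside the interval of convergence the series converges quickly.
print(arctan_partial(0.5, 30), math.atan(0.5))

# At the endpoint x = 1 the partial sums approach pi/4 only very slowly.
for terms in (10, 1000, 10000):
    print(terms, abs(arctan_partial(1.0, terms) - math.pi / 4))
```

The slow decay of the error at $x = 1$ is a first hint that the endpoint of the interval of convergence requires a separate argument.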
A.3.4 Expansion of $(1 + x)^{-1}$ and Computing Curiosities

For $x = 1$, the expansion $(1 + x)^{-1} = 1 - x + x^2 - x^3 + \cdots$ leads to the equality
$\frac{1}{2} = 1 - 1 + 1 - 1 + \cdots$.

By grouping terms, we can obtain $\frac{1}{2} = (1 - 1) + (1 - 1) + \cdots = 0$ and we can
obtain $\frac{1}{2} = 1 + (-1 + 1) + (-1 + 1) + \cdots = 1$.

After this, it is necessary to question almost everything that we have done so 
successfully and nonchalantly by multiplying the infinite sums (series), rearranging 
and grouping their terms, and integrating them. All this must obviously be clarified. 
We shall do it soon, but before that, we mention yet another area where series are 
commonly used. 


A.4 Solution of Differential Equations 
A.4.1 Method of Undetermined Coefficients 

Consider the simplest equation $\ddot x + x = 0$ of harmonic oscillations. We shall look for
the solution as a series $x(t) = a_0 + a_1 t + a_2 t^2 + \cdots$. Substituting the series into the
equation, grouping the terms with equal powers of $t$, and equating the coefficients
with the same powers in $t$ on both sides of the equation, we obtain an infinite system
of equations:

$$2a_2 + a_0 = 0,\quad 2 \cdot 3\,a_3 + a_1 = 0,\quad 3 \cdot 4\,a_4 + a_2 = 0,\quad \dots$$

If the initial conditions $x(0) = x_0$ and $x'(0) = v_0$ are given, then from the series
$x(t) = a_0 + a_1 t + a_2 t^2 + \cdots$ and $x'(t) = a_1 + 2a_2 t + \cdots$, we find that $a_0 = x_0$ and
$a_1 = v_0$. If we know $a_0$ and $a_1$, we can find successively and uniquely the remaining
coefficients of the expansion.

For example, if $x(0) = 0$ and $x'(0) = 1$, then

$$x(t) = t - \frac{1}{3!}t^3 + \frac{1}{5!}t^5 - \cdots = \sin t,$$

and if $x(0) = 1$ and $x'(0) = 0$, then

$$x(t) = 1 - \frac{1}{2!}t^2 + \frac{1}{4!}t^4 - \cdots = \cos t.$$
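
The method of undetermined coefficients is easy to mechanize for this equation. The sketch below assumes the general form of the recurrence, $(n+1)(n+2)\,a_{n+2} + a_n = 0$, which extends the three equations written above; the function names and the number of coefficients are illustrative.

```python
import math

def series_coefficients(x0, v0, count):
    """First `count` coefficients a_0, a_1, ... of the series solution."""
    a = [x0, v0]                                   # a_0 = x(0), a_1 = x'(0)
    for n in range(count - 2):
        a.append(-a[n] / ((n + 1) * (n + 2)))      # (n+1)(n+2) a_{n+2} + a_n = 0
    return a

def series_value(coeffs, t):
    """Value of the truncated power series at t."""
    return sum(c * t**k for k, c in enumerate(coeffs))

t = 0.8
print(series_value(series_coefficients(0.0, 1.0, 30), t), math.sin(t))
print(series_value(series_coefficients(1.0, 0.0, 30), t), math.cos(t))
```

With $x(0) = 0$, $x'(0) = 1$ the recurrence reproduces the coefficients of $\sin t$, and with $x(0) = 1$, $x'(0) = 0$ those of $\cos t$.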


A.4.2 Use of the Exponential Function

What happens if the solution that we are looking for has the form $x(t) = e^{\lambda t}$? Then
$\ddot x + x = e^{\lambda t}(\lambda^2 + 1) = 0$, and therefore $\lambda^2 + 1 = 0$, i.e., $\lambda = i$ or $\lambda = -i$. But what
are these strange complex oscillations $x(t) = e^{it}$, $x(t) = e^{-it}$, and
$x(t) = c_1 e^{it} + c_2 e^{-it}$?



Problem 7 Analyze the situation and solve the problem, for example, if $x(0) = 0$
and $x'(0) = 1$ or if $x(0) = 1$ and $x'(0) = 0$. Recall Euler’s formula and compare
your results with those obtained above.


A.5 The General Idea About Approximation and Expansion 

A.5.1 The Meaning of a Positional Number System. Irrational 
Numbers 

Recall the usual representation of the number $\pi = 3.1415926\ldots$ or in general a decimal
expansion $a_0.a_1a_2a_3\ldots$: this is the sum $a_0 10^0 + a_1 10^{-1} + a_2 10^{-2} + a_3 10^{-3} + \cdots$.

We know that a finite expansion corresponds to a rational number, and the repre- 
sentation of an irrational number requires an infinite number of decimal digits, and 
therefore requires the study of an infinite number of terms and infinite sums, i.e., 
series. 

If we truncate a series at some point, we get a rational number. We usually work
with such numbers. What happened here? We have simplified the object, allowing some
error. This means that we are approximating a complex object (an irrational number 
in this case) through some other objects (the rational numbers here), while allowing 
some error, which we call the degree of precision of the approximation. An im- 
provement in the precision leads to the complication of the object that we use as an 
approximation. A compromise has to be found depending on the concrete circum- 
stances. 


A.5. 2 Expansion of a Vector in a Basis and Some Analogies 
with Series 

In linear algebra and in geometry, we decompose vectors in terms of a basis. For 
mathematical analysis, the traditional representation 

$$f(x) = f(0) + \frac{1}{1!}f'(0)x + \frac{1}{2!}f''(0)x^2 + \cdots$$

actually means the same thing if we consider that the basis is the set of functions
$e_n = x^n$. This is the Taylor series of the function $f$ at the point $x_0 = 0$.

Analogously, if some periodic signal or process $f(t)$ is subjected to spectral analysis,
then one is interested in its decomposition $f(t) = \sum_{n=0}^{\infty} (a_n \cos nt + b_n \sin nt)$
into the simplest harmonic oscillations. Such series are called classical (or
trigonometric) Fourier series.

What is new in this situation, in comparison with that in linear algebra, is that we 
consider here an infinite sum, which is understood as the limit of finite sums. 



Thus in the space of our objects one must define, in addition to the structure of a
linear space, the concept of proximity between the objects, so that one can
consider the limit of a sequence of the objects themselves or of their sums.


A.5.3 Distance 

The proximity between objects is determined by the presence of a particular con- 
cept, the concept of neighborhood of an object (neighborhood of a point in the 
space). This is the same as specifying a topology in the space. In topological spaces 
it is possible to speak about limits and continuity. 

If in a space, a distance between objects, i.e., the points of the space, is somehow
introduced, then the neighborhoods of a point are automatically defined, and even
more specifically, the $\delta$-neighborhoods of a point.

The distance between points of the same space can be measured in different 
ways. For example, the distance between two continuous functions over an interval 
can be measured by the maximum of the absolute value of the difference between 
the values of the functions on this interval (uniform metric), and it is also possible to 
measure it by the integral of the absolute value of the difference of the functions over 
this interval (integral metric). The choice of the metric is dictated by the problem 
under consideration. 
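
The difference between the two metrics is easy to see on an example. The sketch below approximates both distances on a grid; the functions compared, $x^{50}$ and the zero function on $[0, 1]$, and the helper names are choices made here for illustration, and the sampling only approximates the true maximum and integral.

```python
def uniform_distance(f, g, a, b, samples=10_001):
    """Approximate max |f - g| on [a, b] by sampling on a uniform grid."""
    step = (b - a) / (samples - 1)
    return max(abs(f(a + i * step) - g(a + i * step)) for i in range(samples))

def integral_distance(f, g, a, b, samples=10_000):
    """Approximate the integral of |f - g| over [a, b] by the midpoint rule."""
    step = (b - a) / samples
    return step * sum(abs(f(a + (i + 0.5) * step) - g(a + (i + 0.5) * step))
                      for i in range(samples))

def power(x):
    return x ** 50

def zero(x):
    return 0.0

print(uniform_distance(power, zero, 0.0, 1.0))    # close to 1
print(integral_distance(power, zero, 0.0, 1.0))   # close to 1/51
```

In the uniform metric the function $x^{50}$ is at distance 1 from zero, while in the integral metric it is at distance $1/51$, so a sequence such as $x^n$ can converge to zero in one metric and not in the other.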



Appendix B 

Change of Variables in Multiple Integrals 
(Deduction and First Discussion of the Change 
of Variables Formula)¹


B.1 Formulation of the Problem and a Heuristic Derivation
of the Change of Variables Formula 


By studying the integral in the one-dimensional case, at some moment we obtained 
an important change of variables formula for such an integral. Our task now is to 
find a change of variables formula in the general case. We formulate the problem 
more precisely. 

Let $D_x$ be a set in $\mathbb{R}^n$, $f$ an integrable function on $D_x$, and $\varphi : D_t \to D_x$ a
mapping $t \mapsto \varphi(t)$ from the set $D_t \subset \mathbb{R}^n$ to $D_x$. The question is, what is the law,
assuming that we know $f$ and $\varphi$, that allows us to find a function $\psi$ on $D_t$ such that
we have the equality

$$\int_{D_x} f(x)\,dx = \int_{D_t} \psi(t)\,dt,$$

which reduces the computation of an integral over $D_x$ to an integral over $D_t$?

We suppose first that $D_t$ is an $n$-dimensional interval $I \subset \mathbb{R}^n$ and $\varphi : I \to D_x$
is a diffeomorphic mapping from $I$ onto $D_x$. To every partition of the interval $I$
into subintervals $I_1, I_2, \dots, I_k$ corresponds a partition of $D_x$ into subsets $\varphi(I_i)$,
$i = 1, \dots, k$. If all these sets are measurable and intersect pairwise only on sets
of measure zero, then by the additivity of the integral,

$$\int_{D_x} f(x)\,dx = \sum_{i=1}^{k} \int_{\varphi(I_i)} f(x)\,dx. \tag{B.1}$$

If the function $f$ is continuous on $D_x$, then the mean value theorem implies

$$\int_{\varphi(I_i)} f(x)\,dx = f(\xi_i)\,\mu\bigl(\varphi(I_i)\bigr),$$

¹ Fragment of a lecture with an alternative and independent proof of the change of variables
formula.
where $\xi_i \in \varphi(I_i)$. Since $f(\xi_i) = f(\varphi(\tau_i))$, with $\tau_i = \varphi^{-1}(\xi_i)$, it remains for us
to link $\mu(\varphi(I_i))$ with $\mu(I_i) = |I_i|$.

If $\varphi$ is a linear transform, then $\varphi(I_i)$ is a parallelepiped, whose volume we know
from analytical geometry and algebra and is equal to $|\det \varphi'|\,\mu(I_i)$. But a
diffeomorphism is locally almost a linear map. Therefore, if the size of the intervals
$I_i$ is sufficiently small, then it can be assumed, with a small relative error, that
$\mu(\varphi(I_i)) \approx |\det \varphi'(\tau_i)|\,\mu(I_i)$ (it is possible to prove that with a proper choice of
the point $\tau_i \in I_i$, one has the exact equality). In this way,

$$\sum_{i=1}^{k} \int_{\varphi(I_i)} f(x)\,dx \approx \sum_{i=1}^{k} f(\varphi(\tau_i))\,|\det \varphi'(\tau_i)| \cdot |I_i|. \tag{B.2}$$


However, on the right-hand side of this approximate equality there is the integral
sum of the function $f(\varphi(t))\,|\det \varphi'(t)|$ over the interval $I$, corresponding to the
partition $P$ of this interval with marked points $\tau$. In the limit $\lambda(P) \to 0$, from
equations (B.1) and (B.2) we get

$$\int_{D_x} f(x)\,dx = \int_{D_t} f(\varphi(t))\,|\det \varphi'(t)|\,dt. \tag{B.3}$$
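
The heuristic can be tried out on a familiar diffeomorphism. The sketch below forms the integral sum from the right-hand side of (B.2) for polar coordinates, an example chosen here and not taken from the text: $\varphi(r, \theta) = (r\cos\theta, r\sin\theta)$ maps the rectangle $[1, 2] \times [0, \pi/2]$ onto a quarter of an annulus, whose area $3\pi/4$ is known from elementary geometry. All function names are illustrative.

```python
import math

def phi(r, theta):
    """Polar coordinate map (r, theta) -> (x, y)."""
    return r * math.cos(theta), r * math.sin(theta)

def jacobian_det(r, theta):
    """Determinant of the Jacobi matrix [[cos, -r sin], [sin, r cos]]."""
    return (math.cos(theta) * (r * math.cos(theta))
            - (-r * math.sin(theta)) * math.sin(theta))

def riemann_sum(f, r_range, theta_range, n):
    """Sum of f(phi(tau_i)) |det phi'(tau_i)| |I_i| over an n x n partition."""
    (r0, r1), (t0, t1) = r_range, theta_range
    dr, dt = (r1 - r0) / n, (t1 - t0) / n
    total = 0.0
    for i in range(n):
        for j in range(n):
            # marked point tau_i: the centre of the subinterval
            r, theta = r0 + (i + 0.5) * dr, t0 + (j + 0.5) * dt
            x, y = phi(r, theta)
            total += f(x, y) * abs(jacobian_det(r, theta)) * dr * dt
    return total

area = riemann_sum(lambda x, y: 1.0, (1.0, 2.0), (0.0, math.pi / 2), 200)
print(area, 3 * math.pi / 4)
```

With $f \equiv 1$ the sum approximates the measure of the image, so the agreement with $3\pi/4$ is a check of the approximation $\mu(\varphi(I_i)) \approx |\det \varphi'(\tau_i)| \cdot |I_i|$ itself.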


This is the required formula together with its explanation. Note that it is possible 
to justify rigorously each step of this deduction, which led us to the formula. Strictly 
speaking, we need to prove only the validity of the last passage to the limit, that the 
integral on the right-hand side of (B.3) exists, and also to explain the approximation 
$\mu(\varphi(I_i)) \approx |\det \varphi'(\tau_i)| \cdot |I_i|$.

Let us do it. 
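Before the rigorous argument, the heuristic behind (B.2) and (B.3) can be checked numerically. The following is only a sketch under stated assumptions: the map $\varphi$ is taken to be the polar-coordinate map on a quarter annulus (where it is a diffeomorphism), the test function is $f(x, y) = x^2 + y^2$, and the helper name `riemann_sum_polar` is hypothetical. The integral sum on the right-hand side of (B.2) is compared with the value $15\pi/8$ of the integral of $f$ over the quarter annulus.

```python
import math

def riemann_sum_polar(f, r_range, theta_range, steps=200):
    """Midpoint integral sum of f(phi(t)) * |det phi'(t)| over a rectangle I.

    Assumption: phi(r, theta) = (r cos theta, r sin theta), so |det phi'| = r.
    """
    (r0, r1), (th0, th1) = r_range, theta_range
    dr = (r1 - r0) / steps
    dth = (th1 - th0) / steps
    total = 0.0
    for i in range(steps):
        r = r0 + (i + 0.5) * dr              # first coordinate of the marked point
        for j in range(steps):
            th = th0 + (j + 0.5) * dth       # second coordinate of the marked point
            x, y = r * math.cos(th), r * math.sin(th)
            total += f(x, y) * r * dr * dth  # f(phi(tau)) |det phi'(tau)| |I_i|
    return total

# Quarter annulus 1 <= r <= 2, 0 <= theta <= pi/2, with f(x, y) = x^2 + y^2.
approx = riemann_sum_polar(lambda x, y: x * x + y * y, (1.0, 2.0), (0.0, math.pi / 2))
exact = 15 * math.pi / 8
print(approx, exact)
```

Refining the partition (a larger `steps`) plays the role of the limit $\lambda(P) \to 0$.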


B.2 Some Properties of Smooth Mappings and Diffeomorphisms 

a) Recall that a smooth mapping $\varphi$ from a closed and bounded interval $I \subset \mathbb{R}^n$ (or from any other convex compact subset) is a Lipschitz function. This follows from the mean value theorem and the boundedness of $\varphi'$ (because of its continuity) over a compact set:

$$|\varphi(t_2) - \varphi(t_1)| \le \sup_{\tau \in [t_1, t_2]} \|\varphi'(\tau)\| \cdot |t_2 - t_1| \le L\,|t_2 - t_1|. \tag{B.4}$$

b) Thus, the distance between the images of two points under the mapping $\varphi$ cannot exceed $L$ times the distance between the points.

For instance, if some subset $E \subset I$ has diameter $d$, then the diameter of its image $\varphi(E)$ is not more than $Ld$, and the set $\varphi(E)$ can be covered with an ($n$-dimensional) cube with edges of size $Ld$ and volume $(Ld)^n$.

Thus if $E$ is a cube with edges of size $\delta$ and volume $\delta^n$, then its image is covered by a standard coordinate cube of volume $(L\sqrt{n}\,\delta)^n$.


c) It follows from this that the image of a set of measure zero under a smooth mapping also has measure zero (in the sense of $n$-dimensional measure). [After all, in the definition of a set of measure zero, it is possible to consider coverings by cubes instead of coverings by general $n$-dimensional intervals, i.e., "rectangular parallelepipeds", as we can easily see.]

If a smooth mapping $\varphi : D_t \to D_x$ also has a smooth inverse mapping $\varphi^{-1} : D_x \to D_t$, i.e., if $\varphi$ is a diffeomorphism, then it is clear that the pre-image of a set with measure zero also has measure zero.

d) Since under a diffeomorphism, the Jacobian of the mapping $\det \varphi'$ is everywhere different from zero, and the mapping itself is bijective, then (due to the inverse function theorem) the interior points of any set under such a mapping are transformed into interior points of the image of this set, and the boundary points are transformed into boundary points of the image.

Recall the definition of an admissible (Jordan-measurable) set, as a bounded set 
whose boundary set has measure zero; thus we can conclude that under diffeomor- 
phisms, the image of a measurable set is again a measurable set. 

(This is also true for any smooth mapping. However, for diffeomorphisms it is 
even true that the pre-image of a measurable set is also a measurable set.) 

e) The latter means in particular that if $\varphi : D_t \to D_x$ is a diffeomorphism, then from the existence of the integral on the left-hand side of formula (B.3) there follows (by Lebesgue's criterion) the existence of the integral on the right-hand side.


B.3 Relation Between the Measures of the Image and the 
Pre-image Under Diffeomorphisms 

We shall show that if $\varphi : I \to \varphi(I)$ is a diffeomorphism, then

$$|\varphi(I)| = \int_I \det \varphi'(t)\,dt \tag{B.5}$$

under the assumption that the integrand $\det \varphi'$ is positive.

Hence, by the mean value theorem, in particular, we find that there is a point $\tau \in I$ such that

$$|\varphi(I)| = \det \varphi'(\tau)\,|I|. \tag{B.6}$$

Formula (B.5) is actually a particular case of (B.3), when $f \equiv 1$.

For linear mappings, this formula is already known, although perhaps without 
discussing those details related to the fact that it is valid (for linear maps) not only 
for simple parallelepipeds but for all measurable sets. Let us clarify this. We know 
that a linear map is the composite of elementary linear mappings, which, up to a possible permutation of a pair of coordinates, reduce to a change in only one of the coordinates: multiplying a coordinate by a number, or adding one coordinate to another. Fubini's theorem allows us to determine that in the first case, the volume of any measurable set is multiplied by the same factor that multiplies the coordinate (more precisely, by its absolute value if we consider nonoriented volume). In the second case, although the figure changes, its volume remains the same, since each corresponding one-dimensional section only moves, keeping its linear measure. Finally, a permutation of a pair of coordinates changes the orientation of the spatial frame (the determinant of such a linear transformation is $-1$), but it does not change the nonoriented volume of the figure. (In the language of Fubini's theorem, this is just a change in the order of two integrations.)

It now remains to recall that the determinant of the composition of linear map- 
pings is the product of the determinants of the factors. 

Thus, considering that for linear and affine mappings the formula (B.5) is already 
established, we prove it for an arbitrary diffeomorphism with positive Jacobian. 

a) We use again the finite-increment theorem, but now to estimate the possible deviation of the mapping $\varphi : I \to \varphi(I)$ from the affine mapping $t \mapsto A(t) = \varphi(a) + \varphi'(a)(t - a)$, where $t$ is a variable, and $a$ is a fixed point in the interval $I$. The mapping $A : I \to A(I)$ is simply the linear part of the Taylor expansion of the function $\varphi$ at the point $a \in I$.

If we apply the finite-increment (mean value) theorem to the function $t \mapsto \varphi(t) - \varphi'(a)(t - a)$, we obtain

$$|\varphi(t) - \varphi(a) - \varphi'(a)(t - a)| \le \sup_{\tau \in [a, t]} \|\varphi'(\tau) - \varphi'(a)\| \cdot |t - a|. \tag{B.7}$$

Given the uniform continuity of the continuous function $\varphi'$ on the compact set $I$, from equation (B.7) we conclude that there is a nonnegative function $\delta \mapsto \varepsilon(\delta)$, tending to zero as $\delta \to +0$, such that for any two points $t, a \in I \subset \mathbb{R}^n$,

$$|t - a| \le \sqrt{n}\,\delta \implies |\varphi(t) - A(t)| = |\varphi(t) - \varphi(a) - \varphi'(a)(t - a)| \le \varepsilon(\delta)\,\delta. \tag{B.8}$$

b) Now we go back to the proof of formula (B.5). First we shall carry out a small technical simplification: we shall assume that the lengths of the edges of the parallelepiped $I$ are commensurable and that therefore, it can be divided into equal cubes $\{I_i\}$ with arbitrarily small (as necessary) edges $\delta_i = \delta$ and volume $\delta_i^n = \delta^n$, i.e., $I = \bigcup_i I_i$ and $|I| = \sum_i |I_i| = \sum_i \delta_i^n$.

In every cube $I_i$, we fix a point $a_i$, we build the corresponding affine mapping $A_i(t) = \varphi(a_i) + \varphi'(a_i)(t - a_i)$, we consider the image $A_i(\partial I_i)$ of the boundary $\partial I_i$ of the cube $I_i$ under the mapping $A_i$, and we consider the $\varepsilon(\delta)\delta$-neighborhood of this image, which we denote by $\Delta_i$. By (B.8), the image $\varphi(\partial I_i)$ of the boundary $\partial I_i$ of the cube $I_i$ under the diffeomorphism $\varphi$ lies in $\Delta_i$. Thus, one has the following inclusions and inequalities:

$$A_i(I_i) \setminus \Delta_i \subset \varphi(I_i) \subset A_i(I_i) \cup \Delta_i,$$

$$|A_i(I_i)| - |\Delta_i| \le |\varphi(I_i)| \le |A_i(I_i)| + |\Delta_i|.$$

When we take the sum over all indices, we have

$$\sum_i |A_i(I_i)| - \sum_i |\Delta_i| \le |\varphi(I)| = \sum_i |\varphi(I_i)| \le \sum_i |A_i(I_i)| + \sum_i |\Delta_i|. \tag{B.9}$$



As $\delta \to +0$,

$$\sum_i |A_i(I_i)| = \sum_i \det \varphi'(a_i)\,|I_i| \to \int_I \det \varphi'(t)\,dt.$$

Therefore, to prove formula (B.5) in our case, it remains to verify that $\sum_i |\Delta_i| \to 0$ if $\delta \to +0$.

c) We estimate from above the volume $|\Delta_i|$, based on the estimates (B.4) and (B.8). According to (B.4), the edges of the parallelepiped $A_i(I_i)$ have length not greater than $L\delta$, where $\delta = \delta_i$ is the length of the edge of the cube $I_i$. Thus the $(n-1)$-dimensional "area" of any of the $2n$ faces of the parallelepiped $A_i(I_i)$ is not greater than $(L\delta)^{n-1}$. We take an $\varepsilon(\delta)\delta$-neighborhood of such a face. Its volume is estimated by the value $(2 + 2)\,\varepsilon(\delta)\delta\,(L\delta)^{n-1}$, where the second 2 absorbs the contribution of the rounded parts of this neighborhood near the boundary of the face. In this way, $|\Delta_i| \le 2n \cdot 4L^{n-1}\varepsilon(\delta)\delta^n$; therefore,

$$\sum_i |\Delta_i| \le 8nL^{n-1} \sum_i \varepsilon(\delta)\,\delta_i^n = 8nL^{n-1}\varepsilon(\delta)\,|I|,$$

and we see that $\sum_i |\Delta_i| \to 0$ for $\delta \to +0$.

d) The estimated values for $|\Delta_i|$ show at the same time that no matter how small a reduction of the edges of the original interval $I$ one might need in order to obtain their commensurability, in the limit this does not affect the result.
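The key approximation $|\varphi(I_i)| \approx \det \varphi'(a_i)\,|I_i|$ can also be observed numerically. The sketch below is an illustration only, under these assumptions: the sample diffeomorphism is the polar map $\varphi(r, \theta) = (r\cos\theta, r\sin\theta)$ with $\det \varphi' = r$, the point $a_i$ is the corner $(r_0, \theta_0)$ of the square, and the area of the image is approximated by the shoelace formula on a densely sampled boundary. The function names are hypothetical.

```python
import math

def phi(r, theta):
    """Polar-coordinate map, used here only as a sample diffeomorphism."""
    return (r * math.cos(theta), r * math.sin(theta))

def image_area(r0, th0, delta, samples_per_edge=2000):
    """Shoelace area of the image of the square [r0, r0+delta] x [th0, th0+delta]."""
    corners = [(r0, th0), (r0 + delta, th0),
               (r0 + delta, th0 + delta), (r0, th0 + delta)]
    boundary = []
    for k in range(4):
        (ra, ta), (rb, tb) = corners[k], corners[(k + 1) % 4]
        for s in range(samples_per_edge):
            u = s / samples_per_edge
            boundary.append(phi(ra + u * (rb - ra), ta + u * (tb - ta)))
    area = 0.0
    for (x1, y1), (x2, y2) in zip(boundary, boundary[1:] + boundary[:1]):
        area += x1 * y2 - x2 * y1
    return abs(area) / 2

def relative_error(r0, th0, delta):
    """Relative error of |phi(I)| ~ det phi'(a) |I| at the corner a = (r0, th0)."""
    jacobian = r0                      # det phi'(r, theta) = r for the polar map
    predicted = jacobian * delta ** 2
    return abs(image_area(r0, th0, delta) - predicted) / predicted

for delta in (0.1, 0.01):
    print(delta, relative_error(1.0, 0.3, delta))
```

The relative error shrinks together with the edge $\delta$ of the square, which is the behavior that the estimate of $\sum_i |\Delta_i|$ makes precise.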


B.4 Some Examples, Remarks, and Generalizations 

Thus formula (B.3) for the case $D_t = I$ and a continuous function $f$ is already proved. We shall consider and discuss some examples. These will show at the same time that in fact, we have already proved formula (B.3) not only for the case $D_t = I$ and not only for a continuous function $f$.

a) Negligible sets. As it is used in practice, replacing variables or the use of a 
coordinate transformation formula sometimes has several special features (for ex- 
ample, somewhere there might be a violation of mutual uniqueness, vanishing of 
the Jacobian, or lack of differentiability). Typically, these special features occur on 
sets of measure zero, and are therefore relatively easy to overcome. 

For example, to go from an integral over a disk to an integral over a rectangle, we often make the change of variables

$$x = r\cos\varphi, \qquad y = r\sin\varphi. \tag{B.10}$$

These are the well-known formulas for the transition from polar coordinates to Cartesian coordinates in the plane. The rectangle $I = \{(r, \varphi) \in \mathbb{R}^2 \mid 0 \le r \le R,\ 0 \le \varphi \le 2\pi\}$ under this mapping is transformed into the disk $K = \{(x, y) \in \mathbb{R}^2 \mid x^2 + y^2 \le R^2\}$. This mapping is smooth, but it is not a diffeomorphism: the whole side of the rectangle $I$ on which $r = 0$ is transformed under this mapping into the point $(0, 0)$; the images of the points $(r, 0)$ and $(r, 2\pi)$ coincide. However, if we consider, for example, the sets $I \setminus \partial I$ and $K \setminus E$, where $E$ is the union of the boundary $\partial K$ of the disk $K$ and the radius going to the point $(R, 0)$, then the restriction of the mapping (B.10) to the domain $I \setminus \partial I$ is a diffeomorphism onto the set $K \setminus E$. Therefore, if instead of the rectangle $I$, we take a slightly smaller rectangle $I_\delta$ lying strictly in the interior of $I$, then we can apply formula (B.3) to this rectangle $I_\delta$ and its image $K_\delta$. And then, exhausting the rectangle $I$ with such rectangles $I_\delta$ and noticing that their images exhaust the disk $K$, that $|I_\delta| \to |I|$ and $|K_\delta| \to |K|$, in the limit we obtain formula (B.3) applied to the original pair $K$, $I$.

This applies, of course, to the general polar (spherical) coordinate system in $\mathbb{R}^n$.
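As a small illustration of the exhaustion argument, one can watch the areas of the images of the shrunken rectangles approach the area of the disk. The sketch assumes the specific choice $I_\delta = [\delta, R - \delta] \times [\delta, 2\pi - \delta]$ (one of many possible) and uses formula (B.3) with $f \equiv 1$ and $|\det \varphi'| = r$; the function name is hypothetical.

```python
import math

def area_of_image(R, delta):
    """Area of the image of I_delta = [delta, R - delta] x [delta, 2*pi - delta].

    By (B.3) with f = 1 and |det phi'| = r, this is the integral of r over I_delta.
    """
    r_part = ((R - delta) ** 2 - delta ** 2) / 2   # integral of r dr over [delta, R - delta]
    angle_part = 2 * math.pi - 2 * delta           # length of [delta, 2*pi - delta]
    return r_part * angle_part

R = 1.0
for delta in (0.1, 0.01, 0.001):
    print(delta, area_of_image(R, delta), math.pi * R ** 2)
```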

We shall now develop these observations. 

b) Exhaustions and limit transitions. We define an exhaustion of a set $E \subset \mathbb{R}^m$ to be a sequence of measurable sets $\{E_n\}$ such that $E_n \subset E_{n+1} \subset E$ for every $n \in \mathbb{N}$ and $\bigcup_{n=1}^{\infty} E_n = E$.

Lemma 1 If $\{E_n\}$ is an exhaustion of a measurable set $E$, then

a) $\lim_{n \to \infty} \mu(E_n) = \mu(E)$;

b) for every function $f \in \mathcal{R}(E)$, one has $f|_{E_n} \in \mathcal{R}(E_n)$ and

$$\lim_{n \to \infty} \int_{E_n} f(x)\,dx = \int_E f(x)\,dx.$$

Proof a) Since $E_n \subset E_{n+1} \subset E$, then $\mu(E_n) \le \mu(E_{n+1}) \le \mu(E)$ and $\lim_{n \to \infty} \mu(E_n) \le \mu(E)$. For proving the equality in a), we shall show that the inequality $\lim_{n \to \infty} \mu(E_n) \ge \mu(E)$ also holds.

The boundary $\partial E$ of the set $E$ is compact and has measure zero. Therefore, it can be covered with a finite number of open intervals such that the sum of their volumes is less than $\varepsilon$ for a given $\varepsilon > 0$. Let $\Delta$ be the union of these open intervals. Then the set $O = E \cup \Delta$ is open in $\mathbb{R}^m$; by construction, $O$ contains the closure $\bar{E}$ of the set $E$; and $\mu(O) \le \mu(E) + \mu(\Delta) < \mu(E) + \varepsilon$.

For every set $E_n$ of the exhaustion $\{E_n\}$ we repeat the construction above with $\varepsilon_n = \varepsilon / 2^n$. We obtain then a sequence of open sets $O_n = E_n \cup \Delta_n$ such that $E_n \subset O_n$, $\mu(O_n) \le \mu(E_n) + \mu(\Delta_n) < \mu(E_n) + \varepsilon_n$, and $\bigcup_{n=1}^{\infty} O_n \supset \bigcup_{n=1}^{\infty} E_n \supset E$.

The system of open sets $\Delta, O_1, O_2, \ldots$ is an open cover of the compact set $\bar{E}$.

Let $\Delta, O_1, O_2, \ldots, O_k$ be a finite open subcover of the compact set $\bar{E}$. Since $E_1 \subset E_2 \subset \cdots \subset E_k$, the sets $\Delta, \Delta_1, \ldots, \Delta_k, E_k$ are also a cover of $\bar{E}$, and then

$$\mu(E) \le \mu(E_k) + \mu(\Delta) + \mu(\Delta_1) + \cdots + \mu(\Delta_k) < \mu(E_k) + 2\varepsilon.$$

It follows from this that $\mu(E) \le \lim_{n \to \infty} \mu(E_n)$.

b) The fact that $f|_{E_n} \in \mathcal{R}(E_n)$ is known to us, and it follows from Lebesgue's criterion for the existence of the integral over a measurable set. By the hypothesis $f \in \mathcal{R}(E)$, there exists a constant $M$ such that $|f(x)| \le M$ over $E$. From the additivity of the integral and the general estimates for the integral, we get

$$\left| \int_E f(x)\,dx - \int_{E_n} f(x)\,dx \right| = \left| \int_{E \setminus E_n} f(x)\,dx \right| \le M\mu(E \setminus E_n).$$

Hence, taking into account what we proved in a), we conclude that assertion b) holds. □


The additivity of the integral and the possibility of exhausting the domain of 
integration with the domains where the change of variables formula works (i.e., 
it is directly applicable) allow us to apply the formula to the original domain. In 
general, the idea of exhaustion lies at the heart of many constructions in analysis. In 
particular, it is fundamental in the definition of improper integrals. 



Appendix C 

Multidimensional Geometry and Functions 
of a Very Large Number of Variables 
(Concentration of Measures and Laws of Large 
Numbers) 


C.1 An Observation

Almost the entire volume of a multidimensional body is concentrated in a small 
neighborhood of the boundary of the body. 

Problem 1 a) Check this in the examples of the cube and the ball. Show that if we 
remove the shell with thickness 1 cm from a 1000-dimensional watermelon with 1 
meter radius, then there remains less than a thousandth of the original watermelon. 

b) If we project the sphere $S^{n-1}(r) \subset \mathbb{R}^n$ orthogonally onto a hyperplane passing through the center of the sphere, then we obtain a ball (double covered) of dimension $n - 1$ and the same radius $r$. Considering what we obtained above, notice (still on a qualitative level) that almost all the area of the sphere $S^{n-1}(r)$ for $n \gg 1$ is concentrated in a small neighborhood of the equator, the intersection of the sphere with that hyperplane.
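The watermelon claim in part a) reduces to one line of arithmetic, since the volume of an $n$-dimensional ball scales as the $n$-th power of its radius. A minimal sketch (the function name is hypothetical):

```python
def remaining_fraction(n, radius, shell):
    """Fraction of the volume of an n-dimensional ball left after peeling a shell."""
    return ((radius - shell) / radius) ** n

# Radius 1 m = 100 cm, shell of thickness 1 cm, dimension 1000.
print(remaining_fraction(1000, 100.0, 1.0))
# The same peel in dimension 3 removes only about 3% of the volume.
print(remaining_fraction(3, 100.0, 1.0))
```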


C.2 Sphere and Random Vectors 


Problem 2 a) The sphere $S^{n-1}(r)$ with radius $r$ and center at the origin of the $n$-dimensional Euclidean space $\mathbb{R}^n$ is projected orthogonally onto a coordinate axis. We get the interval $[-r, r]$. We fix another interval $[a, b] \subset [-r, r]$. Let $S_{[a,b]}$ be the area of the part of the sphere $S^{n-1}(r)$ that is projected onto the interval $[a, b]$. Find the quotient $S_{[a,b]} / S_{[-r,r]}$, i.e., the probability $\mathrm{Pr}_n[a, b]$ that a randomly chosen point on the sphere will be on the layer $S^{n-1}_{[a,b]}(r)$ over the interval $[a, b]$, considering that the points are uniformly distributed over the sphere.

Answer:

$$\mathrm{Pr}_n[a, b] = \frac{\int_a^b \left(1 - (x/r)^2\right)^{\frac{n-3}{2}} dx}{\int_{-r}^{r} \left(1 - (x/r)^2\right)^{\frac{n-3}{2}} dx}.$$
b) Let $\delta \in (0, 1)$ and $[a, b] = [\delta r, r]$. Show that as $n \to \infty$,



Hint: You can use Laplace’s method for obtaining asymptotics of the integral over a 
large parameter. 

c) The result obtained in b) implies that the vast majority of the area of a multidimensional sphere is concentrated in a small neighborhood of the equatorial plane, in the layer $S^{n-1}_{[-\delta r, \delta r]}(r)$ over the interval $[-\delta r, \delta r]$.

Deduce from this that if we take independently and randomly a pair of vectors in $\mathbb{R}^n$, then for $n \gg 1$, it is very likely that they will be almost orthogonal, i.e., their scalar product will be close to zero. Estimate the probability that the scalar product is greater than $\varepsilon > 0$ and calculate its variance for $n \gg 1$.

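d) Prove, based on the result obtained in a), that for $r = \sigma\sqrt{n}$ and $n \to \infty$, one has

$$\mathrm{Pr}_n[a, b] \to \frac{1}{\sqrt{2\pi}\,\sigma} \int_a^b e^{-\frac{x^2}{2\sigma^2}}\,dx.$$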

e) Considering the result obtained in b), prove now Gauss’s law on the distribu- 
tion of measurement errors and Maxwell’s laws on the distribution of gas molecules 
according to speed and energy (considering in the first case that the observations 
are independent and their mean square stabilizes as the number of observations in- 
creases, and in the second case considering that the gas is homogeneous and that the 
total energy of the molecules in a portion of the gas is proportional to the number of 
molecules in this portion). 
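The near-orthogonality of random vectors in part c) is easy to observe experimentally. The following sketch makes assumptions that the text leaves open: the vectors are taken to be unit vectors uniformly distributed on the sphere (obtained by normalizing standard Gaussian vectors), and the variance is estimated empirically with a fixed seed. For such vectors the variance of the scalar product is $1/n$, which the simulation should approximately reproduce. All names are hypothetical.

```python
import math
import random

def random_unit_vector(n, rng):
    """Uniform point on the unit sphere: normalize a standard Gaussian vector."""
    v = [rng.gauss(0.0, 1.0) for _ in range(n)]
    norm = math.sqrt(sum(c * c for c in v))
    return [c / norm for c in v]

def dot_product_variance(n, trials, seed=0):
    """Empirical variance of <u, v> for independent uniform unit vectors u, v."""
    rng = random.Random(seed)
    dots = []
    for _ in range(trials):
        u = random_unit_vector(n, rng)
        v = random_unit_vector(n, rng)
        dots.append(sum(a * b for a, b in zip(u, v)))
    mean = sum(dots) / trials
    return sum((d - mean) ** 2 for d in dots) / trials

n = 100
print(dot_product_variance(n, 2000), 1 / n)
```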


C.3 Multidimensional Sphere, Law of Large Numbers, 
and Central Limit Theorem 

By solving this problem, you will discover the following fact, important in many 
aspects and manifested in many areas (for example, in statistical physics). 

Let $S^m$ be the unit sphere in the Euclidean space $\mathbb{R}^{m+1}$ with a very large dimension $m + 1$. Suppose also that we are given a sufficiently regular real-valued function on the sphere (for example, from a fixed Lipschitz class). We take randomly and independently two points and calculate the value of the function at these points. With a high probability, the values will almost coincide and they will be close to a certain number $M_f$.

(This, still hypothetical, number $M_f$ is called the median value of the function or function median. It is also called the average value of the function in the sense of Lévy.¹ The motivation for these terms will soon be clear, together with a precise definition of the number $M_f$.)


¹P. Lévy (1886-1971), famous French mathematician, student of J. Hadamard.
We introduce some notation and conventions. The distance between two points on the sphere $S^m \subset \mathbb{R}^{m+1}$ is understood in terms of its geodesic metric $\rho$. We denote by $A_\delta$ the $\delta$-neighborhood in $S^m$ of the set $A \subset S^m$. We replace the standard measure of the sphere with the uniformly distributed probability measure $\mu$, i.e., $\mu(S^m) = 1$.

We have the following assertion proved by Paul Lévy, commonly called Lévy's isoperimetric inequality.

For every $0 < a < 1$ and $\delta > 0$, there exists $\min\{\mu(A_\delta) \mid A \subset S^m,\ \mu(A) = a\}$, and it is attained on the spherical cap $A^0$ with measure $a$.

Here $A^0 = B(r)$, where $B(r) = B(x_0, r) = \{x \in S^m \mid \rho(x_0, x) < r\}$ and $\mu(B(r)) = a$.

Problem 3 a) For $a = 1/2$, i.e., when $A^0$ is a hemisphere, obtain the following result:

If the subset $A \subset S^{n+1}$ is such that $\mu(A) \ge 1/2$, then $\mu(A_\delta) \ge 1 - \sqrt{\pi/8}\,e^{-\delta^2 n/2}$. (If $n \to \infty$, we can replace $\sqrt{\pi/8}$ here by $1/2$.)

b) We denote by $M_f$ the number such that

$$\mu\{x \in S^n \mid f(x) \le M_f\} \ge 1/2 \quad \text{and} \quad \mu\{x \in S^n \mid f(x) \ge M_f\} \ge 1/2.$$

It is called the median or average value in the sense of Lévy of the function $f : S^n \to \mathbb{R}$. (If the $M_f$-level of the function $f$ on the sphere has measure zero, then the measure of each of the two sets mentioned above will be equal to exactly half of the $\mu$-area of the sphere $S^n$.)

Obtain the following lemma due to Lévy:

If $f \in C(S^{n+1})$ and $A = \{x \in S^{n+1} \mid f(x) = M_f\}$, then $\mu(A_\delta) \ge 1 - \sqrt{\pi/2}\,e^{-\delta^2 n/2}$.

c) Let $\omega_f(\delta) = \sup\{|f(x) - f(y)| \mid \rho(x, y) < \delta\}$ be the modulus of continuity of the function $f$.

The values of the function $f$ on the set $A_\delta$ are close to $M_f$. More precisely, if $\omega_f(\delta) \le \varepsilon$, then $|f(x) - M_f| \le \varepsilon$ on $A_\delta$. Thus Lévy's lemma shows that "good" functions are actually almost constant on almost their entire domain of definition $S^m$ when the dimension $m$ is very large.

Considering that $f \in \mathrm{Lip}(S^{n-1}, \mathbb{R})$ and $L$ is the Lipschitz constant of the function $f$, estimate the probability $\Pr\{|f(x) - M_f| > \varepsilon\}$ and the dispersion value of $|f(x) - M_f|$ for $n \gg 1$.

d) Obtain, as above, estimates in the case that the function $f$ is defined not on the unit sphere but on the sphere $S^{n-1}(r)$ with radius $r$.

e) If $f$ is a smooth function, then we can clearly take the maximum modulus of its gradient as the Lipschitz constant $L$. For example, the linear function $S_n = \frac{1}{n}(x_1 + \cdots + x_n)$ has $L = L_n = \frac{1}{\sqrt{n}}$. Suppose that we have a sequence of Lipschitz functions $f_n \in \mathrm{Lip}(S^{n-1}(r_n), \mathbb{R})$, for which $L_n = O\!\left(\frac{1}{\sqrt{n}}\right)$ and $r_n = \sqrt{n}$.

Estimate $\Pr\{|f_n(x) - M_{f_n}| > \varepsilon\}$ and the dispersion value of $|f_n(x) - M_{f_n}|$ for $n \gg 1$.

In particular, for $f_n = S_n$ deduce the standard law of large numbers.
f) Let $f_n = x_1 + \cdots + x_n$. The levels of this function are hyperplanes in $\mathbb{R}^n$ orthogonal to the vector $(1, \ldots, 1)$. The same can be said about the linear function $S_n = \frac{1}{\sqrt{n}}(x_1 + \cdots + x_n)$, with the only difference that under the movement from the origin in the direction of $(1, \ldots, 1)$, its values coincide with the distances to the origin. For this reason, its values are distributed on the sphere $S^{n-1}(r_n)$ exactly as they are on each of the coordinates.

Using this discussion and the result of Problem 2 d), setting $r_n = \sigma\sqrt{n}$, obtain your own version of the central limit theorem.


C.4 Multidimensional Intervals (Multidimensional Cubes) 

Problem 4 a) Let $I$ be the standard unit interval $[0, 1]$ of the real line $\mathbb{R}$, and $I^n$ the standard $n$-dimensional interval in $\mathbb{R}^n$, usually called the $n$-dimensional unit cube. This is a unit of volume in $\mathbb{R}^n$, but its diameter $\sqrt{n}$ for $n \gg 1$ is extremely large. Thus, even Lipschitz functions on $I^n$ with Lipschitz constant $L$ can have values spread within $L\sqrt{n}$.

Yet here, as in the above case of a sphere, there is a phenomenon of asymptotic stabilization (concentration) of values of such functions in the limit $n \to \infty$.

Now, try to find the proper formulations of the problem and study the phe- 
nomenon, up to the level of your ability (then check Sect. C.5 of this appendix). 

b) Suppose we have $n$ independent random variables $x_i$, taking values in the unit interval $[0, 1]$ and having probability densities $p_i(x)$, which are uniformly bounded away from zero (in particular, all $p_i(x)$ may coincide). Then as $n$ grows, the large majority of the random points $(x_1, \ldots, x_n) \in I^n$ will lie in close proximity to the boundary of the cube.

Explain this, and considering the result in a), obtain your own general law of 
large numbers. 

c) Show with an example that if the probability distribution of the random variables in b) is concentrated in the vertices of the cube as point masses, then the asymptotic stabilization of values for Lipschitz functions in the limit $n \to \infty$ may not occur.

d) We noted above that although the volume of the cube $I^n$ in $\mathbb{R}^n$ is equal to 1, its diameter $\sqrt{n}$ increases for $n \gg 1$, which creates difficulties. However, we have the following useful compensating observation: if each of two subsets $A$ and $B$ of the cube $I^n$ has measure greater than an arbitrarily small fixed positive number $\varepsilon$, then the distance between $A$ and $B$ is bounded from above by a constant depending only on $\varepsilon$ (and not depending on $n$). Prove this, and use this result if you need it.

e) Calculate the volume of the unit ball in $\mathbb{R}^n$ and show that the radius of the ball with volume one increases as $\sqrt{n/(2\pi e)}$ as $n \to \infty$. Go back to Sects. C.1 and C.2 and convince yourself again that the normal distribution and the laws related to it are closely linked in the geometric aspect with a simple multidimensional object, namely with the ball of unit volume.
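The asymptotics in part e) can be checked numerically before proving it. The sketch below uses the standard formula $\pi^{n/2} r^n / \Gamma(n/2 + 1)$ for the volume of the $n$-dimensional ball of radius $r$ and works with logarithms (`math.lgamma`) to avoid overflow for large $n$; the function name is hypothetical.

```python
import math

def unit_volume_ball_radius(n):
    """Radius of the n-dimensional Euclidean ball of volume one.

    Solves pi^(n/2) r^n / Gamma(n/2 + 1) = 1 for r, using logarithms.
    """
    log_r = (math.lgamma(n / 2 + 1) - (n / 2) * math.log(math.pi)) / n
    return math.exp(log_r)

for n in (10, 100, 1000):
    print(n, unit_volume_ball_radius(n), math.sqrt(n / (2 * math.pi * math.e)))
```

The ratio of the two printed columns tends to 1 slowly; by Stirling's formula it behaves like $(\pi n)^{1/(2n)}$.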
C.5 Gaussian Measures and Their Concentration

Problem 5 a) We mentioned in Sect. C.3 of this appendix the isoperimetric inequality on the sphere, in connection with the discussion of the observed stabilization of values (constancy) of regular functions on the multidimensional sphere. The same problem about minimizing the measure of a $\delta$-blowup of a set is important, and for the same reason it is also interesting in relation to other spaces that serve as natural domains for the relevant functions.

For example, in the case of the Gaussian probability measures defined by the normal probability distribution in the standard Euclidean space $\mathbb{R}^n$, the answer is also known (obtained by Borel). In this case, the extremal domain (with the fixed initial value of the Gaussian measure and a $\delta$-blowup, understood in the sense of the Euclidean metric) turns out to be a half-space.

In particular, if we take the half-space with Gaussian measure $\frac{1}{2}$ and we directly calculate the value of the Gaussian measure of the complement of its Euclidean $\delta$-blowup, then considering Borel's isoperimetric inequality, we can deduce that for any set $A$ having Gaussian measure $\frac{1}{2}$ in the space $\mathbb{R}^n$, the measure of its $\delta$-blowup can be estimated from below by $\mu(A_\delta) \ge 1 - I_\delta$, where $I_\delta$ is the integral of the density $(2\pi)^{-n/2} \exp\left(-\frac{|x|^2}{2}\right)$ of the Gaussian measure over the half-space at Euclidean distance $\delta$ from the origin.

An estimate from above of the integral $I_\delta$, for example, allows us to claim that $\mu(A_\delta) \ge 1 - 2\exp\left(-\frac{\delta^2}{2}\right)$. Prove this.

b) This is a rough estimate, but it shows the rapid growth of $\mu(A_\delta)$ with an increase of $\delta$, whatever the initial set $A$ of measure $\frac{1}{2}$ is.

It is very interesting to notice (and considering the possible transition to infinite-dimensional spaces, even quite useful) that the last estimate does not depend on the dimension of the space. It may seem that this absence of the dimension is a great loss and weakness in the estimates within the context of the concentration of measure discussed and of the stabilization of values of functions of several variables. In fact, this estimate even contains the principle of the concentration of a measure on the unit sphere of large dimension, discussed above.

It is enough to prove (prove it) that the main part of the Gaussian probability measure of the Euclidean space $\mathbb{R}^n$ for $n \gg 1$ is concentrated in the vicinity of the Euclidean sphere of radius $\sqrt{n}$. This means that at the intersection of this neighborhood with the half-space, which is distant from the origin, the proportion of this measure is exponentially small. Therefore, the main part of the measure is in the part of this neighborhood of the sphere of radius $\sqrt{n}$ that falls in the layer between two close parallel hyperplanes, symmetric with respect to the origin. If now we move through a homothety from the sphere of radius $\sqrt{n}$ to the unit sphere, then we obtain the principle of concentration of measure on the unit sphere, which we have already discussed (do the necessary calculations). In the statement of this principle, the dimension of the space occurs explicitly. This dimension was also present in the Gaussian case, but it was hidden in the size $\sqrt{n}$ of the sphere, and the main part of the measure of the whole space is concentrated in a neighborhood of this sphere.
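The concentration of the Gaussian measure near the sphere of radius $\sqrt{n}$ is easy to see in a simulation. This is only a sketch under stated assumptions: the standard Gaussian measure on $\mathbb{R}^n$ is sampled coordinatewise with a fixed seed, and the width of the neighborhood (here 3) is an arbitrary illustrative choice that does not grow with $n$. The function names are hypothetical.

```python
import math
import random

def gaussian_norms(n, samples, seed=0):
    """Euclidean norms of independent standard Gaussian vectors in R^n."""
    rng = random.Random(seed)
    return [math.sqrt(sum(rng.gauss(0.0, 1.0) ** 2 for _ in range(n)))
            for _ in range(samples)]

def fraction_near_sphere(n, samples, width, seed=0):
    """Fraction of samples x with | |x| - sqrt(n) | <= width."""
    radius = math.sqrt(n)
    norms = gaussian_norms(n, samples, seed)
    return sum(1 for r in norms if abs(r - radius) <= width) / samples

# The radius sqrt(400) = 20 is large, yet almost all samples lie within 3 of it.
print(fraction_near_sphere(400, 500, 3.0))
```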
C.6 A Little Bit More About the Multidimensional Cube


In the Euclidean space $\mathbb{R}^n$ we consider the $n$-dimensional unit interval ("cube")

$$I^n := \left\{ x = (x^1, \ldots, x^n) \in \mathbb{R}^n \;\middle|\; |x^i| \le \tfrac{1}{2},\ i = 1, 2, \ldots, n \right\}.$$

Its volume is equal to one, although the diameter is $\sqrt{n}$. (Recall that the Euclidean ball of volume one in $\mathbb{R}^n$ has radius of order $\sqrt{n}$, as mentioned above.) We shall consider the standard probability measure uniformly distributed on the cube $I^n$.

Let $a = (a^1, \ldots, a^n)$ be a unit vector, and $x = (x^1, \ldots, x^n)$ an arbitrary point in the cube $I^n$.

The following inequality holds (probability estimate of Bernstein type):

$$\mathrm{Pr}_n\left\{ \left| \sum_{i=1}^{n} a^i x^i \right| \ge t \right\} \le 2\exp(-6t^2).$$
If we interpret the sum $\sum_{i=1}^{n} a^i x^i$ as a scalar product $\langle a, x \rangle$, we notice that this can be large (of order $\sqrt{n}$) if the vector $a$ is not directed along any edge of the cube, but along the main diagonal, mixing all coordinate directions equally. If we take $a = \left(\frac{1}{\sqrt{n}}, \ldots, \frac{1}{\sqrt{n}}\right)$ in the previous estimate, we deduce that the volume of the $n$-dimensional cube $I^n$ concentrates, as $n$ increases, in a small neighborhood of the hyperplane passing through the origin and orthogonal to the vector $\left(\frac{1}{\sqrt{n}}, \ldots, \frac{1}{\sqrt{n}}\right)$.

In particular, if we consider a billiard in such a cube as a dynamical system (gas) composed of noninteracting particles, then for $n \gg 1$, the large majority of particle trajectories will go in a direction nearly perpendicular to the fixed vector $\left(\frac{1}{\sqrt{n}}, \ldots, \frac{1}{\sqrt{n}}\right)$, and they spend a large part of the time in a neighborhood of the above hyperplane.
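A Monte Carlo experiment gives a feeling for the Bernstein-type estimate $2\exp(-6t^2)$ along the main diagonal. The sketch assumes that the cube is centered at the origin, so that each coordinate is uniform on $[-\frac{1}{2}, \frac{1}{2}]$; the sample sizes and the seed are arbitrary, and the function names are hypothetical.

```python
import math
import random

def tail_probability(n, t, samples, seed=0):
    """Monte Carlo estimate of Pr{ |<a, x>| >= t } for x uniform in [-1/2, 1/2]^n
    and a = (1/sqrt(n), ..., 1/sqrt(n))."""
    rng = random.Random(seed)
    scale = 1.0 / math.sqrt(n)
    hits = 0
    for _ in range(samples):
        s = scale * sum(rng.uniform(-0.5, 0.5) for _ in range(n))
        if abs(s) >= t:
            hits += 1
    return hits / samples

def bernstein_bound(t):
    """Right-hand side of the estimate: 2 exp(-6 t^2)."""
    return 2.0 * math.exp(-6.0 * t * t)

for t in (0.25, 0.5, 1.0):
    print(t, tail_probability(100, t, 2000), bernstein_bound(t))
```

Note that the constant 6 matches the variance $\frac{1}{12}$ of a single coordinate: a centered Gaussian with that variance has tails of order $\exp(-6t^2)$.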


C.7 The Coding of a Signal in a Channel with Noise 

We point out in conclusion another area where the functions with a very large num- 
ber of variables also appear naturally and where the principle of concentration of a 
measure is shown and also used substantially. 

We are already used to the digital (discrete) coding and transmission of signals 
(music, images, messages, information) on a communication channel. In this form, 
a message can be thought of as a vector $x = (x^1, \ldots, x^n)$ in the space $\mathbb{R}^n$ with a very large dimension. The transmission of such messages requires an energy $E$, which is proportional to $\|x\|^2 = |x^1|^2 + \cdots + |x^n|^2$ (like the total kinetic energy of the gas molecules, discussed above). If $T$ is the duration of the transmitted message $x$, then $P = E/T$ is the average power required to transfer one character (a coordinate of the vector $x$). If $\Delta$ is the average time required to transfer a single coordinate of the vector $x$, then $T = n\Delta$ and $E = nP\Delta$.
The transmitting and receiving devices are arranged in such a way that the transmitter transforms (encodes) the original message to be transmitted into the form of the vector $x$. It sends it over the communication channel, and the receiver, knowing the code, decodes $x$, transforming it back into the form of the original message.

If we need to transmit $M$ messages $A_1, \ldots, A_M$ of length $n$, then it is enough to fix $M$ points in the ball of radius $\sqrt{E}$, agreeing on this selection with the receiving end of the communication channel. If in the communication channel there is no interference, then having received a vector from the agreed set, the receiver decodes it correctly into the corresponding message $A$.

If in the channel we do have interference (which is often the case), then because of the interference, a random vector $\xi = (\xi^1, \ldots, \xi^n)$ shifts the transmitted vector $a$, and the vector $a + \xi$ arrives at the receiver, and this vector must be properly decoded.

If the points $a_1, \ldots, a_M$ were chosen in such a way that the balls of radius $\|\xi\|$ with these points as centers do not intersect, then an unambiguous decoding is still possible. But if we want to meet this requirement, then we cannot take just any points $a_1, \ldots, a_M$, and there is a problem of dense packing of balls. This is a difficult problem, whose solution in the present situation can be avoided, as was shown by Shannon, given that here the dimension $n$ of the space $\mathbb{R}^n$ is huge.

We shall allow ourselves sometimes to make mistakes while interpreting the re- 
ceived message. However, we require the probability of error to be arbitrarily small 
(less than any fixed positive number). 

Shannon showed that even in the presence of random noise (white noise) in a communication channel with limited capacity, by choosing a long enough code (i.e., for a large value of $n$), it is possible to achieve transmission rates close to those of channels without noise, with an arbitrarily small probability of error.

The geometric idea of Shannon's theorem is directly related to the features discussed above of the distribution of measure (volume) of domains in a space with large dimension. Let us explain this.

Suppose that two identical balls in the space $\mathbb{R}^n$ intersect. If the received signal
lies in this intersection, then errors are possible in the interpretation of the
message sent by the source. But if the probability of falling into such a region is
considered proportional to the relative volume of the region, then it is natural to
compare the volume of the intersection of the balls with the volume of a ball. Let us
carry out the estimate. If the centers of two balls of radius 1 are separated
by the distance $\varepsilon$ ($0 < \varepsilon < 2$), then the intersection of these balls is contained in a
ball of radius $\sqrt{1 - (\varepsilon/2)^2}$ with center at the midpoint of the segment connecting the
centers of the original balls. Hence, the ratio of the volume of the intersection
of the two balls to the volume of either original ball does not exceed $(1 - (\varepsilon/2)^2)^{n/2}$. It is
clear now that for every fixed $\varepsilon$, this value can be made arbitrarily small by choosing
a sufficiently large value of $n$.
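
The decay of this bound with the dimension is easy to tabulate. The following is a
minimal numerical sketch; the function name `intersection_ratio_bound` and the sample
values of the distance and of the dimension are our own illustrative choices, not
taken from the text.

```python
def intersection_ratio_bound(eps: float, n: int) -> float:
    """Upper bound (1 - (eps/2)^2)^(n/2) for the ratio between the volume of
    the intersection of two unit balls in R^n, whose centers are at distance
    eps, and the volume of one unit ball."""
    if not 0.0 < eps < 2.0:
        raise ValueError("the distance eps must satisfy 0 < eps < 2")
    if n < 1:
        raise ValueError("the dimension n must be a positive integer")
    return (1.0 - (eps / 2.0) ** 2) ** (n / 2.0)


if __name__ == "__main__":
    # For a fixed distance between the centers the bound decays
    # geometrically as the dimension grows.
    for n in (3, 10, 100, 1000):
        print(n, intersection_ratio_bound(0.5, n))
```

Even for nearby centers the bound becomes negligible once the dimension is in the
hundreds, which is the regime of long codes discussed above.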



Appendix D 

Operators of Field Theory in Curvilinear 
Coordinates 


Introduction 


Almost any book with mathematical problems and even any textbook of mathemat- 
ical analysis states something like the following. “Children, remember”: 

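We call the gradient of a function $U(x, y, z)$ the vector

$$\operatorname{grad} U := \left(\frac{\partial U}{\partial x}, \frac{\partial U}{\partial y}, \frac{\partial U}{\partial z}\right).$$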


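The curl of a vector field $A = (P, Q, R)(x, y, z)$ is the vector

$$\operatorname{curl} A := \left(\frac{\partial R}{\partial y} - \frac{\partial Q}{\partial z},\ \frac{\partial P}{\partial z} - \frac{\partial R}{\partial x},\ \frac{\partial Q}{\partial x} - \frac{\partial P}{\partial y}\right).$$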


The divergence of a vector field $B = (P, Q, R)(x, y, z)$ is the function

$$\operatorname{div} B := \frac{\partial P}{\partial x} + \frac{\partial Q}{\partial y} + \frac{\partial R}{\partial z}.$$


It is not usually mentioned that these formulas are valid only in Cartesian
coordinates, nor is it explained what should be done if the coordinate system is
different. This is understandable, since the very formulation of this problem already
requires some suitable definition of these objects.


D.1 Reminders of Algebra and Geometry

D.1.1 Bilinear Forms and Their Coordinate Representation 

a. Scalar Product and General Linear Forms 

We shall consider a vector space with a scalar product $\langle\,,\rangle$. More generally, we may
take $\langle\,,\rangle$ to denote an arbitrary bilinear form on an $n$-dimensional vector space $X$. If we
choose a basis $\xi_1, \ldots, \xi_n$ of the space, then the objects of the space (in particular,
vectors and forms) will have a coordinate representation. We recall the coordinate
representation of the bilinear form $\langle\,,\rangle$.

If we take two vectors $x = x^i \xi_i$, $y = y^j \xi_j$, decomposed in terms of
the basis, then we have $\langle x, y\rangle = \langle x^i \xi_i, y^j \xi_j\rangle = \langle \xi_i, \xi_j\rangle x^i y^j = g_{ij} x^i y^j$. As usual,
summation over repeated indices is understood. Thus if a basis of the space is given,
the choice of values $\langle \xi_i, \xi_j\rangle = g_{ij}$ completely defines the bilinear form.

If the form is a scalar product, then a basis is orthogonal if $g_{ij} = 0$ for $i \neq j$. It is
assumed here that the form is nondegenerate, of course.


b. Nondegeneracy of Bilinear Forms 

A bilinear form is called nondegenerate if, once a value is fixed in one of its
arguments, the form is identically zero with respect to the other argument
if and only if the fixed value is zero (the zero vector).

The nondegeneracy of the form is equivalent to the fact that the determinant of
the matrix $(g_{ij})$ is different from zero. Indeed, if the fixed vector $x = x^i \xi_i$ is such
that $\langle x, y\rangle = 0$ for all $y$, then $\langle \xi_i, \xi_j\rangle x^i = 0$, that is, $g_{ij} x^i = 0$ for every value
$j \in \{1, \ldots, n\}$. This homogeneous system of equations has a unique solution (zero)
if and only if the determinant of the matrix $(g_{ij})$ of the system is nonzero.


D.1.2 Correspondence Between Forms and Vectors 

a. 1-Forms in the Presence of 2-Forms and Their Correspondence with Vectors 

If one has a 2-form $\langle\,,\rangle$, then each vector $A$ can be associated with a 1-form, namely
the linear form $\langle A, x\rangle$. If the 2-form is nondegenerate, then the correspondence is
one-to-one. Indeed, if we are given a linear function $a(x) = a_j x^j$ (where $a_j =
a(\xi_j)$) and we want to represent it in the form $\langle A, x\rangle$, where $A = A^i \xi_i$, then for
the coordinates of the vector $A$ we have the system of equations $a(\xi_j) = \langle \xi_i, \xi_j\rangle A^i$,
$j = 1, \ldots, n$, which is uniquely solvable if the determinant of the matrix $(g_{ij})$ is
different from zero.

Thus, the coordinates of the vector $A = A^i \xi_i$ and the coefficients of the 1-form $a$
in the same basis $\{\xi_i\}$ are linked by the mutually inverse relations

$$a_j = g_{ij} A^i, \qquad A^i = g^{ij} a_j.$$
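
To see the two mutually inverse relations at work, here is a small sketch in exact
rational arithmetic for $n = 2$. The helper names (`lower_index`, `raise_index`,
`inverse_2x2`) and the sample Gram matrix are hypothetical, chosen only for
illustration.

```python
from fractions import Fraction


def lower_index(g, A):
    """Coefficients a_j = g_ij A^i of the 1-form <A, .> (sum over i)."""
    n = len(A)
    return [sum(g[i][j] * A[i] for i in range(n)) for j in range(n)]


def inverse_2x2(g):
    """Inverse matrix (g^ij) of a nondegenerate 2x2 matrix (g_ij)."""
    det = g[0][0] * g[1][1] - g[0][1] * g[1][0]
    if det == 0:
        raise ValueError("the bilinear form is degenerate")
    return [[g[1][1] / det, -g[0][1] / det],
            [-g[1][0] / det, g[0][0] / det]]


def raise_index(g_inv, a):
    """Coordinates A^i = g^ij a_j of the vector representing the 1-form a."""
    n = len(a)
    return [sum(g_inv[i][j] * a[j] for j in range(n)) for i in range(n)]


if __name__ == "__main__":
    # Sample (assumed) Gram matrix of a non-orthogonal basis and a vector A.
    g = [[Fraction(2), Fraction(1)], [Fraction(1), Fraction(3)]]
    A = [Fraction(1), Fraction(2)]
    a = lower_index(g, A)
    print(a, raise_index(inverse_2x2(g), a))
```

Passing from the vector to the form and back returns the original coordinates, as the
relations require whenever $\det(g_{ij}) \neq 0$.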


b. Correspondence Between a Vector and an $(n-1)$-Form

Similarly, if one has a nondegenerate $n$-form $\Omega$, each vector $B$ can be associated
with an $(n-1)$-form, namely the form $\Omega(B, \ldots)$.

We shall deal below with vector fields $A$, $B$ and carry out the procedure just described
in the tangent space, for example in relation to the work form $\omega_A^1 = \langle A, \cdot\rangle$ and
the flux form $\omega_B^{n-1} = \Omega^n(B, \ldots)$, in the presence of the inner product $\langle\,,\rangle$ and the
volume form $\Omega^n$, respectively.


D.1.3 Curvilinear Coordinates and Metric

a. Curvilinear Coordinates, Metric, and Volume Form 

Suppose that on an $n$-dimensional surface (manifold) we have a metric, which in
local coordinates $(t^1, \ldots, t^n)$ (in the local charts) is given by the form $g_{ij}(t)\,dt^i\,dt^j$,
determined by the scalar product $\langle\,,\rangle(t)$ in the tangent plane (tangent space) to the
surface at the point corresponding to the parameter value $t$.

For example, if the surface (or curve) is given in parametric form, it is
embedded into the Euclidean space, and then the scalar product in the tangent planes
(spaces) to the surface is naturally induced from that in the ambient space.

We even know how to find the area of such a surface ($n$-measure): it is
necessary to integrate the volume form

$$\Omega = \sqrt{\det g_{ij}(t)}\; dt^1 \wedge \cdots \wedge dt^n.$$


b. Orthogonal Systems of Curvilinear Coordinates and Unit Vectors 

Recall that a system of curvilinear coordinates $(t^1, \ldots, t^n)$ is called orthogonal if
$g_{ij} \equiv 0$ for $i \neq j$.

The length element in an orthogonal system of curvilinear coordinates is written
in a particularly simple form:

$$ds^2 = g_{11}(t)(dt^1)^2 + \cdots + g_{nn}(t)(dt^n)^2.$$

It is often rewritten in the more compact notation

$$ds^2 = E_1(t)(dt^1)^2 + \cdots + E_n(t)(dt^n)^2.$$

The vectors $\xi_1 = (1, 0, \ldots, 0)$, ..., $\xi_n = (0, \ldots, 0, 1)$ of the coordinate directions
form a basis of the tangent space corresponding to the value of the parameter $t$. But
the norm (length) of these vectors is, in general, not equal to one. We always have,
whether or not the system of coordinates is orthogonal, $\langle \xi_i, \xi_i\rangle(t) = g_{ii}(t)$,
i.e., $\|\xi_i\| = \sqrt{g_{ii}(t)}$, $i \in \{1, \ldots, n\}$.

Thus, the unit vectors $(e_1, \ldots, e_n)$ (vectors of length one) of the coordinate
directions have the following coordinate representation:

$$e_1 = \left(\frac{1}{\sqrt{g_{11}}}, 0, \ldots, 0\right), \quad \ldots, \quad e_n = \left(0, \ldots, 0, \frac{1}{\sqrt{g_{nn}}}\right).$$

In particular, if the system of curvilinear coordinates is orthogonal, then the following
system of vectors of coordinate directions will be an orthonormal basis in the
corresponding tangent space:

$$e_1 = \left(\frac{1}{\sqrt{E_1}}, 0, \ldots, 0\right), \quad \ldots, \quad e_n = \left(0, \ldots, 0, \frac{1}{\sqrt{E_n}}\right).$$

c. Cartesian, Cylindrical, and Spherical Coordinates 

As examples of orthogonal coordinate systems we have the standard Cartesian, 
cylindrical, and spherical coordinates in R 3 . 

Problem 1 Write down the metric $g_{ij}(t)\,dt^i\,dt^j$ in each of these coordinate systems
and find an orthonormal basis $(e_1, e_2, e_3)$.

Answer In Cartesian coordinates $(x, y, z)$, cylindrical coordinates $(r, \varphi, z)$, and
spherical coordinates $(R, \varphi, \theta)$ of the Euclidean space $\mathbb{R}^3$, the quadratic form
$g_{ij}(t)\,dt^i\,dt^j$ has the following form:

$$ds^2 = dx^2 + dy^2 + dz^2 = dr^2 + r^2\,d\varphi^2 + dz^2 = dR^2 + R^2\cos^2\theta\,d\varphi^2 + R^2\,d\theta^2.$$


In Cartesian, cylindrical, and spherical coordinates, the triples of unit vectors of
coordinate directions are the following, respectively:

$$e_x = (1, 0, 0), \quad e_y = (0, 1, 0), \quad e_z = (0, 0, 1);$$

$$e_r = (1, 0, 0), \quad e_\varphi = \left(0, \frac{1}{r}, 0\right), \quad e_z = (0, 0, 1);$$

$$e_R = (1, 0, 0), \quad e_\varphi = \left(0, \frac{1}{R\cos\theta}, 0\right), \quad e_\theta = \left(0, 0, \frac{1}{R}\right).$$
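
The spherical metric of this answer can be checked numerically by inducing the scalar
product from the ambient Euclidean space. The sketch below assumes the parametrization
$x = R\cos\theta\cos\varphi$, $y = R\cos\theta\sin\varphi$, $z = R\sin\theta$ (with $\theta$ the
latitude, as the coefficient $R^2\cos^2\theta$ suggests) and approximates the partial
derivatives by central differences; the function names are ours.

```python
import math


def spherical_to_cartesian(t):
    """Assumed embedding (R, phi, theta) -> (x, y, z), theta being the latitude."""
    R, phi, theta = t
    return (R * math.cos(theta) * math.cos(phi),
            R * math.cos(theta) * math.sin(phi),
            R * math.sin(theta))


def induced_metric(embedding, t, h=1e-6):
    """Matrix g_ij = <xi_i, xi_j>, where xi_i is the partial derivative of the
    embedding with respect to t^i, approximated by a central difference."""
    n = len(t)
    tangent = []
    for i in range(n):
        plus, minus = list(t), list(t)
        plus[i] += h
        minus[i] -= h
        xp, xm = embedding(plus), embedding(minus)
        tangent.append([(a - b) / (2 * h) for a, b in zip(xp, xm)])
    return [[sum(u * v for u, v in zip(tangent[i], tangent[j]))
             for j in range(n)] for i in range(n)]


if __name__ == "__main__":
    R, phi, theta = 2.0, 0.7, 0.3
    for row in induced_metric(spherical_to_cartesian, (R, phi, theta)):
        print([round(x, 6) for x in row])
    # Compare with the diagonal (1, R^2 cos^2(theta), R^2).
    print(1.0, (R * math.cos(theta)) ** 2, R ** 2)
```

The off-diagonal entries vanish up to discretization error, which is the numerical
form of the statement that the spherical system is orthogonal.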




D.2 Operators grad, curl, div in Curvilinear Coordinates 
D.2.1 Differential Forms and Operators grad, curl, div 

The differential $dU$ of a function $U$ is a 1-form. When one has a scalar product $\langle\,,\rangle$,
as we know, to the 1-form $dU$ corresponds a vector $A$ such that $dU = \langle A, \cdot\rangle$. This
vector is called the gradient of the function $U$ and is denoted by $\operatorname{grad} U$.

Thus, $dU = \langle \operatorname{grad} U, \cdot\rangle$.

Suppose that in the Euclidean space $\mathbb{R}^3$ (or in any three-dimensional Riemannian
manifold) we have the 1-form $\omega_A^1 = \langle A, \cdot\rangle$ corresponding to the field $A$. The
differential $d\omega_A^1$ of this form is a 2-form $\omega_B^2$, corresponding, in the presence of a volume
form $\Omega^3$, to some vector field $B$ (i.e., $\omega_B^2 = \Omega^3(B, \cdot, \cdot)$). Then the field $B$ is called
the curl of the vector field $A$, and is denoted by $\operatorname{curl} A$.

Thus, $d\omega_A^1 = \omega_{\operatorname{curl} A}^2$.

If one has a volume form $\Omega^n$ on an $n$-dimensional surface (for example on $\mathbb{R}^n$),
then there is defined an $(n-1)$-form for the flux of a vector field $B$, namely the form
$\omega_B^{n-1} = \Omega^n(B, \cdot, \ldots, \cdot)$. The differential $d\omega_B^{n-1}$ of this $(n-1)$-form is an $n$-form, which
therefore has the type $\rho\,\Omega^n$. The proportionality factor, the function $\rho$, is called the
divergence of the vector field $B$ and is denoted by $\operatorname{div} B$.

Thus, $d\omega_B^{n-1} = (\operatorname{div} B)\,\Omega^n$.


D.2.2 Gradient of a Function and Its Coordinate Representation


a. Coordinate Representation for the Correspondence Between a Vector 
and a 1-Form 


In Sect. D.1.2, we derived a relation between the coefficients of a 1-form $\omega_A^1 = \langle A, \cdot\rangle$
and the coordinates of the vector $A = A^i \xi_i$. If we take the unit vectors $e_i$ instead of
the vectors $\xi_i$, then since $\xi_i = \sqrt{g_{ii}}\, e_i$, the coordinates of the vector $A = A_e^i e_i$
in the basis $\{e_i\}$ and its former coordinates are related through the equation $A_e^i =
A^i \sqrt{g_{ii}}$, $i \in \{1, \ldots, n\}$.

Hence the relations between the form and the vector now take the form

$$a_j = g_{ij} \frac{A_e^i}{\sqrt{g_{ii}}}, \qquad \frac{A_e^i}{\sqrt{g_{ii}}} = g^{ij} a_j.$$

These formulas allow us to write, in terms of the vector $A = A_e^i e_i$, the
corresponding form $\omega_A^1 = \langle A, \cdot\rangle = a_j\,dt^j$ and conversely, to write the vector $A = A_e^i e_i$ in
terms of the 1-form $\omega^1 = a_j\,dt^j$.


Problem 2 Write down in Cartesian, cylindrical, and spherical coordinates of the
Euclidean space $\mathbb{R}^3$ the explicit form of the 1-form $\omega_A^1 = \langle A, \cdot\rangle$, corresponding to
the vector $A = A_e^i e_i$.

Answer The 1-form $\omega_A^1$ has the following form, in Cartesian coordinates $(x, y, z)$,
cylindrical coordinates $(r, \varphi, z)$, and spherical coordinates $(R, \varphi, \theta)$ of the
Euclidean space $\mathbb{R}^3$, respectively:

$$\omega_A^1 = A_x\,dx + A_y\,dy + A_z\,dz = A_r\,dr + A_\varphi r\,d\varphi + A_z\,dz = A_R\,dR + A_\varphi R\cos\theta\,d\varphi + A_\theta R\,d\theta.$$


b. Differential of a Function and the Gradient


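We shall apply the general formula relating the vector $A$ and the form $\omega_A^1$ in the case
of the form $dU = \langle \operatorname{grad} U, \cdot\rangle$, in order to find the decomposition $\operatorname{grad} U = A_e^i e_i$.
Since $dU = \frac{\partial U}{\partial t^j}\,dt^j$, i.e., $a_j = \frac{\partial U}{\partial t^j}$, we have $A_e^i = g^{ij} \sqrt{g_{ii}}\, \frac{\partial U}{\partial t^j}$.

In the case of an orthogonal system of curvilinear coordinates, the matrix $(g_{ij})$
is diagonal, as well as its inverse matrix $(g^{ij})$. Moreover, $g^{ii} = 1/g_{ii}$. Hence in this
case,

$$\operatorname{grad} U = \frac{1}{\sqrt{g_{11}}} \frac{\partial U}{\partial t^1}\, e_1 + \cdots + \frac{1}{\sqrt{g_{nn}}} \frac{\partial U}{\partial t^n}\, e_n.$$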

c. Gradient in Cartesian, Cylindrical, and Spherical Coordinates 

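Problem 3 Write down the vector $\operatorname{grad} U = A_e^i e_i$ in Cartesian, cylindrical, and
spherical coordinates of the Euclidean space $\mathbb{R}^3$.


Answer The vector $\operatorname{grad} U$ has the following form in Cartesian $(x, y, z)$, cylindrical
$(r, \varphi, z)$, and spherical $(R, \varphi, \theta)$ coordinates of the Euclidean space $\mathbb{R}^3$, respectively:

$$\operatorname{grad} U = \frac{\partial U}{\partial x}\, e_x + \frac{\partial U}{\partial y}\, e_y + \frac{\partial U}{\partial z}\, e_z =$$

$$= \frac{\partial U}{\partial r}\, e_r + \frac{1}{r} \frac{\partial U}{\partial \varphi}\, e_\varphi + \frac{\partial U}{\partial z}\, e_z =$$

$$= \frac{\partial U}{\partial R}\, e_R + \frac{1}{R\cos\theta} \frac{\partial U}{\partial \varphi}\, e_\varphi + \frac{1}{R} \frac{\partial U}{\partial \theta}\, e_\theta.$$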
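
As a quick consistency check of the spherical formula (our own illustrative example,
not part of the original answer), take the function $U = R\sin\theta$, which is simply
the Cartesian coordinate $z$ when $\theta$ is the latitude. Its gradient must then be
the unit vector $e_z$.

```latex
% Worked check (illustrative): U = R \sin\theta in the coordinates (R, \varphi, \theta).
\begin{align*}
\frac{\partial U}{\partial R} &= \sin\theta, \qquad
\frac{\partial U}{\partial \varphi} = 0, \qquad
\frac{\partial U}{\partial \theta} = R\cos\theta, \\
\operatorname{grad} U
  &= \sin\theta\, e_R
   + \frac{1}{R\cos\theta}\cdot 0\cdot e_\varphi
   + \frac{1}{R}\, R\cos\theta\, e_\theta
   = \sin\theta\, e_R + \cos\theta\, e_\theta, \\
\|\operatorname{grad} U\|^2 &= \sin^2\theta + \cos^2\theta = 1 .
\end{align*}
```

The result has length one at every point, in agreement with the Cartesian computation
$\operatorname{grad} z = e_z$.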


D.2.3 Divergence and Its Coordinate Representation 

a. Coordinate Representation for the Correspondence Between a Vector
and an $(n-1)$-Form

We know that if there exists a nondegenerate $n$-form $\Omega^n$ in an $n$-dimensional vector
space, then one can establish a one-to-one correspondence between a vector $B$ and
the $(n-1)$-form $\omega_B^{n-1} = \Omega^n(B, \ldots)$. We wish to write down an explicit formula
relating the coordinates of the vector $B = B^i \xi_i$ and the coefficients of the form
$\omega_B^{n-1} = \sum_{i=1}^n b_i\, x^1 \wedge \cdots \wedge \widehat{x^i} \wedge \cdots \wedge x^n$, considering that both objects are expressed in terms
of the one basis $\{\xi_i\}$ of the space. Here, $x^i$ is a linear function as usual, whose action
is given by assigning the $i$th coordinate of a vector, i.e., $x^i(v) := v^i$; the symbol $\widehat{x^i}$
means that the corresponding factor is omitted. The $n$-form $\Omega^n$ in the $n$-dimensional
vector space is $x^1 \wedge \cdots \wedge x^n$ or proportional to this standard volume form, equal to
one on the set of the basis vectors $(\xi_1, \ldots, \xi_n)$.

In general, the value of the form $\Omega^n = x^1 \wedge \cdots \wedge x^n$ on any set of vectors $(v_1, \ldots, v_n)$
is equal to the determinant of the matrix $(v_i^j)$ consisting of the coordinates of these
vectors. Hence if we consider the rule for the expansion of the determinant along a row,
we can write

$$\Omega^n(B, \ldots) = \sum_{i=1}^n (-1)^{i-1} B^i\, x^1 \wedge \cdots \wedge \widehat{x^i} \wedge \cdots \wedge x^n.$$

However, $\omega_B^{n-1} = \Omega^n(B, \ldots)$; thus

$$\sum_{i=1}^n b_i\, x^1 \wedge \cdots \wedge \widehat{x^i} \wedge \cdots \wedge x^n = \sum_{i=1}^n (-1)^{i-1} B^i\, x^1 \wedge \cdots \wedge \widehat{x^i} \wedge \cdots \wedge x^n.$$

Therefore, $b_i = (-1)^{i-1} B^i$ for every $i \in \{1, \ldots, n\}$. If instead we had the form
$c\,\Omega^n = c\, x^1 \wedge \cdots \wedge x^n$, then we would have the equation $b_i = (-1)^{i-1} c B^i$ for every
$i \in \{1, \ldots, n\}$.

Recall also that if there is an inner product $\langle\,,\rangle$ and a fixed basis $\{\xi_i\}$ in a vector
space, then there is also a natural volume form $\sqrt{\det g_{ij}}\; x^1 \wedge \cdots \wedge x^n$ defined, as is
the scalar product itself, in terms of the values $g_{ij} = \langle \xi_i, \xi_j\rangle$.

Finally, recall that in this case, the unit vectors (with respect to the norm) are
not in general the vectors $\{\xi_i\}$, but the vectors $e_i = \xi_i / \sqrt{g_{ii}}$. Since $\xi_i = \sqrt{g_{ii}}\, e_i$, the
original decomposition of the vector $B = B^i \xi_i$ becomes, in the basis $\{e_i\}$, $B = B_e^i e_i$,
where $B_e^i = \sqrt{g_{ii}}\, B^i$.

Therefore, if one has a scalar product on the space, then there is a natural volume
form $\Omega_g^n = \sqrt{\det g_{ij}}\; x^1 \wedge \cdots \wedge x^n$, and if $\omega_B^{n-1} = \Omega_g^n(B, \ldots)$, then the coefficients
of the form $\omega_B^{n-1} = \sum_i b_i\, x^1 \wedge \cdots \wedge \widehat{x^i} \wedge \cdots \wedge x^n$ and the coordinates of the vector $B$ in
the decomposition $B = B_e^i e_i$ in terms of the basis of unit vectors $e_i = \xi_i / \sqrt{g_{ii}}$ are
related by the equations

$$b_i = (-1)^{i-1} \sqrt{\det g_{ij}}\; \frac{B_e^i}{\sqrt{g_{ii}}}.$$

In an orthogonal basis, $\det g_{ij} = g_{11} \cdots g_{nn}$. In this case,

$$b_i = (-1)^{i-1} \sqrt{g_{11} \cdots \widehat{g_{ii}} \cdots g_{nn}}\; B_e^i.$$

All of the above remains valid when it is applied to the case of the vector field
$B(t)$ and the differential form $\omega_B^{n-1} = \Omega_g^n(B, \ldots)$ generated by the field and the
volume form.

Thus if $\Omega_g^n = \sqrt{\det g_{ij}(t)}\; dt^1 \wedge \cdots \wedge dt^n$,

$$\omega_B^{n-1} = \sum_{i=1}^n b_i(t)\, dt^1 \wedge \cdots \wedge \widehat{dt^i} \wedge \cdots \wedge dt^n,$$

and $B(t) = B_e^i(t) e_i(t)$ is the decomposition in terms of the unit vectors of the
curvilinear coordinates $(t^1, \ldots, t^n)$, then

$$b_i = (-1)^{i-1} \frac{\sqrt{\det g_{ij}}}{\sqrt{g_{ii}}}\, B_e^i, \qquad B_e^i = (-1)^{i-1} \frac{\sqrt{g_{ii}}}{\sqrt{\det g_{ij}}}\, b_i.$$

If the system of curvilinear coordinates is orthogonal, we come back to the
relation $b_i = (-1)^{i-1} \sqrt{g_{11} \cdots \widehat{g_{ii}} \cdots g_{nn}}\; B_e^i$.

In particular, for a 3-dimensional orthogonal system of curvilinear coordinates
$(t^1, t^2, t^3)$, using the notation $E_i = g_{ii}$ mentioned at the beginning, it is
possible to write the following coordinate representation of the form $\omega_B^2$ corresponding
to the vector $B = B_e^1 e_1 + B_e^2 e_2 + B_e^3 e_3$:

$$\omega_B^2 = B_e^1 \sqrt{E_2 E_3}\; dt^2 \wedge dt^3 + B_e^2 \sqrt{E_3 E_1}\; dt^3 \wedge dt^1 + B_e^3 \sqrt{E_1 E_2}\; dt^1 \wedge dt^2 =$$

$$= \sqrt{E_1 E_2 E_3} \left( \frac{B_e^1}{\sqrt{E_1}}\, dt^2 \wedge dt^3 + \frac{B_e^2}{\sqrt{E_2}}\, dt^3 \wedge dt^1 + \frac{B_e^3}{\sqrt{E_3}}\, dt^1 \wedge dt^2 \right).$$

(Bear in mind that in the 3-dimensional case, the 2-form $\omega^2$ is not usually written
as $b_1\, dt^2 \wedge dt^3 + b_2\, dt^1 \wedge dt^3 + b_3\, dt^1 \wedge dt^2$, but as $a_1\, dt^2 \wedge dt^3 + a_2\, dt^3 \wedge dt^1 +
a_3\, dt^1 \wedge dt^2$; for example, $P\, dy \wedge dz + Q\, dz \wedge dx + R\, dx \wedge dy$.)


Problem 4 Specify the explicit form of the 2-form $\omega_B^2 = \Omega^3(B, \ldots)$ corresponding
to the vector field $B = B_e^i e_i$ in Cartesian, cylindrical, and spherical coordinates of
the Euclidean space $\mathbb{R}^3$.


Answer The form $\omega_B^2$ has the following form in Cartesian $(x, y, z)$, cylindrical
$(r, \varphi, z)$, and spherical $(R, \varphi, \theta)$ coordinates of the Euclidean space $\mathbb{R}^3$:

$$\omega_B^2 = B_x\, dy \wedge dz + B_y\, dz \wedge dx + B_z\, dx \wedge dy =$$

$$= B_r r\, d\varphi \wedge dz + B_\varphi\, dz \wedge dr + B_z r\, dr \wedge d\varphi =$$

$$= B_R R^2 \cos\theta\, d\varphi \wedge d\theta + B_\varphi R\, d\theta \wedge dR + B_\theta R \cos\theta\, dR \wedge d\varphi.$$


b. The Differential Form of a Flux and the Divergence of the Velocity Field 

The form $\omega_B^{n-1} = \Omega^n(B, \ldots)$ is often called the flux form, since when $B$ is the
velocity field of a flow (at least for $n = 3$), one has to integrate exactly this form to find
the outflow (flux) through a surface.

The differential $d\omega_B^{n-1}$ of the flux form is an $n$-form, proportional to the
volume form. The coefficient of proportionality is called the divergence of the field $B$,
as we know. Thus $d\omega_B^{n-1} = \operatorname{div} B \cdot \Omega^n$.

We want to start from the field $B = B_e^i e_i$ itself and find its divergence $\operatorname{div} B$. We
already know how to find the flux form $\omega_B^{n-1}$ from the field $B = B_e^i e_i$. We shall
find it, compute its differential, and obtain an $n$-form, proportional to the volume
form, whose coefficient of proportionality is the divergence of the field $B$.

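Let us carry this out. We write the $(n-1)$-form $\omega_B^{n-1}$ in the following form:

$$\omega_B^{n-1} = \sum_{i=1}^n b_i(t)\, dt^1 \wedge \cdots \wedge \widehat{dt^i} \wedge \cdots \wedge dt^n.$$

We compute its differential

$$d\omega_B^{n-1} = \left( \sum_{i=1}^n (-1)^{i-1} \frac{\partial b_i}{\partial t^i} \right) dt^1 \wedge \cdots \wedge dt^n.$$

We express the coefficients $b_i$ of the form $\omega_B^{n-1}$ through the coordinates $B_e^i$ of the
vector $B = B_e^i e_i$:

$$d\omega_B^{n-1} = \left( \sum_{i=1}^n \frac{\partial}{\partial t^i} \left( \frac{\sqrt{\det g_{ij}}}{\sqrt{g_{ii}}}\, B_e^i \right) \right) dt^1 \wedge \cdots \wedge dt^n.$$

We compare this form with the volume form

$$\Omega_g^n = \sqrt{\det g_{ij}(t)}\; dt^1 \wedge \cdots \wedge dt^n,$$

and we obtain

$$\operatorname{div} B = \frac{1}{\sqrt{\det g_{ij}}} \sum_{i=1}^n \frac{\partial}{\partial t^i} \left( \frac{\sqrt{\det g_{ij}}}{\sqrt{g_{ii}}}\, B_e^i \right).$$

In an orthogonal system of curvilinear coordinates, this formula takes the form

$$\operatorname{div} B = \frac{1}{\sqrt{g_{11} \cdots g_{nn}}} \sum_{i=1}^n \frac{\partial}{\partial t^i} \left( \frac{\sqrt{g_{11} \cdots g_{nn}}}{\sqrt{g_{ii}}}\, B_e^i \right).$$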

c. Divergence in Cartesian, Cylindrical, and Spherical Coordinates 

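Problem 5 Write down formulas to calculate the divergence of a vector field
$B = B_e^i e_i$ in Cartesian, cylindrical, and spherical coordinates of the Euclidean
space $\mathbb{R}^3$.

Answer In Cartesian coordinates $(x, y, z)$, cylindrical coordinates $(r, \varphi, z)$, and
spherical coordinates $(R, \varphi, \theta)$ of the Euclidean space $\mathbb{R}^3$, the divergence $\operatorname{div} B$ of
the vector field $B = B_e^i e_i$ can be calculated according to the formula

$$\operatorname{div} B = \frac{\partial B_x}{\partial x} + \frac{\partial B_y}{\partial y} + \frac{\partial B_z}{\partial z} =$$

$$= \frac{1}{r} \left( \frac{\partial (r B_r)}{\partial r} + \frac{\partial B_\varphi}{\partial \varphi} \right) + \frac{\partial B_z}{\partial z} =$$

$$= \frac{1}{R^2 \cos\theta} \left( \frac{\partial (R^2 \cos\theta\, B_R)}{\partial R} + \frac{\partial (R B_\varphi)}{\partial \varphi} + \frac{\partial (R \cos\theta\, B_\theta)}{\partial \theta} \right).$$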
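
A simple sanity check of the spherical formula (an illustrative example of ours, not
taken from the text) is the radial field $B = R\, e_R$, i.e., $B_R = R$, $B_\varphi = B_\theta = 0$.
In Cartesian coordinates this is the field $(x, y, z)$, whose divergence is 3.

```latex
% Worked check (illustrative): B = R e_R in the coordinates (R, \varphi, \theta).
\begin{align*}
\operatorname{div} B
  &= \frac{1}{R^2\cos\theta}\left(
       \frac{\partial\,(R^2\cos\theta\cdot R)}{\partial R}
     + \frac{\partial\,(R\cdot 0)}{\partial \varphi}
     + \frac{\partial\,(R\cos\theta\cdot 0)}{\partial \theta}\right) \\
  &= \frac{3R^2\cos\theta}{R^2\cos\theta} = 3 .
\end{align*}
```

Both coordinate systems give the same value, as they must, since the divergence was
defined without reference to coordinates.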


D.2.4 Curl of a Vector Field and Its Coordinate Representation 

a. Correspondence Between a Vector Field A and the Vector Field B = curl A 

We shall now consider the special 3-dimensional case. We shall assume, as before,
that we are given a metric $g_{ij}(t)\,dt^i\,dt^j$ in the curvilinear coordinates $(t^1, t^2, t^3)$,
generating at the same time the volume form $\Omega_g^3 = \sqrt{\det g_{ij}(t)}\; dt^1 \wedge dt^2 \wedge dt^3$.

In this case the vector field $A = A_e^i e_i$ corresponds to the 1-form $\omega_A^1$, and the
differential $d\omega_A^1$ of this form, as a 2-form (an $(n-1)$-form), corresponds to a vector
field $B = B_e^i e_i$ such that $d\omega_A^1 = \omega_B^2$. This vector field $B$ is called, as we know, the
curl of the original field $A$ and is denoted by $\operatorname{curl} A$.


b. The Coordinate Representation of the Correspondence Between Vector 
Fields A and B = curl A 


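We wish to learn how to calculate the coordinates of the field $B = \operatorname{curl} A$ in terms of
the coordinates of the vector field $A$. According to the procedure described above,
from the vector field $A = A_e^i e_i$ we build its corresponding 1-form $\omega_A^1 = \langle A, \cdot\rangle$:

$$\omega_A^1 = a_j\, dt^j = \frac{g_{ij}}{\sqrt{g_{ii}}}\, A_e^i\, dt^j.$$

We take its differential

$$d\omega_A^1 = \left( \frac{\partial}{\partial t^2} \left( \frac{g_{i3}}{\sqrt{g_{ii}}} A_e^i \right) - \frac{\partial}{\partial t^3} \left( \frac{g_{i2}}{\sqrt{g_{ii}}} A_e^i \right) \right) dt^2 \wedge dt^3 +$$

$$+ \left( \frac{\partial}{\partial t^3} \left( \frac{g_{i1}}{\sqrt{g_{ii}}} A_e^i \right) - \frac{\partial}{\partial t^1} \left( \frac{g_{i3}}{\sqrt{g_{ii}}} A_e^i \right) \right) dt^3 \wedge dt^1 +$$

$$+ \left( \frac{\partial}{\partial t^1} \left( \frac{g_{i2}}{\sqrt{g_{ii}}} A_e^i \right) - \frac{\partial}{\partial t^2} \left( \frac{g_{i1}}{\sqrt{g_{ii}}} A_e^i \right) \right) dt^1 \wedge dt^2,$$

considering this form a form of type $\omega_B^2$. By comparing the coefficients in
$\omega_B^2 = d\omega_A^1 = b_1\, dt^2 \wedge dt^3 + b_2\, dt^3 \wedge dt^1 + b_3\, dt^1 \wedge dt^2$, we obtain the coordinates

$$B_e^i = \frac{\sqrt{g_{ii}}}{\sqrt{\det(g_{ij})}}\, b_i$$

of the vector $B = \operatorname{curl} A$.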

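In the case of a 3-dimensional orthogonal system of curvilinear coordinates
$(t^1, t^2, t^3)$, the formula simplifies. In this case,

$$d\omega_A^1 = \left( \frac{\partial (\sqrt{g_{33}}\, A_e^3)}{\partial t^2} - \frac{\partial (\sqrt{g_{22}}\, A_e^2)}{\partial t^3} \right) dt^2 \wedge dt^3 +$$

$$+ \left( \frac{\partial (\sqrt{g_{11}}\, A_e^1)}{\partial t^3} - \frac{\partial (\sqrt{g_{33}}\, A_e^3)}{\partial t^1} \right) dt^3 \wedge dt^1 +$$

$$+ \left( \frac{\partial (\sqrt{g_{22}}\, A_e^2)}{\partial t^1} - \frac{\partial (\sqrt{g_{11}}\, A_e^1)}{\partial t^2} \right) dt^1 \wedge dt^2,$$

and using the notation $E_i = g_{ii}$, it is possible to write the coordinates of the vector
$\operatorname{curl} A = B = B_e^1 e_1 + B_e^2 e_2 + B_e^3 e_3$:

$$B_e^1 = \frac{1}{\sqrt{E_2 E_3}} \left( \frac{\partial (A_e^3 \sqrt{E_3})}{\partial t^2} - \frac{\partial (A_e^2 \sqrt{E_2})}{\partial t^3} \right),$$

$$B_e^2 = \frac{1}{\sqrt{E_3 E_1}} \left( \frac{\partial (A_e^1 \sqrt{E_1})}{\partial t^3} - \frac{\partial (A_e^3 \sqrt{E_3})}{\partial t^1} \right),$$

$$B_e^3 = \frac{1}{\sqrt{E_1 E_2}} \left( \frac{\partial (A_e^2 \sqrt{E_2})}{\partial t^1} - \frac{\partial (A_e^1 \sqrt{E_1})}{\partial t^2} \right),$$

which means that

$$\operatorname{curl} A = \frac{1}{\sqrt{E_1 E_2 E_3}} \begin{vmatrix} \sqrt{E_1}\, e_1 & \sqrt{E_2}\, e_2 & \sqrt{E_3}\, e_3 \\ \partial_1 & \partial_2 & \partial_3 \\ \sqrt{E_1}\, A_e^1 & \sqrt{E_2}\, A_e^2 & \sqrt{E_3}\, A_e^3 \end{vmatrix}.$$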

c. Curl in Cartesian, Cylindrical, and Spherical Coordinates 

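Problem 6 Write down the formula to calculate the curl of a vector field $A =
A_e^1 e_1 + A_e^2 e_2 + A_e^3 e_3$ in Cartesian, cylindrical, and spherical coordinates of the
Euclidean space $\mathbb{R}^3$.

Answer In Cartesian $(x, y, z)$, cylindrical $(r, \varphi, z)$, and spherical $(R, \varphi, \theta)$
coordinates of the Euclidean space, the curl $\operatorname{curl} A$ of the vector field $A = A_e^1 e_1 + A_e^2 e_2 +
A_e^3 e_3$ is calculated according to the formula

$$\operatorname{curl} A = \left( \frac{\partial A_z}{\partial y} - \frac{\partial A_y}{\partial z} \right) e_x + \left( \frac{\partial A_x}{\partial z} - \frac{\partial A_z}{\partial x} \right) e_y + \left( \frac{\partial A_y}{\partial x} - \frac{\partial A_x}{\partial y} \right) e_z =$$

$$= \frac{1}{r} \left( \frac{\partial A_z}{\partial \varphi} - \frac{\partial (r A_\varphi)}{\partial z} \right) e_r + \left( \frac{\partial A_r}{\partial z} - \frac{\partial A_z}{\partial r} \right) e_\varphi + \frac{1}{r} \left( \frac{\partial (r A_\varphi)}{\partial r} - \frac{\partial A_r}{\partial \varphi} \right) e_z =$$

$$= \frac{1}{R \cos\theta} \left( \frac{\partial A_\theta}{\partial \varphi} - \frac{\partial (A_\varphi \cos\theta)}{\partial \theta} \right) e_R + \frac{1}{R} \left( \frac{\partial A_R}{\partial \theta} - \frac{\partial (R A_\theta)}{\partial R} \right) e_\varphi + \frac{1}{R} \left( \frac{\partial (R A_\varphi)}{\partial R} - \frac{1}{\cos\theta} \frac{\partial A_R}{\partial \varphi} \right) e_\theta.$$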
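
Two short checks of the cylindrical formula may be helpful; both fields are our own
illustrative choices. The first is the rigid rotation $A = r\, e_\varphi$ (in Cartesian
coordinates $(-y, x, 0)$), whose curl is known to be $2 e_z$. The second is the field
$A = \frac{1}{r}\, e_\varphi$ (in Cartesian coordinates $(-y, x, 0)/(x^2 + y^2)$), which is
irrotational away from the axis $r = 0$.

```latex
% Worked checks (illustrative) in cylindrical coordinates (r, \varphi, z).
\begin{align*}
A = r\, e_\varphi:\quad
\operatorname{curl} A
  &= \frac{1}{r}\left(0 - \frac{\partial (r\cdot r)}{\partial z}\right) e_r
   + (0 - 0)\, e_\varphi
   + \frac{1}{r}\left(\frac{\partial (r\cdot r)}{\partial r} - 0\right) e_z
   = 2\, e_z, \\
A = \frac{1}{r}\, e_\varphi:\quad
\operatorname{curl} A
  &= \frac{1}{r}\left(0 - \frac{\partial (r\cdot \frac{1}{r})}{\partial z}\right) e_r
   + (0 - 0)\, e_\varphi
   + \frac{1}{r}\left(\frac{\partial (r\cdot \frac{1}{r})}{\partial r} - 0\right) e_z
   = 0 \quad (r > 0).
\end{align*}
```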



Appendix E 

Modern Formula of Newton-Leibniz 
and the Unity of Mathematics 
(Final Survey) 


E.1 Reminders

E.1.1 Differential, Differential Form, and the General Stokes’s 
Formula 

a. What Happened, and What Brought Us to This Kind of Life

We already began the ascent to the modern Newton-Leibniz formula at the very
beginning of this course of mathematical analysis, when we defined the differential
$df(x)$ of a function $f : X \to Y$ at the point $x$. By gradually analyzing this concept
in detail, we found that it is a linear function acting on the linear vector space
$T_x X$ of displacements from the point under consideration, with values in the space
$T_y Y$ of displacements from the point $y = f(x)$. The spaces $T_x X$ and $T_y Y$ are called
tangent spaces to $X$ and $Y$ at the corresponding points. The differential itself is also
called the tangent mapping or total derivative of the original mapping
(function) $f : X \to Y$ at the point $x$.

Once one has become acquainted with the concept of tangent line or tangent 
plane to a surface, one understands the origin and the geometric meaning of this 
terminology. 

Passing to functions of several variables and mappings of multidimensional ob- 
jects, we left the definition of the differential unchanged, but every time, we ex- 
plicitly deciphered the coordinate representation of the differential. In this way, the 
notion of the Jacobian matrix of a mapping appeared. 

We know that the differential of a function $f : \mathbb{R}^n \to \mathbb{R}$ has the form

$$df = \frac{\partial f}{\partial x^1}\, dx^1 + \cdots + \frac{\partial f}{\partial x^n}\, dx^n,$$

i.e., it is a linear combination of differentials of simple functions, the coordinate
functions, and the value of the differential $df(x)(\xi)$ at the vector $\xi \in T_x \mathbb{R}^n$ coincides
with the value of the derivative $D_\xi f(x)$ of the function along this vector, and
since $dx^i(\xi) = \xi^i$, one has

$$df(x)(\xi) = \frac{\partial f}{\partial x^1}\, \xi^1 + \cdots + \frac{\partial f}{\partial x^n}\, \xi^n.$$

If you are acquainted with the linear algebra of linear, multilinear, and
skew-symmetric forms and the operation of their exterior product, you could, by applying
this to differentials, write a differential form of the type

$$\omega^k(x) = a_{i_1 \ldots i_k}(x)\, dx^{i_1} \wedge \cdots \wedge dx^{i_k},$$

realizing that this is a skew-symmetric $k$-form on the tangent space whose value
on the set of vectors $(\xi_1, \ldots, \xi_k)$ can be calculated if the value of $dx^{i_1} \wedge \cdots \wedge
dx^{i_k}(\xi_1, \ldots, \xi_k)$ is known. The latter is equal to the determinant of the matrix

$$\begin{pmatrix} \xi_1^{i_1} & \cdots & \xi_1^{i_k} \\ \vdots & & \vdots \\ \xi_k^{i_1} & \cdots & \xi_k^{i_k} \end{pmatrix},$$

as we know from algebra (given that $dx^i(\xi) = \xi^i$).
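
The determinant rule can be made concrete with a few lines of code. The following
sketch evaluates a basic $k$-form $dx^{i_1} \wedge \cdots \wedge dx^{i_k}$ on a set of vectors;
the function names and the zero-based indexing of coordinates are our own conventions,
and the determinant is computed naively by the permutation expansion, which is
adequate only for small $k$.

```python
from itertools import permutations


def determinant(matrix):
    """Determinant by the permutation (Leibniz) expansion; fine for small sizes."""
    n = len(matrix)
    total = 0
    for perm in permutations(range(n)):
        # The sign of the permutation is (-1) to the number of inversions.
        sign = 1
        for a in range(n):
            for b in range(a + 1, n):
                if perm[a] > perm[b]:
                    sign = -sign
        product = sign
        for row, col in enumerate(perm):
            product *= matrix[row][col]
        total += product
    return total


def basic_form(indices, vectors):
    """Value of dx^{i_1} ^ ... ^ dx^{i_k} on the vectors (xi_1, ..., xi_k).
    Coordinates are indexed from 0 here (an implementation convention)."""
    if len(indices) != len(vectors):
        raise ValueError("a k-form needs exactly k vector arguments")
    # Row m holds the selected coordinates of the m-th vector,
    # because dx^i(xi) = xi^i.
    return determinant([[v[i] for i in indices] for v in vectors])


if __name__ == "__main__":
    xi1, xi2 = (1, 2, 3), (4, 5, 6)
    print(basic_form((0, 1), (xi1, xi2)))  # value on (xi1, xi2)
    print(basic_form((0, 1), (xi2, xi1)))  # skew symmetry: opposite sign
```

Interchanging two vector arguments changes the sign of the value, and a repeated index
gives zero, which are exactly the skew-symmetry properties used in the text.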

Recall that we were led to differential forms by the change of variables
formula for a multiple integral. For a one-dimensional integral, the form $f(x)\,dx$,
standing under the integral sign, dictated the correct change of variable formula
$f(\varphi(t))\,d\varphi(t)$. We were concerned, as Euler was, about the fact that this was not the
case for higher-dimensional integrals. We wanted to correct this deficiency and at
the same time understand what we are actually integrating, since the result should
not depend on the choice of the system of coordinates.

Analyzing this problem, we also had to figure out a number of concepts, not only
in algebra but also in geometry. We understood what a $k$-dimensional surface is,
what curvilinear coordinates, local charts (local maps), and an atlas are, what the
orientation of a surface is and how it is specified, what the boundary of a surface
and the induced orientation on the boundary are, and finally what all of this looks
like in the general case of manifolds of dimension $k$.

We had to analyze what occurs with our objects and operations under a change
of coordinate system. We also had to figure out the direction in which points,
vectors, and functions on those objects, in particular forms, are transferred under smooth
mappings, and how exactly to implement the corresponding transfer in coordinates.
At the same time, we convinced ourselves that the operation of differentiation
of forms is indeed invariant with respect to the choice of coordinate system. The
differentiation of forms, in the coordinate representation, is realized in the most simple
and natural way,

$$d\omega^k(x) = da_{i_1 \ldots i_k}(x) \wedge dx^{i_1} \wedge \cdots \wedge dx^{i_k},$$

which is often taken, for this reason, as the original definition of this operation.

Appealing to some suggestions from physics (computation of work, flux), we
realized that we integrate differential forms not only because they solve the original
problem about the change of variables formula in multiple integrals, but also because they
lead to the following far-reaching generalization of the classical Newton-Leibniz
formula:

$$\int_{M^k} d\omega^{k-1} = \int_{\partial M^k} \omega^{k-1}.$$

This formula, frequently called the general Stokes’s formula, rightfully should be
called the Newton-Leibniz-Gauss-Ostrogradskii-Green-Maxwell-Cartan-Poincaré
formula.


b. The Problem of Primitives Yesterday and Today 

One of the very first questions in classical mathematical analysis is the question
about the inversion of the operation of differentiation, more precisely, the question
of whether every function $f$ (for example, continuous) is the derivative of some
other function, and if so, how to find the antiderivative or primitive $F$ of the given
function. In the language of forms, this question is whether a 1-form $f(x)\,dx$ is the
differential $dF$ of some 0-form, i.e., a function $F$.

We gave a positive answer to this question, considering everything over a
numerical interval. We did not even consider any other situation. If you ask yourself the
same question, for example, for a function identically equal to one on the circle or
for the corresponding form $d\varphi$, you will immediately realize that the answer is
negative. There is no differentiable function on the circle whose derivative everywhere
is equal to one.

This is one of the manifestations of a relation between a question of global anal- 
ysis and the topology of the domain, where the question is posed and solved. 

A significant part of the following text is devoted to a deeper, although not com- 
plete, discussion of this relation. 

Generalizing the classical situation, we shall ask the following question: Given a
differential $k$-form $\omega^k$, we look for a $(k-1)$-form $\omega^{k-1}$ such that $\omega^k = d\omega^{k-1}$.


c. Closed and Exact Differential Forms 

Differential forms $\omega^k$ having a primitive (i.e., being the differential of some form
$\omega^{k-1}$: $\omega^k = d\omega^{k-1}$) are called exact forms.

It is easy to see that an obvious necessary differential condition for the
exactness of a form $\omega^k$ is the equality $d\omega^k = 0$, due to the fact that the exterior
differential applied twice to any differential form is identically zero.

If the differential of a form is equal to zero, the form is called closed. 

Thus, closedness is a necessary condition for the exactness of a form. 

Previously, we considered in all details and interpretations the case of 1-forms.
We also convinced ourselves that although closedness is a necessary condition for
exactness, this condition is not sufficient, and its sufficiency is essentially tied to the
topology of the domain in which the problem is posed.

In physics, potential vector fields play an important role. If we have a scalar
product $\langle\,,\rangle$ (or a nondegenerate bilinear form) in some space, then there arises a
correspondence between linear functions (forms) and vector fields, defined by the
equality $\omega_A^1(x)(\xi) = \langle A(x), \xi\rangle$. Incidentally, when we want to calculate the work
done by a vector field along a path $\gamma$, we just integrate the
form $\omega_A^1$, called a work form. The remarkable characteristic of potential vector fields
is that the work done by such a field depends only on the beginning and the end of the
path of transition and is equal to the difference between the values of the potential
generating this field. In particular, the work of such a vector field over a closed
contour (a cycle) is zero.

In the language of vector fields, the differential characteristic of a potential vector
field is, as we know, that it has no rotation (its curl vanishes). We also know
that irrotational vector fields are not always potential vector fields, and this depends on
the topology of the domain on which they act. In a simply connected domain, this
necessary characteristic is also sufficient. For example, in a three-dimensional ball
or a ball with deleted center, or in a cut-out ball, every irrotational field is a potential
field; in the two-dimensional disk this is also the case, but in the disk with the center
deleted, it is no longer the case. (Recall the typical example: in writing the form $d\varphi$
in Cartesian coordinates $(x, y)$, we considered the vector field $(-y, x)/(x^2 + y^2)$
corresponding to it.)
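
The typical example can be tested numerically. The sketch below, with function names
and step count chosen by us, approximates the work of the field $(-y, x)/(x^2 + y^2)$
along a circle by the midpoint rule. For a circle going once around the deleted center
the work is close to $2\pi$, so the field is not potential in the punctured disk; for a
circle that does not enclose the center the work is close to zero.

```python
import math


def field(x, y):
    """The vector field (-y, x) / (x^2 + y^2), defined away from the origin."""
    r2 = x * x + y * y
    return (-y / r2, x / r2)


def work_around_circle(center, radius, steps=20000):
    """Midpoint-rule approximation of the work of the field along a circle
    traversed once counterclockwise."""
    cx, cy = center
    dt = 2.0 * math.pi / steps
    total = 0.0
    for k in range(steps):
        t = (k + 0.5) * dt
        x = cx + radius * math.cos(t)
        y = cy + radius * math.sin(t)
        # Displacement along the circle for the parameter increment dt.
        dx = -radius * math.sin(t) * dt
        dy = radius * math.cos(t) * dt
        ax, ay = field(x, y)
        total += ax * dx + ay * dy
    return total


if __name__ == "__main__":
    print(work_around_circle((0.0, 0.0), 1.0))  # circle around the origin
    print(work_around_circle((3.0, 0.0), 1.0))  # circle not enclosing the origin
    print(2.0 * math.pi)
```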

Along with the necessary differential condition of exactness of a form, which 
“feels” the form locally, we had an integral criterion for exactness of 1 -forms, con- 
sisting in the fact that the integral of a form over any cycle (closed path) lying in the 
considered domain is always equal to zero. 

This integral criterion for the exactness of forms remains true with respect to 
forms of any degree, with the proper understanding of what the cycle of the corre- 
sponding dimension should be. 

This is one of de Rham’s theorems, which has as a consequence a much older
theorem, known as Poincaré’s lemma, asserting that in the space $\mathbb{R}^n$, in a ball, or
on any other domain homeomorphic to it, every closed form is exact.


E.1.2 Manifolds, Chains, and the Boundary Operator

a. Cycles and Boundaries 

In Stokes’s formula above we have geometric objects (curves, surfaces, manifolds,
and their boundaries), on which we integrate the corresponding
differential forms.

Similar to the operator $d$ of differentiation, we have the operator $\partial$, which maps
surfaces to their boundary. The boundary $\partial M^k$ of a manifold $M^k$ is also a manifold,
but with one dimension fewer. Moreover, the manifold $\partial M^k$ no longer has a boundary,
i.e., the reapplication of the operator $\partial$ always gives the empty set. In this sense, the
operators $d$ and $\partial$ are similar. But whereas the operator $d$ increases the degree of the
object by 1, the operator $\partial$ reduces the dimension by 1.

The concepts of closedness and exactness of forms correspond here to the
concepts of cycles and boundaries.

A compact surface, a manifold $M^k$ (later we shall say also chain) of dimension $k$,
is called a cycle of dimension $k$ if $\partial M = \varnothing$, i.e., $M$ does not have any boundary
points.

Thus, the sphere of dimension $k$ is a cycle of dimension $k$.

A surface, manifold $M^k$ (a chain), is called a boundary if it has a “primitive” in
the sense that there is a surface or manifold $M^{k+1}$ (chain) such that $\partial M^{k+1} = M^k$.

It is clear that if the surface or manifold is the boundary of some other compact
manifold, then it must be a cycle. However, the situation here is similar to that of
forms, where the conditions are necessary but in general not sufficient to ensure that
in the domain where this cycle lies, there is also a manifold such that the cycle is
the boundary of that manifold.

Take, for example, a circular ring, or annulus, in the plane. Then every circle
encircling the hole is a cycle, but it is not the boundary of a manifold lying in
the annulus. But if instead of an annulus we consider a disk, then the situation is
radically different.

Let us consider the boundary of the annulus, recalling the following fact. The
operator $\partial$ of taking the boundary is not a simple set-theoretic transformation.
From an atlas of the surface or manifold, this operator produces an atlas of the
boundary, which is called the induced atlas of the boundary. If the original atlas consists
of compatible charts, then the induced atlas will also have this
property. Thus if the manifold is orientable, then its boundary possesses an orientation,
which is called the induced orientation or agreed or compatible orientation of
the boundary.

If the annulus $G$ that we just discussed is oriented with the standard left frame of
the Cartesian coordinates in the plane, then its boundary, consisting of two circles
$\gamma_1$, $\gamma_2$, will be oriented such that the outer circle $\gamma_2$ goes in the positive direction
(counterclockwise) and the inner circle $\gamma_1$ is negatively oriented (clockwise). The
integral over such a boundary reduces to the difference between the integrals over $\gamma_2^+$
and $\gamma_1^+$. It is useful to write this as $\partial G = \gamma_2^+ - \gamma_1^+$.

For example, if you need to calculate the work that is accomplished by five turns
along the path $\gamma_2^+$, then three along the path $\gamma_1^+$, and finally two along $\gamma_2^-$, then
you have to integrate over the chain
$5\gamma_2^+ + 3\gamma_1^+ + 2\gamma_2^- = 5\gamma_2^+ + 3\gamma_1^+ - 2\gamma_2^+ = 3\gamma_2^+ + 3\gamma_1^+$.
The integration over such a chain corresponds, of course, to a linear
combination of the integrals over $\gamma_1^+$ and $\gamma_2^+$.

This discussion illustrates why it is useful to consider linear combinations of
geometric objects. These are called chains. We have explained here only where
the concept of chains comes from, what chains are in general, and where and why
they are useful. We are not going into general and formal definitions, since we do
not need them here in the more general form, and they can be found in the book.
Just as in analysis we are forced to go from the usual ordinary
functions to generalized functions, in geometry one goes from the simplest objects
like cubes and chains of cubes to their generalizations like singular cubes and chains
of singular cubes. Moreover, we then make the next extension and invent the concept
of flux, which combines differential forms, generalized functions, and manifolds.





b. Homological Cycles 

We shall see below that it is sometimes possible to calculate the integral of a form 
over a cycle by going to some other cycle, sometimes significantly simpler, which 
is in some way associated with the original cycle. This is a remarkable, important, 
and useful fact, which is used in different areas of mathematics and its applications. 

In order to understand the relation between cycles, we have to consider the fol- 
lowing fact: their difference must be the boundary of an object lying on the domain 
we are considering. We say that such cycles are homologous in this domain. 

For example, two closed oriented paths $\gamma_1^+$, $\gamma_2^+$ on a domain $D$ (or on a manifold
$M$) are homologous if we can find an oriented surface $S_+ \subset D$ ($\subset M$) such
that $\partial S_+ = \gamma_2^+ - \gamma_1^+$.

Thus, the circles $\gamma_1^+$, $\gamma_2^+$ considered above are homologous in the annulus $G_+$.

Since the operator $\partial$ of taking the boundary is extended by linearity to chains,
it is possible to define homology for chains as well.

For instance, the chains $\gamma_1^+$ and $2\gamma_2^+$ are not homologous on the annulus $G_+$.

We shall discuss the role and applications of the concept of homology of cycles 
in the context of the integration of differential forms. 


E.2 Pairing 

E.2.1 The Integral as a Bilinear Function and General Stokes’s 
Formula 

a. The Integral of an Exact Form over a Cycle and of a Closed Form over a 
Boundary 

We introduce first some useful notation. 

Let $\Omega(M)$ denote the whole set of differential forms on a manifold (or surface)
$M$, and let $\Omega^k(M)$ denote the subset of forms of order $k$ (i.e., $k$-forms), $Z^k(M)$ its
subset of closed $k$-forms, and $B^k(M)$ its subset of exact $k$-forms.

Analogously, let $C(M)$ be the set of chains on a manifold (or surface) $M$, and
let $C_k(M)$ be the subset of chains of dimension $k$ ($k$-chains), $Z_k(M)$ the subset of
cycles ($k$-cycles), and $B_k(M)$ its subset of boundary cycles ($k$-boundaries).

Thus, $\Omega(M) \supset \Omega^k(M) \supset Z^k(M) \supset B^k(M)$ and
$C(M) \supset C_k(M) \supset Z_k(M) \supset B_k(M)$.

As long as we do not change the manifold $M$ on which we wish to calculate
something, we shall simplify the notation just introduced by dropping the symbol $M$
whenever this does not lead to confusion.

Now we shall make a concluding remark. 

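Consider the integral of an exact form $b^k \in B^k$ over a cycle $z_k \in Z_k$ and of a
closed form $z^k \in Z^k$ over a boundary $b_k \in B_k$. Writing $b^k = d\omega^{k-1}$ and
$b_k = \partial c_{k+1}$ and employing Stokes’s formula, we find that

$$\int_{z_k} b^k = \int_{z_k} d\omega^{k-1} = \int_{\partial z_k} \omega^{k-1} = 0$$

and

$$\int_{b_k} z^k = \int_{\partial c_{k+1}} z^k = \int_{c_{k+1}} dz^k = \int_{c_{k+1}} 0 = 0.$$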


b. Integral of a Closed Form over a Cycle and Its Invariance Under Certain 
Changes of the Form and the Cycle 

The remark that we just made leads to the following important and very useful 
conclusion. 

We shall now consider the integral of a closed form $z^k$ over a cycle $z_k$. Given that
the addition of an exact form $b^k$ to a closed form $z^k$ gives again a closed form (since
$d(z^k + b^k) = dz^k + db^k = 0$), and the addition of a boundary cycle $b_k$ to a cycle $z_k$
gives again a cycle (since $\partial(b_k + z_k) = \partial b_k + \partial z_k = 0$), recalling the remark we just
made, we can now write the following chain of equalities:

$$\int_{z_k} z^k = \int_{z_k} (z^k + b^k) = \int_{z_k + b_k} (z^k + b^k) = \int_{[z_k]} [z^k].$$

Here $[z^k]$ means the class of forms that differ from the original form $z^k$ by
an exact form, and $[z_k]$ is the class of cycles differing from the original one by a
boundary cycle.

Thus, in calculating the integral of a closed form $z^k$ over a cycle $z_k$, we can afford
to choose, without changing the value of the integral, any cycle from the class $[z_k]$
and any form from the class $[z^k]$.
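
This invariance is easy to observe numerically. The sketch below is an illustration under assumptions of our own, not taken from the text: the domain is the punctured plane, the closed form is $(-y\,dx + x\,dy)/(x^2 + y^2)$, the exact form added to it is $d(xy + x^2)$, and the unit circle is replaced by a homologous ellipse. All function names are hypothetical.

```python
import math


def integrate(form, path, steps=4000):
    """Midpoint-rule integral of the 1-form P dx + Q dy over a closed path.

    form(x, y) returns (P, Q); path(t) returns (x, y, dx/dt, dy/dt)
    for t in [0, 2*pi].
    """
    dt = 2.0 * math.pi / steps
    total = 0.0
    for i in range(steps):
        x, y, vx, vy = path((i + 0.5) * dt)
        p, q = form(x, y)
        total += (p * vx + q * vy) * dt
    return total


def closed_form(x, y):
    # Closed but not exact on the punctured plane.
    r2 = x * x + y * y
    return (-y / r2, x / r2)


def exact_form(x, y):
    # d(x*y + x^2) = (y + 2x) dx + x dy
    return (y + 2.0 * x, x)


def shifted_form(x, y):
    # Same cohomology class as closed_form: closed form plus exact form.
    p1, q1 = closed_form(x, y)
    p2, q2 = exact_form(x, y)
    return (p1 + p2, q1 + q2)


def circle(t):
    return (math.cos(t), math.sin(t), -math.sin(t), math.cos(t))


def ellipse(t):
    # Homologous to the unit circle: the region between them avoids the origin.
    return (3.0 * math.cos(t), 2.0 * math.sin(t),
            -3.0 * math.sin(t), 2.0 * math.cos(t))


original = integrate(closed_form, circle)
new_form = integrate(shifted_form, circle)
new_form_new_cycle = integrate(shifted_form, ellipse)
print(original, new_form, new_form_new_cycle)
```

All three numbers agree up to rounding: changing the form by an exact form and the cycle by a boundary does not change the integral.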


E.2.2 Equivalence Relations (Homology and Cohomology)

a. Toward Uniformity in Terminology: Cycles and Cocycles, Boundaries 
and Coboundaries 

Along with the unification of notation, it is convenient to agree on the following
standardization of terminology. Since the elements of the sets $Z_k$ and $B_k$ are called
cycles and boundaries, respectively, we shall call the elements of $Z^k$ and $B^k$ cocycles
and coboundaries, respectively.

Thus a cocycle is a closed differential form, and a coboundary is an exact differential
form.

b. Homology and Cohomology

A class $[z_k]$, or more precisely a class $[z_k](M)$, is called a homology class of the
cycle $z_k$ on the manifold (or surface) $M$.

A class $[z^k]$, or more precisely a class $[z^k](M)$, is called a cohomology class of
the cocycle $z^k$ on the manifold (or surface) $M$.

The operator $\partial$ taking boundaries of chains is called a boundary operator, and the
operator $d$ acting on differential forms is called a coboundary operator.

Two cycles are homologous on the manifold (or surface) M if their difference is 
the boundary of a chain lying on M. 

Two cocycles are cohomologous on the manifold (or surface) M if their differ- 
ence is a coboundary on M (i.e., two closed forms are cohomologous on the mani- 
fold if their difference is an exact form on the manifold). 


E.2.3 Pairing of Homology and Cohomology Classes 

a. The Integral as a Bilinear Function 

The integral $\int_{c_k} \omega^k$ of a $k$-form over a chain on some manifold $M$ can be considered
a pairing $\langle \omega^k, c_k \rangle$ of objects from two vector spaces, namely the linear space of
$k$-forms $\Omega^k$ and the linear space of $k$-chains $C_k$.

We can conclude, knowing the properties of the integral, that the operation
$\langle \omega^k, c_k \rangle$ is bilinear.


b. Nondegeneracy of the Bilinear Form of Pairing (de Rham Theorem) 

When we considered the above pairing between cycles and cocycles, we obtained
an important result, which can be stated now in the following form:

$$\langle z^k, z_k \rangle = \langle [z^k], [z_k] \rangle.$$

Recalling the definition of the cohomology and homology classes $[z^k]$, $[z_k]$,
we can say that they are elements of the quotient spaces $H^k := Z^k / B^k$ and
$H_k := Z_k / B_k$, respectively.

The vector spaces $H^k$ and $H_k$, whose complete notation is $H^k(M)$ and $H_k(M)$,
are called the space of $k$-dimensional cohomology of the manifold $M$ and the space
of $k$-dimensional homology of the manifold $M$, respectively.

Thus, the integral actually also pairs cohomology and homology classes. The
pairing $\langle [z^k], [z_k] \rangle$ is clearly bilinear and is nondegenerate, as was shown by de Rham.

(Recall that a bilinear form $\langle\ ,\ \rangle$ is called nondegenerate if once we fix one of the
arguments with a nonzero value, the form is not identically zero with respect to the
other argument.)

c. Integral Criterion for the Exactness of a Closed Form

De Rham’s theorem that we just mentioned implies the following criterion of exactness
of a closed form: A closed form $z^k = \omega^k$ on a manifold (surface, domain) $M$
is exact on $M$ if and only if the integral of this form over every $k$-dimensional cycle
lying on $M$ is equal to zero.

Indeed, if $\langle z^k, z_k \rangle = 0$ for every cycle $z_k$ lying on $M$, then according to de Rham’s
theorem, $[z^k] = 0$ in $H^k = Z^k / B^k$. This means that $z^k \in B^k$.

We have examined the case of 1-forms in detail, and we also
proved this criterion in that case. We have now established this criterion in general.

In particular, given a manifold or domain carrying an irrotational vector field or a
divergence-free vector field, you can now say by looking at the manifold or domain
whether the field is a potential field or has a vector potential (i.e., is the curl of
some vector field), respectively.
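
As a hedged illustration of the criterion for 2-forms, consider the field $F = r/|r|^3$ in $\mathbb{R}^3$ without the origin; this example and the function names below are our own choices, not taken from the text. The Python sketch checks numerically that $F$ is divergence-free at a sample point, yet its flux through a sphere around the origin (a 2-cycle that is not a boundary in this domain) is nonzero, so $F$ cannot have a vector potential there.

```python
import math


def field(x, y, z):
    """F = r / |r|^3, defined away from the origin."""
    r3 = (x * x + y * y + z * z) ** 1.5
    return (x / r3, y / r3, z / r3)


def divergence(x, y, z, h=1e-4):
    """Central-difference estimate of div F at a point."""
    dfx = (field(x + h, y, z)[0] - field(x - h, y, z)[0]) / (2 * h)
    dfy = (field(x, y + h, z)[1] - field(x, y - h, z)[1]) / (2 * h)
    dfz = (field(x, y, z + h)[2] - field(x, y, z - h)[2]) / (2 * h)
    return dfx + dfy + dfz


def flux_through_sphere(radius, n_theta=200, n_phi=400):
    """Midpoint-rule flux of F through the sphere |r| = radius (outward normal)."""
    d_theta = math.pi / n_theta
    d_phi = 2.0 * math.pi / n_phi
    total = 0.0
    for i in range(n_theta):
        theta = (i + 0.5) * d_theta
        for j in range(n_phi):
            phi = (j + 0.5) * d_phi
            # Outward unit normal at the sample point.
            nx = math.sin(theta) * math.cos(phi)
            ny = math.sin(theta) * math.sin(phi)
            nz = math.cos(theta)
            fx, fy, fz = field(radius * nx, radius * ny, radius * nz)
            area = radius ** 2 * math.sin(theta) * d_theta * d_phi
            total += (fx * nx + fy * ny + fz * nz) * area
    return total


div_sample = divergence(0.3, -0.7, 0.5)
flux = flux_through_sphere(2.0)
print(div_sample, flux, 4.0 * math.pi)
```

The flux is close to $4\pi$ for any radius, while the sampled divergence is numerically zero. In a ball not containing the origin every such sphere would bound, and the same field would have a vector potential.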

We can also use de Rham’s theorem on the second argument, of course. For 
example, if we know that on some manifold all the closed k-forms are exact, we can 
say that on this manifold every k-cycle is a boundary cycle (homologous to zero). 
Thus, we have a conclusion about the topology of the manifold. 


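E.2.4 Another Interpretation of Homology and Cohomology

a. Duality of Operators $d$ and $\partial$

In the notation of the pairing $\langle \omega^k, c_k \rangle$, Stokes’s formula has the form

$$\langle d\omega^{k-1}, c_k \rangle = \langle \omega^{k-1}, \partial c_k \rangle,$$

showing the duality between the operators $d$ and $\partial$.

b. The Operators $d$ and $\partial$ as Mappings

In some cases, it is useful to write the full notation of the operators $d$ and $\partial$, for
example, in the notation of the following sequences of linear mappings:

$$\cdots \xrightarrow{d^{k-2}} \Omega^{k-1} \xrightarrow{d^{k-1}} \Omega^{k} \xrightarrow{d^{k}} \Omega^{k+1} \xrightarrow{d^{k+1}} \cdots$$

$$\cdots \xleftarrow{\partial_{k-1}} C_{k-1} \xleftarrow{\partial_{k}} C_{k} \xleftarrow{\partial_{k+1}} C_{k+1} \xleftarrow{\partial_{k+2}} \cdots$$

Using the standard notations Ker and Im for the kernel and the image of a linear
mapping, we can write, for example, that

$$Z^k = \operatorname{Ker} d^k, \quad Z_k = \operatorname{Ker} \partial_k, \quad B^k = \operatorname{Im} d^{k-1}, \quad B_k = \operatorname{Im} \partial_{k+1},$$

and thus

$$H^k = \operatorname{Ker} d^k / \operatorname{Im} d^{k-1} \quad \text{and} \quad H_k = \operatorname{Ker} \partial_k / \operatorname{Im} \partial_{k+1}.$$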
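
The formula $H_k = \operatorname{Ker} \partial_k / \operatorname{Im} \partial_{k+1}$ turns the computation of homology into linear algebra. The following Python sketch is an illustration under assumptions of our own: it uses a triangle built from simplices (rather than the cubes of the main text), works over the rational numbers, and the names `rank`, `betti`, `D1`, and `D2` are hypothetical. It computes $\dim H_k = \dim C_k - \operatorname{rank} \partial_k - \operatorname{rank} \partial_{k+1}$ for the boundary of a triangle (a combinatorial circle) and for the filled triangle.

```python
from fractions import Fraction


def rank(matrix):
    """Rank of a matrix (list of rows) by Gaussian elimination over the rationals."""
    rows = [[Fraction(x) for x in row] for row in matrix]
    ncols = len(rows[0]) if rows else 0
    r = 0
    for col in range(ncols):
        pivot = next((i for i in range(r, len(rows)) if rows[i][col] != 0), None)
        if pivot is None:
            continue
        rows[r], rows[pivot] = rows[pivot], rows[r]
        for i in range(len(rows)):
            if i != r and rows[i][col] != 0:
                factor = rows[i][col] / rows[r][col]
                rows[i] = [a - factor * b for a, b in zip(rows[i], rows[r])]
        r += 1
    return r


def betti(dim_chains, rank_out, rank_in):
    """dim H_k = (dim C_k - rank of d_k) - rank of d_{k+1}."""
    return dim_chains - rank_out - rank_in


# Boundary operator C_1 -> C_0: rows are vertices v0, v1, v2;
# columns are the oriented edges e01, e12, e02 (each edge maps to end - start).
D1 = [[-1, 0, -1],
      [1, -1, 0],
      [0, 1, 1]]

# Boundary operator C_2 -> C_1 for the filled triangle: face -> e01 + e12 - e02.
D2 = [[1], [1], [-1]]

# The boundary of a boundary vanishes: D1 * D2 = 0.
composite = [sum(D1[i][j] * D2[j][0] for j in range(3)) for i in range(3)]

# Hollow triangle (a circle): no 2-chains, so the rank of d_2 is 0.
circle_h0 = betti(3, 0, rank(D1))
circle_h1 = betti(3, rank(D1), 0)

# Filled triangle (a disk): the 1-cycle becomes a boundary.
disk_h1 = betti(3, rank(D1), rank(D2))

print(composite, circle_h0, circle_h1, disk_h1)
```

The hollow triangle has a one-dimensional $H_1$, generated by the cycle that goes around the hole, whereas for the filled triangle this cycle is a boundary and $H_1$ vanishes. This is the combinatorial counterpart of the annulus and disk discussed earlier.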


E.2.5 Remarks

A few words as a conclusion. I repeat that this is just an overview, an overview of
the principles that does not go into details. The details are covered in the textbook,
and numerous developments are given in the specialized literature, which is easier
to read with an initial idea of the subject, of course.

In physics and mechanics, we often speak in the language of vector fields. How- 
ever, you now know how to translate problems in the language of vector fields into 
the language of differential forms, and conversely you know how to relate standard 
operators like grad, curl, div with the operator d of the exterior differentiation of 
forms. 

In continuum mechanics, Hamilton’s operator $\nabla$ is used. Some techniques
that are used with it are presented in the text. There you will also find the answer
to the question of how to represent and calculate the operators grad, curl, div in
curvilinear coordinates.

All of this, including Stokes’s formula, has numerous applications. For example,
look at the derivation of Euler’s equation in continuum mechanics, or write down
Maxwell’s equations for an electromagnetic field. I shall not mention the internal
mathematical applications in analysis, especially complex analysis, geometry, algebraic
topology...


Index of Basic Notation


Logical symbols

$\Rightarrow$  logical consequence (implication)
$\Leftrightarrow$  logical equivalence
$:=$, $=:$  equality by definition; colon on the side of the object defined


Sets

$\bar{E}$  closure of the set $E$
$\partial E$  boundary of the set $E$
$\overset{\circ}{E} := E \setminus \partial E$  interior of the set $E$
$B(x, r)$  ball of radius $r$ with center at $x$
$S(x, r)$  sphere of radius $r$ with center at $x$

Spaces

$(X, d)$  metric space with metric $d$
$(X, \tau)$  topological space with system $\tau$ of open sets
$\mathbb{R}^n$ ($\mathbb{C}^n$)  $n$-dimensional real (complex) space
$\mathbb{R}^1 = \mathbb{R}$ ($\mathbb{C}^1 = \mathbb{C}$)  set of real (complex) numbers
$x = (x^1, \ldots, x^n)$  coordinate expression of a point of $n$-dimensional space
$C(X, Y)$  set (space) of continuous functions on $X$ with values in $Y$
$C[a, b]$  abbreviation for $C([a, b], \mathbb{R})$ or $C([a, b], \mathbb{C})$
$C^{(k)}(X, Y)$  set of mappings from $X$ into $Y$ that are $k$ times continuously differentiable
$C^{(k)}[a, b]$  abbreviation for $C^{(k)}([a, b], \mathbb{R})$ or $C^{(k)}([a, b], \mathbb{C})$
$C_p[a, b]$  space $C[a, b]$ endowed with norm $\|f\|_p$
$C_2[a, b]$  space $C[a, b]$ with Hermitian inner product $\langle f, g \rangle$ of functions or mean-square deviation norm
$\mathcal{R}(E)$  set (space) of functions that are Riemann integrable over the set $E$
$\mathcal{R}[a, b]$  space $\mathcal{R}(E)$ when $E = [a, b]$
$\tilde{\mathcal{R}}(E)$  space of classes of Riemann integrable functions on $E$ that are equal almost everywhere on $E$
$\mathcal{R}_p(E)$ ($\tilde{\mathcal{R}}_p(E)$)  space $\mathcal{R}(E)$ endowed with norm $\|f\|_p$
$\mathcal{R}_2(E)$ ($\tilde{\mathcal{R}}_2(E)$)  space $\mathcal{R}(E)$ endowed with Hermitian inner product $\langle f, g \rangle$ or mean-square deviation norm
$\mathcal{R}_p[a, b]$, $\mathcal{R}_2[a, b]$  spaces $\mathcal{R}_p(E)$ and $\mathcal{R}_2(E)$ when $E = [a, b]$
$\mathcal{L}(X; Y)$, ($\mathcal{L}(X_1, \ldots, X_n; Y)$)  space of linear ($n$-linear) mappings from $X$ (from $X_1 \times \cdots \times X_n$) into $Y$
$TM_p$ or $TM(p)$, $T_pM$, $T_p(M)$  tangent space to the surface (manifold) $M$ at the point $p \in M$
$S$  Schwartz space of rapidly decreasing functions
$\mathcal{D}(G)$  space of fundamental functions of compact support in the domain $G$
$\mathcal{D}'(G)$  space of generalized functions on the domain $G$
$\mathcal{D}$  an abbreviation for $\mathcal{D}(G)$ when $G = \mathbb{R}^n$
$\mathcal{D}'$  an abbreviation for $\mathcal{D}'(G)$ when $G = \mathbb{R}^n$

Metrics, norms, inner products

$d(x_1, x_2)$  distance between points $x_1$ and $x_2$ in the metric space $(X, d)$
$|x|$, $\|x\|$  absolute value (norm) of a vector $x \in X$ in a normed vector space
$\|A\|$  norm of the linear (multilinear) operator $A$
$\|f\|_p := \left( \int_E |f|^p(x)\,dx \right)^{1/p}$, $p \ge 1$  integral norm of the function $f$
$\|f\|_2$  mean-square deviation norm ($\|f\|_p$ when $p = 2$)
$\langle a, b \rangle$  Hermitian inner product of the vectors $a$ and $b$
$\langle f, g \rangle := \int_E (f \cdot \bar{g})(x)\,dx$  Hermitian inner product of the functions $f$ and $g$
$a \cdot b$  inner product of $a$ and $b$ in $\mathbb{R}^3$
$a \times b$  vector (cross) product of vectors $a$ and $b$ in $\mathbb{R}^3$
$(a, b, c)$  scalar triple product of vectors $a$, $b$, $c$ in $\mathbb{R}^3$

Functions

$g \circ f$  composition of functions $f$ and $g$
$f^{-1}$  inverse of the function $f$
$f(x)$  value of the function $f$ at the point $x$; a function of $x$
$f(x^1, \ldots, x^n)$  value of the function $f$ at the point $x = (x^1, \ldots, x^n) \in X$ in the $n$-dimensional space $X$; a function depending on $n$ variables $x^1, \ldots, x^n$
supp $f$  support of the function $f$
J f(x)  jump of the function $f$ at the point $x$
$\{f_t;\ t \in T\}$  a family of functions depending on the parameter $t \in T$
$\{f_n;\ n \in \mathbb{N}\}$ or $\{f_n\}$  a sequence of functions
$f_t \xrightarrow[\mathcal{B}]{} f$ on $E$  convergence of the family of functions $\{f_t;\ t \in T\}$ to the function $f$ on the set $E$ over the base $\mathcal{B}$ in $T$
$f_t \underset{\mathcal{B}}{\rightrightarrows} f$ on $E$  uniform convergence of the family of functions $\{f_t;\ t \in T\}$ to the function $f$ on the set $E$ over the base $\mathcal{B}$ in $T$
$f = o(g)$ over $\mathcal{B}$, $f = O(g)$ over $\mathcal{B}$, $f \sim g$ or $f \simeq g$ over $\mathcal{B}$  asymptotic formulas (the symbols of comparative asymptotic behavior of the functions $f$ and $g$ over the base $\mathcal{B}$)
$f(x) \simeq \sum_{n=1}^{\infty} \varphi_n(x)$ over $\mathcal{B}$  expansion in an asymptotic series
$\mathcal{D}(x)$  Dirichlet function
$\exp(A)$  exponential of a linear operator $A$
$B(\alpha, \beta)$  Euler beta function
$\Gamma(\alpha)$  Euler gamma function
$\chi_E$  characteristic function of the set $E$

Differential calculus

$f'(x)$, $f_*(x)$, $df(x)$, $Df(x)$  tangent mapping to $f$ (differential of $f$) at the point $x$
$\frac{\partial f}{\partial x^i}(x)$, $\partial_i f(x)$, $D_i f(x)$  partial derivative (partial differential) of a function $f$ depending on variables $x^1, \ldots, x^n$ at the point $x = (x^1, \ldots, x^n)$ with respect to the variable $x^i$
$D_v f(x)$  derivative of the function $f$ with respect to the vector $v$ at the point $x$
$\nabla$  Hamilton’s nabla operator
grad $f$  gradient of the function $f$
div $A$  divergence of the vector field $A$
curl $B$  curl of the vector field $B$

Integral calculus

$\mu(E)$  measure of the set $E$
$\int_E f(x)\,dx$, $\int_E f(x^1, \ldots, x^n)\,dx^1 \cdots dx^n$, $\int \cdots \int_E f(x^1, \ldots, x^n)\,dx^1 \cdots dx^n$  integral of the function $f$ over the set $E \subset \mathbb{R}^n$
$\int_Y dy \int_X f(x, y)\,dx$  iterated integral
$\int_\gamma P\,dx + Q\,dy + R\,dz$, $\int_\gamma F \cdot ds$, $\int_\gamma \langle F, ds \rangle$  curvilinear integral (of second kind) or the work of the field $F = (P, Q, R)$ along the path $\gamma$
$\int_\gamma f\,ds$  curvilinear integral (of first kind) of the function $f$ along the curve $\gamma$
$\iint_S P\,dy \wedge dz + Q\,dz \wedge dx + R\,dx \wedge dy$, $\iint_S F \cdot d\sigma$, $\iint_S \langle F, d\sigma \rangle$  integral (of second kind) over the surface $S$ in $\mathbb{R}^3$; flux of the field $F = (P, Q, R)$ across the surface $S$
$\iint_S f\,d\sigma$  surface integral (of first kind) of $f$ over the surface $S$

Differential forms

$\omega$ ($\omega^p$)  a differential form (of degree $p$)
$\omega^p \wedge \omega^q$  exterior product of forms $\omega^p$ and $\omega^q$
$d\omega$  (exterior) derivative of the form $\omega$
$\int_M \omega$  integral of the form $\omega$ over the surface (manifold) $M$
$\omega_F^1 := \langle F, \cdot \rangle$  work form
$\omega_V^2 := (V, \cdot, \cdot)$  flux form


Subject Index

Symbols

$\delta$-function, 281
$\delta$-neighborhood, 5, 11
$\varepsilon$-grid, 17
$k$-cell, 255
$k$-dimensional volume, 186
$k$-path, 255
$n$-dimensional disk, 180, 323
$n$th moment, 402
$p$-cycle, 358
$p$-form, 198
$T_1$ space, 14
$T_2$ space, 14
8 -formula, 586 

A 

Abel summation, 392 
Abel-Dirichlet test, 375, 376, 378 
Abel’s transformation, 376 
Absolute convergence 
of a series, 374, 375 
of an improper integral, 417
of functions, 374 
Adiabatic, 224 
Adiabatic constant, 226 
Adjoint mapping, 317 
Admissible set, 119 
Alexander horned sphere, 164
Algebra 

exterior, 319, 320 
graded, 314 
Grassmann, 319, 320 
Lie, 72, 348 
of forms, 313 

skew-symmetric, 314 
of functions, 399 
complex, 399 


real, 399 
self-adjoint, 402 
separating points, 400 
Almost everywhere, 114 
Alternation, 314 
Amplitude, 554 
Amplitude modulation, 580 
Analysis, harmonic, 554 
Angular velocity, 69, 70 
Approximate identity, 451
Area 

as the integral of a form, 229-231
Minkowski outer, 195
of a $k$-dimensional surface, 188, 231
of a piecewise-smooth surface, 231
of a sphere in $\mathbb{R}^n$, 193, 440
Asymptotic equality, 590 
Asymptotic equivalence, 590 
Asymptotic estimate, 589 
uniform, 602 
Asymptotic expansion 

in the sense of Erdelyi, 601
uniform, 602 
Asymptotic formula, 590 
Asymptotic methods, 589 
Asymptotic problem, 588 
Asymptotic sequence, 593 
Asymptotic series, 593, 594 
general, 593 

in the sense of Erdelyi, 602 
in the sense of Poincare, 594 
power, 598-600 
Asymptotic zero, 595 
Asymptotics, 588, 589 
of a Bessel function, 631 
of a Laplace integral, 629
of canonical integral, 624
of Legendre polynomials, 614 
of the Fourier integral, 625, 630 
of the gamma function, 613, 620 
of the Laplace integral, 625 
of the probability error function, 618
Atlas 

analytic, 326 

of a surface (manifold), 164, 321 
of class $C^{(k)}$, 326
orienting, 175, 328 
smooth, 326 

Average value of the function in the sense of 
Levy, 664 

Averaging of a function using a kernel, 489 

B 

Ball 

closed, 5 

in $\mathbb{R}^n$, volume, 440
in a metric space, 5 
Band-limited signal, 578 
Base 

in the set of partitions, 110 
of a topology, 10 
Basis of a vector space, 494 
Bernoulli integral, 310 
Bernoulli numbers, 626 
generating function, 626 
Bernoulli polynomials, 548 
Bessel function, 390, 408 
asymptotics, 631

Bessel’s equation, 390, 408, 411, 430 
Bessel’s inequality, 502, 503 
for trigonometric system, 527 
Beta function, 433-435
Bicompact set, 15 
Borel’s formulas, 573 
Boundary, 687 

of a half-space, 179 
of a manifold, 322 
of a p-dimensional cube, 357 
of a surface, 179 
Boundary cycle, 358 
Boundary operator, 690 
Boundary point, 6, 13, 322 
Boundedness 
total, 395 
uniform, 395 

uniform, of a family of functions, 371
Brachistochrone, 93-95 
Bracket, Poisson, 348 
Bundle of tangent paths, 347 


C 

Canonical embedding, 350 
Canonical integral 
asymptotics, 624 
Cardinal sine, 581 
Carnot cycle, 227 

Cartesian, cylindrical, and spherical 
coordinates, 674 
Category of a set, 28 
Cauchy integral, 310 

Cauchy-Bunyakovskii inequality, 448, 499 
Cauchy-Riemann equations, 310 
Central limit theorem, 664, 666 
Chain, 687 

of charts, 330 

contradictory, 330 
disorienting, 330 
of singular cubes, 357 
Chains, 687 

Change of variable in an integral, 137-145, 
157-159 

Change of variables formula, 655 
Channel with noise, 668 
Characteristic 
frequency, 556 
phase, 556 
spectral, 556 
Chart 

local, 163, 321 
parameter domain, 163 
range, 163 
range of, 321 

Charts, consistent, 175, 328 

Chebyshev metric, 3 

Chebyshev polynomials, 518

Chebyshev-Laguerre polynomials, 518

Circulation of a field along a curve, 235, 278 

Class 

orientation, 329 

Closed and exact differential forms, 685 
Closed ball, 5 
Closed form, 353 
Closed set, 13

in a metric space, 5, 6 
in a topological space, 13 
Closure, 6, 13 
Coboundary operator, 690 
Coding of a signal, 668 
Coefficient 

of thermal conductivity, 303 
of thermal diffusivity, 304 
Coefficients 

Fourier, 500-502, 504, 507, 512, 519, 525, 
530, 534, 535, 544, 546, 550 




extremal property, 500 
Lame, 268, 275 

Cohomologous on the manifold, 690 
Cohomology, 689, 690 
Cohomology class, 690 
Cohomology group, 356 
Compact set, 15 
elementary, 160 
in a metric space, 16 
Companion trihedral, 73 
Complete system of vectors, 505-510 
Completeness of the trigonometric system, 539 
Completion of a space, 24-27 
Concentration of measures, 663 
Condition 

necessary, for convergence, 373 
Conditions 

Dini, 528, 563 
Connected set, 19 
Consistent charts, 175, 328 
Constant, cyclic, 298, 360 
Content of a set (Jordan), 121 
Continuity 

and passage to the limit, 382 
of an improper integral depending on a 
parameter, 420-422

of an integral depending on a parameter, 
406, 407 

Continuous group, 72, 336 
Contribution 

of a maximum point, 607 
Contribution of a point to asymptotics, 606 
Convergence 
absolute, 374 

of an improper integral, 417 
in mean, 521, 539 
of a family of functions 
pointwise, 363, 367 
uniform, 367 
of a series of vectors, 499 
of an improper integral, 154 
Cauchy principal value, 157 
of distributions, 459 
of generalized functions, 459 
of linear functionals 
strong (norm), 58 
of test functions, 458 
uniform, 395 

Cauchy criterion, 369, 370 
weak, 459 

Convergence set, 363, 367 
Convolution, 444-466
differentiation, 449
in $\mathbb{R}^n$, 478
multidimensional, 478-488
symmetry, 448 
translation-invariance, 449 
Coordinate parallelepiped, 109
Coordinates 

Cartesian, 259, 261 
curvilinear, 163, 265 
cylindrical, 268-274 
of a tangent vector, 340 
polar, 168 

spherical, 168, 268-274 
triorthogonal, 268-274 
Cotangent space to a manifold, 340 
Covering 

locally finite, 337 

refinement of another covering, 337 
Criterion 

Cauchy, 419 

for uniform convergence, 369, 370, 374 
for uniform convergence of a series, 377 
for uniform convergence of an integral, 
415-417

Darboux, 116-118, 131 
for a field to have a potential, 291
for compactness in a metric space, 17
for continuity of a mapping, 31
Lebesgue, 114-116, 119, 122, 140, 147 
Critical point, 151 
Cube 

boundary of, 357 
singular, 357 

boundary of, 358 
Curl, 204, 260, 275, 680 

physical interpretation, 282, 283
Curl in Cartesian, cylindrical, and spherical 
coordinates, 681 
Current function, 310 
Curvature of a curve, 73 
Curvilinear coordinates and metric, 673 
Cycle 

boundary, 358 
Carnot, 227 
of dimension p, 358 
Cycle of dimension, 687 
Cycles, homologous, 358 
Cycles and boundaries, 686 
Cyclic constant, 298, 360 
Cyclic frequency, 554 
Cylinder, 169 

D 

Darboux integral 
lower, 117 
upper, 117
Darboux sum
lower, 116, 394
upper, 116

De Rham theorem, 686, 690 
Deformation (of a closed path), 294 
Degree (order) of a differential form, 196 
Delta function ($\delta$-function), 281, 445, 450,
456, 457, 459, 464, 469, 479, 490,
552, 583, 586
shifted, 479 
Derivation 

of a ring, 352 
Derivative 
Lie, 352 

of a mapping, 61, 62
of order $n$, 81
partial, 70, 71 
second, 81, 84 
with respect to a vector, 82 
Derivative mapping, 62 
Deviation 

mean-square, 522 
Diffeomorphism, elementary, 142 
Differential 

exterior, 343, 349 

exterior, of a form, 202, 343, 349 

of a mapping, 61 

of order n, 81 

partial, 70, 71 

second, 81, 84 

total, 71

Differential equation with variables separable, 
228 

Differential form, 198 
closed, 297, 353 
exact, 296, 353 
flux, 199 
of class $C^{(k)}$, 342
of compact support, 344 
of order zero, 202 
on a manifold, 341 
on a smooth surface, 209 
restriction to a submanifold, 350 
work, 199 

Differential operator, 481 
adjoint, 481 
self-adjoint, 481 
transpose, 481 
Differentiation, 61 

at a point of a manifold, 348 
of a family of functions depending on a 
parameter, 387-391 
of a Fourier series, 538 
of a generalized function, 461-464


of a power series, 389 
of a series, 389 
of an integral 

over a liquid volume, 491 
of an integral depending on a parameter, 
407-410, 478 
on a manifold, 348, 351 
with respect to a parameter, 423-425
Dimension of a manifold, 321 
Dini conditions, 528 
Dipole, 299, 490 
Dipole moment, 300, 490 
Dipole potential, 300 
Direct product of metric spaces, 8 
Direction 

of circuit around a domain, 178 
Direction of motion along a curve, 178 
Dirichlet discontinuous factor, 432 
Dirichlet integral, 431, 443, 563, 601 
Dirichlet kernel, 525, 551 
Discontinuous factor, Dirichlet, 432 
Discrete group of transformations, 336 
Discrete metric, 2, 8 
Disk, $n$-dimensional, 180, 323
Distance, 654 
Distribution, 456 
regular, 459 
singular, 459 
tempered, 584 

Divergence, 204, 260, 275, 676 
physical interpretation, 279-282 
Divergence in Cartesian, cylindrical, and 
spherical coordinates, 679 

Domain 

elementary, 241 

fundamental, of a group of automorphisms, 
325, 336 

of parameters of a chart, 321 
parameter, 366 
simply connected, 293 
Double layer, 490 

E 

Efficiency of a heat engine, 227 
Eigenvalue, 511 

of a Sturm-Liouville problem, 511
Eigenvector of an operator, 512 
Element of volume, 229 
Elementary diffeomorphism, 142 
Elliptic integral, 392 
complete 

of first kind, 392, 408 
of second kind, 392, 408 
modulus, 408
Embedding

canonical, 350 
Entropy, 302 

Envelope of a family of curves, 253 
Equality 

asymptotic, 590 

Parseval’s, 513, 523, 541, 562, 585 
Equation 

Bessel’s, 390, 408, 411, 430
differential, 35, 228 
Euler-Lagrange, 92 
Euler’s 

hydrodynamic, 307 
heat, 303, 576 
hypergeometric, 392 
Laplace’s, 304, 517 
Mayer’s, 226 
of an adiabatic, 226 
of continuity, 305, 306 
of state, 223 

Poisson’s, 299, 304, 488 
wave, 308, 309, 311, 574
homogeneous, 309 
inhomogeneous, 309 
Equations 

Cauchy-Riemann, 310 
electrostatic, 286 
magnetostatic, 286 

Maxwell, 262, 263, 275, 282, 299, 311 
Equicontinuity, 396 
Equivalence, asymptotic, 590 
Equivalent atlases 

with respect to orientation, 176, 329 
with respect to smoothness, 326 
Erdelyi’s lemma, 625 
Error function 

asymptotics, 618 
Estimate, asymptotic, 589 
uniform, 602 
Euler, 433 

Euler-Gauss formula, 436 
Euler-Lagrange equation, 92 
Euler-Poisson integral, 429, 438, 560, 572, 
601 

Eulerian integral, 433-444
Euler’s formula, 441, 649 
Euler’s hydrodynamic equation, 307 
Exact form, 353, 685 
Exhaustion, 660 
Exhaustion of a set, 152 
Expansion, asymptotic 

in the sense of Erdelyi, 601
uniform, 602 
uniqueness, 594 


Exponential 

of an operator, 71-73 
Exponential function, 648 
Exponential function as a limit, 649 
Exponential integral, 591 
Exponential of a matrix, 650 
Exponential of operators, 650 
Exponential system, 495 
Exterior algebra, 319, 320 
Exterior differential, 202, 343, 349 
of a form, 343, 349 
Exterior point, 6, 13
Exterior product, 196, 315, 342 
Extremum of a function 
necessary condition, 88 
sufficient condition, 88 
with constraint, 107 

F 

Family of functions, 366 
equicontinuous, 396 
at a point, 401, 402 
separating points, 400 
totally bounded, 395 
uniformly bounded, 395 
Fejer kernel, 551
Field 

of forms, 257 
of linear forms, 198 
potential, 288 
scalar, 257 
solenoidal, 296 
tensor, 257 
vector, 257, 348 
smooth, 348 

Filter, low-frequency, 556 
Flow 

planar, 310 
plane-parallel, 310 

Flux across a surface, 215-219, 234, 278, 303, 
491 

Force 

mass, 306 
Form 

anti-symmetric, 196 
differential 

closed, 297, 353 
exact, 296, 353 
flux, 199 

of class $C^{(k)}$, 342
of compact support, 344 
on a manifold, 341 
restriction to a submanifold, 350 
work, 199
Hermitian, 45

nondegenerate, 45 
nonnegative, 45 
on a surface in $\mathbb{R}^n$, 229
semidefinite, 88
skew-symmetric, 196, 314-317
volume in $\mathbb{R}^k$, 229
Form is called closed, 685 
Formula 

asymptotic, 590 
Cauchy-Hadamard, 375 
co-area, 237 

complement, for the gamma function, 437 
Euler-Gauss, 436 
Euler’s, 441 

for change of variable in an integral, 138 
Fourier inversion, 564, 585 
Frenet, 73 
Gauss’, 443 

Gauss-Ostrogradskii, 243-246, 279, 304, 
483, 491

in vector analysis, 278 
Green’s, 491 
homotopy, 361 
Kotel’nikov’s, 579 
Kronrod-Federer, 237 
Legendre’s, 442 
Leibniz’, 407

Newton-Leibniz, 238, 279, 406, 566 
Poisson summation, 586 
reduction 

for the beta function, 434 
for the gamma function, 436 
Stirling’s, 444, 613 
Stokes’, 238, 277, 279, 345-347, 359 
general, 248-251, 345 
in R 3 , 246-248 
in vector analysis, 278 
Taylor’s, 95, 412 
Wallis’, 430, 444, 627 
Formulas 

Borel’s, 573 

differential, of field theory, 263-265 
Green’s, 285, 286 
integral, of vector analysis, 279 
Fourier coefficients 

extremal property, 500 
Fourier cosine transform, 557 
Fourier integral, 624 
asymptotics, 625, 630 
multiple, 632 

Fourier inversion formula, 564, 585 
Fourier series, 493-520 

in a general orthogonal system, 493, 494 


multiple, 550 

of generalized functions, 552 
partial sum 

integral representation, 524 
pointwise convergence, 520 
rate of convergence and smoothness, 534 
Fourier sine transform, 557 
Fourier transform, 555, 556, 559, 564, 567, 
571, 576

asymptotic properties, 566-569 
frequency shift, 580 
in L 2 , 582 
inverse, 576 
multidimensional, 569 
normalized, 561 
of a convolution, 562 
of generalized functions, 585 
rate of decrease and smoothness, 566 
time shift, 580 
Frame 

Frenet, 73 
orienting, 174 
Frequencies, natural, 512 
Frequency, 553 
cyclic, 554 
fundamental, 554 
harmonic, 554 

Frequency characteristic, 556 
Frequency spectrum, 586 
Fresnel integral, 431, 443, 601 
Function 

band-limited, 578 
Bessel, 390, 408 
asymptotics, 631
beta, 433-435
cardinal sine, 581 
change of coordinates, 322 
current, 310 

delta, 445, 450, 456, 457, 459, 464, 469 
Dirichlet, 364 
exponential integral, 591 
fundamental, 479 
gamma, 435-438

asymptotics, 613, 620 
incomplete, 601 
generalized, 456 

differentiation, 461-464
of several variables, 479 
regular, 459 
singular, 459 
generating, 467 
Green’s, 465 
harmonic, 287, 304 
conjugate, 310 
Heaviside, 461, 464, 485, 491
limit, 363, 367 
linear, 49 

locally integrable, 448 
multilinear, 49 

of compact support, 138, 447 
phase, 625 

piecewise continuous, 529 
piecewise continuously differentiable, 529 
probability error, 601 
asymptotics, 618 
rapidly decreasing, 568 
Riemann integrable 
over a set, 120 
over an interval, 111
sample, 581 
sine integral, 601 
spectrum of, 554 
spherical, 517 
support, 447 
system, 445, 456 
test, 458, 479 
transient pulse, 445 
uniformly continuous, 452 
unit step, 461
zeta, 443 
Functional 
linear, 49 
multilinear, 49 

Functions, asymptotically equal, 590 
Functions, asymptotically equivalent, 590 
Functions of a very large number of variables, 
663 

Fundamental domain of a group of 
automorphisms, 325, 336 
Fundamental frequency, 554 
Fundamental sequence, 21 
Fundamental solution, 464-466, 485 
Fundamental tone, 512 

G 

Galilean transformation, 587 
Gamma function, 433, 435-438
asymptotics, 613, 620 
incomplete, 601 
Gauge condition, 312 
Gauss’ formula, 443 
Gauss’ theorem, 491 

Gauss-Ostrogradskii formula, 238, 243-246, 
483, 491

in vector analysis, 278 
Gaussian measures, 667 
General Stokes’s formula, 683, 685, 688 
Generalized function, 456 


differentiation, 461-464
of several variables, 479 
regular, 459 
singular, 459 

Generating function of a sequence, 467 
Gibbs’ phenomenon, 538, 549 
Graded algebra, 314 
Gradient, 204, 260, 275, 283, 675 
physical interpretation, 283
Gradient in Cartesian, cylindrical, and 
spherical coordinates, 676 
Gram matrix, 187 
Grassmann algebra, 319, 320 
Green’s formula, 491 
Green’s function, 465 
Green’s theorem, 238-243 
Group 

cohomology, 356 
continuous, 72, 336 
discrete, of transformations, 336 
homology, 298, 357, 358 
p-dimensional, 358 
homotopy, 298 
Lie, 72, 336 
of automorphisms, 325 
one-parameter, 351
topological, 72, 336 

H 

Haar system, 519 

Hamilton operator (nabla), 262, 265 
Harmonic analysis, 554 
Harmonic frequency, 554 
Harmonic function, 287, 304 
conjugate, 310 
Harmonic polynomials, 517 
Hausdorff space, 12, 14 
Heat capacity, 224 
molecular, 225 
Heat engine, 226 
Heat equation, 303, 576 
Heaviside function, 464, 485, 491 
Hermite polynomials, 518 
Hermitian form, 45 
nondegenerate, 45 
nonnegative, 45 
Hilbert’s fifth problem, 337 
Homeomorphism, 31 
Homological cycles, 688 
Homologous cycles, 358 
Homologous on the manifold, 690 
Homology, 358, 689, 690 
Homology class, 690 
Homology group, 298, 357 
Homotopic paths, 294
Homotopy, 294 
Homotopy formula, 361 
Homotopy group, 298 
Homotopy identity, 352 
Hypergeometric equation, 392 
Hypergeometric series, 391 

I


Identity 

approximate, 451
homotopy, 352 
Jacobi, 72 

Improper integral, 153 

depending on a parameter, 405 
with variable singularity, 474-478
Improper integral depending on a parameter 
Abel-Dirichlet test, 418 
Cauchy criterion, 419 
continuity, 420-422
limiting passage, 420-422
uniform convergence, 427 
Weierstrass’ M-test, 427 
Induced atlas of the boundary, 687 
Induced orientation, 687 
Induced orientation on the boundary of a 
surface, 182 

Inequality 

Bessel’s, 502, 503 

for trigonometric system, 527 
Brunn-Minkowski, 123 
Cauchy-Bunyakovskii, 46, 448, 499 
Clausius, 227 
Holder’s, 128 

isoperimetric, 195, 543-545 
Minkowski’s, 44, 128, 489 
generalized, 489 
Steklov’s, 549 
triangle, 1 
Wirtinger’s, 549 
Inertia, 307 

Inner product, 45-48, 352
of a field and a form, 352 
Instantaneous axis of rotation, 70 
Integral, 110 
Bernoulli, 310 
canonical 

asymptotics, 624 
Cauchy, 310 
Darboux 
lower, 117
upper, 117

depending on a parameter, 405-412
continuity, 406, 407 


differentiation, 407-410
integration, 410, 411 
Dirichlet, 287, 431, 443, 563, 601 
double, 111
elliptic, 392 

complete, of first kind, 392, 408, 430 
complete, of second kind, 392, 408 
Euler-Poisson, 429, 438, 560, 572, 601 
Eulerian, 433-444 
first kind, 433 
second kind, 433 

Fourier, 555, 557, 560, 564, 571, 578, 581, 
624 

asymptotics, 625, 630 
multiple, 632 
Fresnel, 431, 443, 601 
Gauss’, 253 
improper 

differentiation with respect to a 
parameter, 423-425 
integration with respect to a parameter, 
425-429
iterated, 129-131 
Laplace, 565, 604, 624 
asymptotics, 625, 629 
Lebesgue, 393 
line, 214 
multiple, 111

depending on a parameter, 471-492
with variable singularity, 476 
of a differential form 

over a surface, 217, 220 
of a form on a manifold, 344, 345 
of a function over a surface, 228, 233 
over a chain, 359 
over a set, 119 
over a singular cube, 359 
Poisson, 455, 581 
Raabe’s, 442 
Riemann, 393 
over a set, 119 
over an interval, 110 
surface 

of first kind, 233 
of second kind, 234 
triple, 111

Integral criterion for the exactness of a closed 
form, 691 

Integral criterion for the exactness of forms,
686

Integral metric, 4 

Integral operator, 561 

Integral representation of the partial sum of a 
Fourier series, 524 
Integral transform, 561

Integration by parts in a multiple integral, 255 
Integration of an integral depending on a 
parameter, 410, 411
Integration with respect to a parameter, 
425-429 
Interchange 

of differentiation and passage to the limit, 
387-391 
of integrals, 129 

improper, 425-429
proper, 129 

of integration and passage to the limit, 
385-387 

of limiting passages, 381 
of summation and differentiation of a 
series, 389 
Interior point, 6, 13 
Interval in R^n, 109
Isobar, 224 
Isochore, 224 

Isometry of metric spaces, 24 
Isomorphism 

of normed vector spaces, 60 
of smooth structures, 335 
Isoperimetric inequality, 195, 543-545 
Isotherm, 224 
Iterated integral, 129-131 

J 

Jacobi identity, 72 
Jacobian of a coordinate change 
cylindrical coordinates, 270 
general polar, 168 
spherical coordinates, 270 
triorthogonal coordinates, 270 
Jordan measure, 121 

K 

Kernel 

Dirichlet, 525, 551 
Fejer, 551 
Poisson, 467 
Klein bottle, 170, 325 
Kotel’nikov’s formula, 579 

L 

Lagrange’s theorem, 309 
Laplace integral, 565, 604, 624 
asymptotics, 625, 629 
Laplace transform, 604 
Laplace’s equation, 304, 517 


Laplace’s method, 603-606 
multidimensional, 623 
Laplacian, 264, 265, 274, 488 
Law 

Ampere’s, 235 
Archimedes’, 245 
Biot-Savart, 237 
Coulomb’s, 280 
Faraday’s, 235 
Gauss’, 287 

Newton’s, 289, 306, 307 
normal distribution, 466 
of conservation of mass, 280 
Law of large numbers, 664, 665 
Layer 

double, 490 
single, 488 

Legendre polynomials, 497, 498, 510, 614 

Legendre's formula, 442 

Lemma 

Erdelyi’s, 625, 630, 632 
exponential estimate, 606, 615, 627 
Hadamard’s, 412 
Morse’s, 151, 406, 624, 628, 631 
nested ball, 27 

on continuity of the inner product, 499 
on finite ε-grids, 17
on orthogonal complement, 502 
Poincare’s, 297 

Riemann-Lebesgue, 526, 563, 629 
Sard’s, 151 
Watson’s, 622, 625 
Lemma due to Levy, 665 
Lie algebra, 72, 348 
Lie derivative, 352 
Lie group, 72, 336 
Limit, 21, 28 

of a family of continuous functions, 
382-385 

of a family of functions, 369 
of a mapping, 28 
of a sequence, 21
of a sequence of functions, 363 
Limit function, 363, 367 
Limit point, 5, 7, 13 
Limiting passage 
interchange, 381 

under a differentiation sign, 387-391 
under an integral sign, 385-387 
Linear transformation, 49
Local chart, 321 
Local maximum, 88 
Local minimum, 88 
Localization principle 
for a Fourier series, 526
for a Laplace integral, 606 
Locally integrable function, 448 
Lorentz transformation, 587 
Low-frequency filter, 556 

M 

M-test for convergence, 374 
Manifold, 321-325 
analytic, 326 
compact, 324 
connected, 324 
contractible, 353 
embedded in R^n, 163
nonorientable, 329 
of class C^(k), 326
orientable, 329 
oriented, 329 
smooth, 326 
topological, 326 
with boundary, 323 
without boundary, 323 
Mapping 

adjoint, 317 
bounded, 29 
continuous, 30-34 
at a point, 30 

continuously differentiable, 76 
contraction, 35 
derivative, 62 

higher-order, 81
of order n, 81

differentiable at a point, 61, 62
differentiable on a set, 62 
homeomorphic, 31 
linear, 49 
multilinear, 49 
of class C^(k), 326
partial derivative, 70 
smooth, 326 
tangent, 61, 339, 349 
ultimately bounded, 29 
uniformly continuous, 33 
Mass force, 306 
Maximum, local, 88 

Mean convergence and completeness, 539, 541 
Mean value over a period, 554 
Mean-square deviation, 3, 522 
Measure 

of a set (Jordan), 121 
of an interval, 112 
Measure zero, 112, 113, 119 
Median value of the function, 664 
Method 


asymptotic, 589 
Fourier, 511-513 
Laplace’s, 603-606 

multidimensional, 623 
of Lagrange multipliers, 107 
of least squares (Gauss’), 513 
of separating singularities (Krylov’s), 537 
of tangents, modified 

(Newton-Kantorovich), 39 
of tangents (Newton’s), 38 
separation of variables, 511-513 
stationary phase, 624, 625 
multidimensional, 631
one-dimensional, 629 
Steklov’s averaging, 513 
Method of undetermined coefficients, 652 
Metric, 1 

Chebyshev, 3 
discrete, 2, 8 
integral, 4 

of mean-square deviation, 3, 4 
of uniform convergence, 4, 395, 398 
Riemannian, 267 
Metric space, 1 
separable, 15 

Metric spaces, direct product, 8 
Minimum, local, 88 

Mobius band, 170, 177, 181, 185, 328, 331 
Modulus 

of an elliptic integral, 408 
Moment 

dipole, 300, 490 
multipole, 300 
of a function, 402 
Morse’s lemma, 624, 628, 631 
Multidimensional geometry, 663 
Multidimensional intervals, 666 
Multidimensional sphere, 664 
Multilinear transformation, 49 
Multiple integral 

depending on a parameter, 471-492 
improper, 153 
integration by parts, 255 
iterated, 129-131 
with variable singularity, 471, 474
Multiplication of generalized functions, 470 
Multipole, 300 
Multipole moment, 300 
Multipole potential, 300 

N 

Nabla (Hamilton operator), 262, 265 
Natural frequencies, 512 
Natural oscillations, 512 
Natural parametrization, 73
Necessary condition for uniform convergence, 
373 

Neighborhood, 13 

in a metric space, 5, 6, 9 
in a topological space, 11
of a germ of functions, 11
Newton-Leibniz formula, 566, 683, 685 
Newton’s binomial, 651 
Norm 

in a vector space, 42, 582 
of a transformation, 52 
of a vector, 42, 582 
Null-series, Men’shov’s, 524 
Numbers, Bernoulli, 626 
generating function, 626 

O 

One-parameter group, 351 
Open set 

in a metric space, 5 
in a topological space, 9 
Operational calculus, 461
Operator 

differential, 481 
adjoint, 481 
self-adjoint, 481
transpose, 481 
Hamilton (nabla), 262, 265 
integral, 561 

Laplace, 264, 265, 274, 488 
nilpotent, 71 
of field theory, 260 

in curvilinear coordinates, 265-275 
symmetric, 512 
translation, 445 
translation-invariant, 445 
Operators grad, curl, div in curvilinear 
coordinates, 674 
Orbit of a point, 325, 336 
Order (degree) of a differential form, 196 
Orientation 

induced on the boundary of a manifold, 331
of a domain of space, 174 
of a manifold, 328 
of a surface, 172-178, 182 
of the boundary of a surface, 182
opposite to a given orientation, 174 
Orientation class, 329 
of atlases 

of a surface, 176 
of coordinate systems, 173, 174 
of frames, 172 
Oriented space, 172 


Orienting frame, 174 
Orthogonal vectors, 494, 512 
Orthogonality with a weight, 515 
Orthogonalization, 497, 498 
Oscillation of a mapping, 30 
at a point, 32 
Oscillations, natural, 512 
Overtones, 512 

P 

Pairing of homology and cohomology classes, 
690 

Parallelepiped, coordinate, 109 
Parameter domain, 163, 321, 366 
Parameter set, 366 
Parameters 

Lame, 268, 275 

Parseval’s equality, 513, 523, 562, 585 
Partition 

locally finite, 190 
of an interval, 110 

with distinguished points, 110 
of unity, 150, 332-334 
k-smooth, 332

subordinate to a covering, 333 

Paths 

homotopic, 294 
tangent, 347 

Period, over a cycle, 360 
Period of an integral, 298 
Phase, 554, 625 
stationary, 625 
Phase characteristic, 556 
Phase function, 625 
Piecewise continuous function, 529 
Piecewise continuously differentiable function, 
529 

Planar flow, 310 
Plancherel's theorem, 583 
Plane 

projective, 327 
Plane-parallel flow, 310 
Point 

boundary, 6 

in a topological space, 13
boundary (of a manifold), 322 
boundary (of a surface), 179 
critical, 151 
exterior, 6 

in a topological space, 13
interior, 6, 13 

in a topological space, 13
limit, 5, 7, 13
of a metric space, 1 
Poisson bracket, 348 
Poisson integral, 455, 581 
Poisson kernel, 467 
Poisson’s equation, 299, 304, 488 
Polar coordinates, 167-169 
Polynomial 

trigonometric, 520 
Polynomials 
Bernoulli, 548 
Chebyshev, 518
Chebyshev-Laguerre, 518
harmonic, 517 
Hermite, 518 

Legendre, 497, 498, 510, 614 
Positive direction of circuit, 178 
Potential 

dipole, 300 
multipole, 300 
of a field, 288 
quadrupole, 300 
scalar, 296 
single-layer, 488 
vector, 296 

of a magnetic field, 296 
velocity, 309 
Potential field, 288 
Power series, 374 

differentiation of, 389 
Primitives, 685 

Principal value (Cauchy) of an integral, 157 
Principle 

Cavalieri’s, 134 
contraction mapping, 37 
d'Alembert’s, 307 
Dirichlet’s, 287 
fixed-point, 35, 38 
localization 

for a Fourier series, 526 
for a Laplace integral, 606 
Picard-Banach, 35, 38 
stationary phase, 624, 625, 631 
uncertainty, 583 
Probability error function, 601 
asymptotics, 618 
Problem 

asymptotic, 588 
brachistochrone, 93, 95 
curve of most rapid descent, 93 
Luzin’s, 524 
Riemann, 524 
shortest-time, 93 
Sturm-Liouville, 516 


Process 

adiabatic, 224 
quasi-static, 224 
Product 

exterior, 315, 342 
inner, 45-48, 352

of a field and a form, 352 
of functions, 515 
of generalized functions, 470 
of manifolds, 321 
of topological spaces, 13 
tensor, 314 
Projective line, 326 
Projective plane, 327 
real, 327 

Properties of smooth mappings, 656 
Pulse 

rectangular, 580 
triangular, 581 

Q 

Quadrupole, 300 
Quadrupole potential, 300 
Quantities of the same order, 590 

R 

Raabe’s integral, 442 
Rademacher system, 519 
Random vectors, 663 
Range of a chart, 163, 321 
Rapidly decreasing function, 568 
Rectangular pulse, 580 
Restriction of a form to a submanifold, 209, 
350 

Riemann sum, 110

Riemann-Lebesgue lemma, 526, 563, 629 
Riemannian metric, 267 
Rule, Leibniz, 136, 407, 449 

S 

Sample function, 581 

Sard’s lemma, 151 

Sard’s theorem, 237 

Scalar potential, 296 

Schwartz space, 585 

Schwarz boot, 195 

Separable metric space, 15

Separation of points by functions, 400 

Separation of variables, 511-513 

Sequence 

asymptotic, 593 
Cauchy, 21 
convergent, 21 
uniformly, 369 
convergent at a point, 363
convergent on a set, 363 
fundamental, 21 
monotonic, of functions, 376 
nondecreasing, of functions, 376 
nonincreasing, of functions, 376 
Series 

asymptotic, 593, 594 
general, 593 

in the sense of Erdelyi, 602 
in the sense of Poincare, 594 
power, 598-600 
continuity of sum, 383 
Dirichlet, 379 
Fourier, 493-520 

in a general orthogonal system, 493, 
494 

multiple, 550 

of generalized functions, 552 
partial sum, 524 
pointwise convergence, 520 
rate of convergence and smoothness, 
534 

hypergeometric, 391 
of functions, 366 
power, 374 
Stirling’s, 626 
trigonometric, 520-553 
uniformly convergent 
Cauchy criterion, 377 
Set 

admissible, 119 
bicompact, 15 
Cantor, 23 
closed 

in a metric space, 5, 6 
in a topological space, 13
compact, 15 

conditionally compact, 18

everywhere dense, 12

Jordan measurable, 121 

nowhere dense, 28 

of area zero, 191 

of content zero, 122 

of convergence, 363, 367 

of first category, 28 

of measure zero (Jordan), 122 

of measure zero (Lebesgue), 119, 122, 123

of second category, 28 

of volume zero, 122 

open 

in a metric space, 5 
in a topological space, 9 
parameter, 366 


relatively compact, 18
totally bounded, 18
Signal 

band-limited, 578 
spectrum of, 554 
Simply connected domain, 293 
Sine, cardinal, 581 
Sine integral, 601 
Single layer, 477 
Single-layer potential, 488 
Singular cube, 357 
boundary of, 358 
Smooth structure, 326 
Smooth structures, isomorphic, 335 
Solenoidal field, 296 
Solution 

fundamental, 464-466, 485
of the Laplacian, 485 

Space 

Banach, 43 
complete, 21 
connected, 19 

cotangent to a manifold, 340 
Euclidean, 48 
Hausdorff, 12, 14 
Hermitian (unitary), 48 
Hilbert, 48 
locally compact, 18
locally connected, 20 
metric, 1 

separable, 15 
normed affine, 62 
normed vector, 42 
complete, 43 
of distributions, 458 
of fundamental functions, 479 
of generalized functions, 458, 479 
of tempered distributions, 584 
of test functions, 458, 479 
path connected, 20, 34 
pre-Hilbert, 48 
Schwartz, 584, 585 
Sobolev-Schwartz, 460 
tangent, 61 
tangent to R^n, 337
tangent to a manifold, 337, 339 
topological, 9, 10 

in the strong sense, 14 
T_1, 14
T_2, 14

Spectral characteristic, 556 
Spectrum 

bounded, 578 
continuous, 555 
discrete, 554
frequency, 586 

Sphere, Alexander horned, 164 
Spherical functions, 517 
Stabilizer of a point, 336 
Stationary phase method, 624, 625 
multidimensional, 631 
one-dimensional, 629 
Stationary phase principle, 625 
Steklov’s inequality, 549 
Stirling’s formula, 444, 613 
Stirling’s series, 626 
Stokes’ formula, 238, 345-347, 359 
general, 248-251, 345 
in R^3, 246-248
Structure 

smooth, 326 

Sturm-Liouville problem, 516 
Submanifold, 335 

Subset, everywhere dense in C([a, b]), 398
Subspace 

of a metric space, 7 
of a topological space, 13 
Sum 

Darboux 
lower, 116 
upper, 116 
Riemann, 110 
Summation method 
Abel, 383, 384, 392 
Cesaro, 393 
Support 

of a differential form, 344 
of a function, 138, 447 
Surface 

elementary, 164 
k-dimensional, 163
smooth, 174 
nonorientable, 176 
of dimension k, 179 
of measure (area) zero, 191
one-sided, 177 
orientable, 176, 177 
oriented, 176 
piecewise smooth, 184
orientable, 185 
two-sided, 177 
with boundary, 179 
without boundary, 180
zero-dimensional, 184 
Surface integral 
of first kind, 233 
of second kind, 234 
Symmetric operator, 512 


System 

exponential, 495 
Haar, 519 
Rademacher, 519 
trigonometric, 495 
completeness, 539 
in complex notation, 495 
System function, 445, 456 
System of sets 

locally finite, 337 
refinement of another system, 337 
System of vectors 
complete, 505-510 
condition for completeness, 506 
linearly independent, 494 
orthogonal, 493-499
orthonormal, 494 
orthonormalized, 494 

T 

Tangent mapping, 339, 349 
Tangent paths, 347 
Tangent space 

to R^n at a point, 337
to a manifold, 337, 339 
Tangent vector to a manifold, 338, 340, 349 
Taylor’s formula, 650 
Tempered distribution, 584 
Tensor product, 314 
Test 

Abel-Dirichlet, 375 
Weierstrass’, 374, 375 
for integrals, 422 
Test function, 458 
Theorem 
Abel’s, 379 

Arzela-Ascoli, 395-397 
Brouwer 

fixed-point, 243 
invariance of domain, 322 
Carnot’s, 227 

Cauchy’s multidimensional mean-value,
288

curl, 284 
Darboux’, 117 
de Rham’s, 360 
Dini’s, 384, 426 
divergence, 284 
Earnshaw’s, 286
Fejer’s, 531, 551 
finite-increment, 74-80 
Fubini’s, 129-131, 572 
Gauss’, 285, 491 
gradient, 284 
Green’s, 238-243
Hardy’s, 393 
Helmholtz’, 302 
implicit function, 97-108 
inverse function, 106 
Kotel’nikov’s, 578 
Lagrange’s, 309 
Lebesgue’s 

dominated convergence, 393 
monotone convergence, 394 
mean-value 

for harmonic functions, 287, 288 
for the integral, 127 

on asymptotics of a Laplace integral, 612
on existence of solutions of differential 
equations, 37 

on interchange of limiting passages, 381 
on the principal term of asymptotics of an 
integral, 612 

on uniform continuity, 33 
Plancherel’s, 583 
Poincare’s, 353, 357 
Pythagorean, 498 

for measures of arbitrary dimension, 
194 

sampling, 578 
Sard’s, 237 
Stone’s, 399-401

for complex algebras, 403 
Tauberian, 393 
translation, 580 
Weierstrass' 

on approximation by polynomials, 398, 
467, 488 

on approximation by trigonometric 
polynomials, 478, 533 
Whitney's, 171, 335 
Whittaker-Shannon, 578 
Timbre, 512 
Tone, fundamental, 512 
Topological group, 72, 336 
Topological space, 9, 10 
base of, 10 

in the strong sense, 14 
weight of, 11

Topological spaces, product of, 13 
Topology 
base of, 10 

induced on a subspace, 13 
on a set, 9 
stronger, 14 
Torsion of a curve, 74 
Torus, 169 

Total boundedness, 395 


Transform 

Fourier 

asymptotic properties, 566-569 
frequency shift, 580 
in L_2, 582

multidimensional, 569 
of a convolution, 562 
of generalized functions, 585 
rate of decrease and smoothness, 566 
time shift, 580 
Fourier cosine, 557 
Fourier sine, 557 
integral, 561 
Laplace, 604 
Transformation 
Abel’s, 376 
bilinear, 49 
bounded, 53 
continuous 

multilinear, 55 
Galilean, 587 
linear, 49 
Lorentz, 587 
multilinear, 49, 52, 55 
trilinear, 49 

Transient pulse function, 445 
Translation operator, 445 
Translation theorem, 580 
Translation-invariant operator, 445 
Triangle inequality, 1 
Triangular pulse, 581 
Trigonometric polynomial, 520 
Trigonometric series, 520-553 
Trigonometric system, 495 
completeness, 539 
in complex notation, 495 
Trihedral, companion, 73 
Triorthogonal coordinates, 268 

U 

Uncertainty principle, 583 
Uniform boundedness, 371, 395 
of a family of functions, 375 
Uniform convergence, 395 
Cauchy criterion, 369, 370 
Uniformly convergent series 
Cauchy criterion, 377 
Unit step, 461 

V 

Value (Cauchy principal) of an improper 
integral, 157 

Vector 

angular velocity, 70 
tangent to a manifold, 338, 340, 349
Vector field 
central, 215 
on a manifold, 348 
smooth, 348 
Vector potential, 296 

of a magnetic field, 296 
Vector space, tangent to a manifold, 337, 
Vectors 

orthogonal, 494, 512 
Volume 

of a ball in R^n, 193, 440
of a set (Jordan), 121, 246 
of an interval, 109 
Volume element, 229 


W 

Wallis’ formula, 430, 444, 627 
Watson’s lemma, 622, 625 
Wave equation, 308, 309, 311, 574 
homogeneous, 309 
inhomogeneous, 309 
Weak convergence, 459 
Weierstrass M-test, 374 
Weight of a topological space, 11
Wirtinger’s inequality, 549 
Work of a field, 213 

Z 

Zeta function, 443 